強化深度學習對於自然語言處理的強韌度-以假新聞偵測為例

因為互聯網與社群媒體的推波助瀾，網路新聞已經成為重要的新聞來源。近幾年因為對抗式攻擊研究議題興起，使得運用深度學習模型偵測假新聞的辨識正確性備受挑戰。本研究嘗試透過 TFIDF、TextRank、KeyBERT 等文字探勘方法，以及測試模型輸出 LogitOut 方法，找到文本中容易受到 TextFooler 擾動的標的，再將找到的關鍵單詞進行同義詞置換生成模擬對抗樣本，透過對抗式訓練的方式強化 BERT 假新聞判別器對於 TextFooler 攻擊的強韌度。實驗結果發現：(1) 文字探勘方法中 KeyBERT 較能找出 TextFooler 攻擊單詞，而模型輸出 LogitOut 又明顯優於文字探勘方法。(2) 關鍵字搜尋方法對於 TextFooler 攻擊單詞命中率越高，越能透過同義詞置換生成模擬對抗範例，並藉由訓練模擬對抗範例後提升 BERT 假新聞判別器對於 TextFooler 對抗式攻擊的強韌度。

關鍵字

假新聞偵測；對抗式攻擊；假新聞偵測

並列摘要

In recent years, the research of adversarial attack has emerged, making the fake news detection by using deep learning method challenging again. In this study, we try to increase the robustness of BERT fake news detector against TextFooler by training simulated adversarial samples. To generate simulated adversarial samples, we use both text mining method such as TFIDF, TextRank, KeyBERT and method by testing model ouput (LogitOut) combining with synonyms replacement strategy. The experimental results found that (1) KeyBERT is more capable of identifying the attacked subject by TextFooler comparing with other text mining methods, and testing model output(LogitOut) method is much better than text mining methods. (2) The robustness of BERT fake news detector against TextFooler can be improved after adding the simulated adversarial examples mentioned above.

並列關鍵字

Fake news detection ； Adversarial attack ； Adversarial Defence ； TextFooler

參考文獻

[1] Nic Newman, Richard Fletcher, and David A. L. Levy, et al. digital-newsreport2016. Digital Journalism. https://reutersinstitute.politics.ox.ac.uk/

Google Scholar

our-research/digital-news-report-2016, 2016.

Google Scholar

[2] Edson C., Tandoc Jr., and Zheng Wei Lim, et al. Defining fake news. Digital Jour-nalism. https://doi.org/10.1080/21670811.2017.1360143, 2018.

Google Scholar

[3] Ashish Vaswani, Noam M. Shazeer, and Niki Parmar, et al. Attention is all you need.

Google Scholar

arXiv preprint arXiv:1706.03762, 2017.

Google Scholar

主題瀏覽