假新聞依據型態,動機或寫作風格有著不同的寫法。先前的假新聞欺騙檢測的相關研究使用人工方式提取特徵,這樣的方法受限於人類自身的語言理解能力。即便這樣,假新聞中內的語言變異特徵的提取還是有其困難。在本研究中,我們探討了使用自動提取重要語義特徵方法的可能性。這些被提取的語意特徵不受限於人類本身的語言理解,同時我們也探討是否這些方法可以捕獲演變中的語言變異性。我們的實驗結果顯示,我們的模型可以與使用傳統機器學習並由人工進行特徵篩選的模型達到相當的效果。
Fake news articles are differently written, depending on the type, motivation and writing style. Previous work in deception detection in fake news use features that are manually made and are limited to predefined human understandings of linguistics. That being said, it is difficult to extract the shifts in linguistic variability in fake news articles. In this work, we investigate the possibility of using a method that will be able to automatically extract important linguistic-based features. The extracted linguistic-features are not limited to our understandings of linguistics, and we will investigate if they can to capture evolving linguistic variability in fake news. Our experimental results show that our model achieves results that are comparable to the models that use traditional machine learning, which are limited to manual feature selection.