透過您的圖書館登入
IP:18.117.102.205
  • 學位論文

具長短期記憶之序列特徵選取方法

Feature Selection with Long Short-Term Memory from Sequential Data

指導教授 : 蕭舜文

摘要


由單個物件有序組成的序列型資料在我們的日常生活中被廣泛的應用,如文本、視頻、語音信號和網站使用日誌……等。通常資料分析需要大量的人工和時間,然而近年來,神經網路在分類和各種自然語言處裡任務方面有很好的表現,儘管這些技術已經很完善,但是我們很難理解這些技術是使用什麼樣的訊息來實現其目標,如果只是一個簡單的分類器可能沒辦法達成知道是什麼樣的訊息這樣的需求,因此,我們提出了一個基於神經網路的特徵過濾器,用於分析序列型資料,以便從原始資料中過濾出有用也人類可讀的訊息,並在之後用於分類。 本文中,我們設計了一個神經網路框架 - filteRNN,該框架有一個過濾器的結構,透過這個過濾器,我們可以過濾有價值、人類可讀的特徵以進行後續分類,我們使用了惡意軟體及評論的文本資料來展示從原始資料過濾的功能,並將過濾後的資料輸入分類器進行分類,這個模型能夠過濾掉一半的原始數據。因為過濾器和分類器在這個框架中很重要,因此我們也透過嘗試不同的過濾器和分類器來檢查框架的有效性,同時我們也將注意力模型用來跟我們的框架進行比較。實驗結果顯示,我們可以提取不同序列型資料中,各類別的共有特徵,以供進一步研究。

並列摘要


Sequential data which consists of an ordered list of single object is in a wide range of applications in our daily life, such as texts, videos, speech signals and web usages logs. In general, the analysis of data requires a lot of human work and time. In recent years, neural networks (NNs) have achieved state-of-the-art performance in classification and a variety of NLP tasks. Although such techniques are well-developed, it is difficult for us to understand what information is used for reaching its goal. Such needs may not be satisfied by a simple classifier; hence we proposed an NN-based characteristics filter for analyzing sequential data in order to filter useful and human-readable information from the raw data for a further classifier. In this paper, we design an NN framework (filteRNN) which embeds a filter structure that can filter valuable, human-readable features for latter classification. We use the datasets of malwares and reviews to demonstrate the capability of filtering data from raw data, and the filtered data are fed into a classifier for classification. The models are able to filter out half of the raw data. Since filters and classifiers play key roles in this task, we implement different filters and classifiers to examine the effectiveness of the framework. Besides, attention model is used to do the comparison with our framework. Experimental results indicate that we can extract the commonly shared characteristics of categories in different sequential datasets for further study.

參考文獻


[1]Y. Lecun, L. Bottou, Y. Bengio and P. Haffner, “Gradient-based learning applied to document recognition”,Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.Available: 10.1109/5.726791.
[2]I. Sutskever, O. Vinyals and Q. Le, “Sequence to sequence learning with neural networks”,in Advances in Neural Information Processing Systems, 2014, pp. 3104-3112.
[3]A. Rush, S. Chopra and J. Weston, “A Neural Attention Model for Abstractive Sentence Summarization”,arXiv.org, 2019. [Online]. Available: https://arxiv.org/abs/1509.00685.[Accessed: 18- Apr- 2019].
[4]D. Golub and X. He, “Character-level question answering with attention”,arXiv.org, 2016.[Online]. Available: https://arxiv.org/abs/1604.00727. [Accessed: 18- Apr- 2019].
[5]R. Agrawal and R. Srikant, “Mining sequential patterns”,Proc. of the 11th Int. Conf. on Data Engineering (ICDE), 1995, pp. 3-14. Available: 10.1109/icde.1995.380415 [Accessed18 April 2019].

延伸閱讀