透過您的圖書館登入
IP:3.144.86.138
  • 學位論文

基於深度學習之中文詞性標記研究與實現

Research and Implementation of Chinese Part-of-Speech Tagging Based on Deep Learning

指導教授 : 黃乾綱
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


近年來,隨著人工智慧的快速發展,深度學習(Deep Learning)的技術也隨之蓬勃發展,並廣泛應用在各個領域,包括自然語言處理(Natural Language Processing,簡稱NLP)。   詞性標記(Part-of-Speech tagging,簡稱POS tagging)是自然語言處理中的一項基礎任務,為句子中的每個詞都標上一個詞性類別的過程,是幫助電腦理解語言含義的關鍵。   本論文主要針對現有基於深度學習的中文詞性標記方法,設計一個改善其模型方法並提升其標記準確率的詞性標記模型,採用Word2vec模型訓練詞嵌入(Word Embedding),並結合基於雙向長短期記憶網路(Bidirectional Long Short-Term Memory Network,簡稱BLSTM)的字符嵌入(Character Embedding)作為詞向量表示方法(Word Representation),再送入雙向長短期記憶網路模型提取上下文的特徵,進行詞性標記的任務。實驗結果顯示,使用此模型在中國大陸《人民日報》1998年1月份語料庫的詞性標記之整體準確率為96.28%,與未加入字符嵌入的基線模型(Baseline Model)相比提升0.76%;且未知詞(Out-of-Vocabulary,簡稱OOV)的詞性標記之準確率為81.51%,與基線模型相比提升10.81%。

並列摘要


In recent years, with the rapid development of artificial intelligence, deep learning technology has also been widely applied to various fields, including natural language processing (NLP).   Part-of-Speech tagging (POS tagging) is a basic task in NLP. It is a process of marking up a word in a text (corpus) with a particular part of speech to help the computer understand the meaning of the language.   This thesis focuses on improving the model and the accuracy of the existing Chinese POS tagging model based on deep learning. The improved model uses bidirectional long short-term memory (BLSTM) to extract the context features applied to the Chinese POS tagging. The input is word representation which is the concatenation of word embedding trained by Word2vec model and the character embedding trained by BLSTM. Experimental results show that the overall accuracy of POS tagging of the corpus in the People’s Daily in China in January 1998 achieves 96.28%, which is 0.76% higher than the baseline model without the character embedding; the accuracy of the POS tagging of out-of-vocabulary (OOV) is 81.51%, which is 10.81% higher than the baseline model.

參考文獻


[1] Krizhevsky, A., I. Sutskever, and G.E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks, in Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1. 2012, Curran Associates Inc.: Lake Tahoe, Nevada. p. 1097-1105.
[2] Hinton, G., et al., Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups. IEEE Signal Processing Magazine, 2012. 29(6): p. 82-97.
[3] Kumar, A., et al. Ask Me Anything: Dynamic Memory Networks for Natural Language Processing. arXiv e-prints, 2015.
[4] 維基百科編者. 自然語言. 2018-10-26]; Available from: https://zh.wikipedia.org/w/index.php?title=%E8%87%AA%E7%84%B6%E8%AF%AD%E8%A8%80&oldid=51788729.
[5] 維基百科編者. 自然語言處理. 2019-05-03]; Available from: https://zh.wikipedia.org/w/index.php?title=%E8%87%AA%E7%84%B6%E8%AF%AD%E8%A8%80%E5%A4%84%E7%90%86&oldid=54272656.

延伸閱讀