透過您的圖書館登入
IP:3.21.104.109
  • 學位論文

主題人物互動網絡建構之研究

A Study of Constructing Topic Person Interaction Network

指導教授 : 陳建錦
共同指導教授 : 許聞廉(Wen-Lian Hsu)

摘要


一個主題事件是由特定的時間、地點以及人物所構成,因此探索主題事件中人與人的互動關係能夠協助使用者建構起該主題事件的背景知識,進而快速理解該主題文章中所描述的內容。不同於以往的研究,本研究所進行的主題人物互動關係研究並不局限於一特定關係,或尋找一特定關係的關係擷取規則,且此互動關係是會隨著主題事件的不同而改變。在探索某主題下之人物互動關係時,主題文章透過互動關係偵測方法分割成多個片段,並且判斷哪些片段是含有主題人物互動關係存在,接著應用資訊擷取演算法從互動文句片段中抽取互動關係的組成元素,進而建構出主題人物互動網絡。 本論文將辨識互動文句片段辨識視為一個分類的問題,為了探索不同知識對辨識主題人物互動關係之效益,提出了一個以特徵為基礎的互動文句片段辨識方法,該方法共包含了19種不同特徵知識,考量了語法結構、文章脈絡以及語意資訊。在探索不同知識對本研究之效益後,為了能有效地表達與整合不同知識之結構,本論文進而提出了一個豐富互動樹狀結構,透過此樹狀結構有效地整合句法結構、文章脈絡以及語意資訊,並透過樹狀核心方法學習該結構,藉此捕捉人際互動關係描述。本研究蒐集大量的新聞事件進行效能評估,根據實驗結果顯示,結合不同的特徵知識能夠有效地提升辨識人際互動關係之效能,且本研究所提出的豐富互動樹狀結構能有效地偵測主題人物互動,而其效能也優於其他的比較方法;此外,根據案例分析之結果顯示,本研究方法確實能有效地建構主題人物互動關係網絡,此一網絡呈現出人際互動關係之多樣且變動之特性,並能提供讀者快速掌握該主題之背景知識。

並列摘要


The development of a topic in a set of topic documents is constituted by a series of person interactions at a specific time and place. Knowing the interactions of the persons mentioned in these documents is helpful for readers to better comprehend the documents. To discover person interactions, we need a detection method that can identify text segments containing information about the interactions. Information extraction algorithms then analyse the segments to extract interaction tuples and construct an interaction network of topic persons. In this dissertation, we define interaction detection as a classification problem. We first recognize person interactions from topic documents by exploring various types of knowledge. We present a feature-based approach called FISER, exploits 19 features covering syntactic, context-dependent, and semantic information in text to detect person interactions. Then, we design the rich interactive tree structure to represent syntactic, context, and semantic information of text, and this structure is incorporated into a tree-based convolution kernel to identify interactive segments. Experiment results based on real world topics demonstrate that effective incorporation of divers features enable our system recognize person interactions efficiently. Moreover, the proposed rich interactive tree structure effectively detects the topic person interaction and that our method outperforms many well-known relation extraction and protein-protein interaction methods.

參考文獻


[6] Y.C. Chang, C.C. Chen, and W.L. Hsu, "A Composite Kernel Approach for Detecting Interactive Segments in Chinese Topic Documents," Proc. 9th Asia Information Retrieval Societies Conference, pp. 215-226, 2013.
[11] C.C. Chen and M.C. Chen, "TSCAN: A Content Anatomy Approach to Temporal Topic Summarization," IEEE Trans. Knowledge and Data Eng., vol. 24, pp. 170-183, 2012.
[20] J.H. Eom, S. Kim, S.H. Kim, B.K. Zhang, "A Tree Kernel-based Method for Protein-Protein Interaction Mining from Biomedical Literature," Knowledge Discovery in Life Science Literature, pp 42-52, 2006.
[23] C. Fellbaum, "WordNet: An Electronic Lexical Database," MIT Press, Cambridge, 1998.
[25] H. Feng, K. Chen, X. Deng, and W. Zheng, "Accessor variety criteria for Chinese word extraction," Computational Linguistics, 30(1), pp. 75-93, 2004.

延伸閱讀