透過您的圖書館登入
IP:18.224.39.32
  • 學位論文

MOBILE01和PTT兩個不同論壇相同面向習慣用詞之探討-以電信、寬頻版為例

MOBILE01 and PTT two different forums discussion on the same face habit words - a case study of telecommunications, broadband

指導教授 : 蔣璿東

摘要


本論文將針對兩種不同類型的論壇做比較,對相同領域而言,觀察兩種不同類型論壇詞典外的習慣用語是否相同?進而討論(1)利用其中一個較專業和發表與特定領域相關文章較多之論壇所訓練完成的詞庫當作另一論壇的初始詞庫是否有助於準確率和回收率的提升?(2)有了初始詞庫,為節省大量人力,是否可跳過第一階段人工標註的動作,直接利用系統第二階段的動作直接擷取文章中的意見詞或排除字?本論文會利用PTT與Mobile01的電信版和寬頻版來討論上述問題。依據實驗數據顯示,如果使用Mobile01排除字詞庫當初始詞庫,確實可以改善系統的準確率和回收率;但就非冷門的詞庫外意見詞使用習慣而言,PTT和Mobile01使用者的習慣用語仍有些許不同且數量不多,所以即便是使用Mobile01訓練完成的詞庫當作PTT初始詞庫,為了維護較高的準確率和回收率,不建議跳過第一階段人工標註的動作;但如果為了節省人力可以跳過第一階段人工標註的動作,直接利用系統第二階段的動作直接擷取文章中的意見詞或排除字。

關鍵字

初始詞庫 人工標註

並列摘要


This paper compares the forums of two different types to observe whether the idioms/terminologies outside the dictionaries of the two different types of forums are the same for the same field before further discussions (1) whether it is helpful in improving precision and recall to use the trained lexicon of a more professional forum with more published articles relating to a specific field as the initial lexicon of another forum. (2)Using the initial lexicon, this paper attempts to discuss whether the manual tagging operation at the first stage can be skipped to directly capture the opinion words or exclusion words of the articles by using the system operations of the second stage. This paper discusses the above problems by using PTT and Mobile01 telecommunications page and broadband page. According to experimental data, using Mobile01 exclusion words lexicon as the initial lexicon can improve the system precision and recall. However, for the usage of unpopular opinion words outside the lexicon, idioms/phrases of PTT and Mobile01 users may slightly differ in small number. Therefore, even if using the Mobile01 trained lexicon as the PTT initial lexicon, to keep relatively high level of precision and recall, it is not recommended to skip the manual tagging operation of the first stage. However, in order to save manpower, the manual tagging operation of the first stage can be skipped to directly use the system operations of the second stage to directly capture opinion words or exclusion words from the article.

並列關鍵字

Initial Lexicon Manual Tagging

參考文獻


[21] 林偉揚, "應用種子詞彙延伸方式於BBS電影評論之口碑分析" 元智大學資訊管理學系, 2011.
[24] 平震孙, " 一個適用於行動裝置的網頁搜尋結果分群系統之研究" 元智大學資訊管理研究所碩士論文, 2007.
[26] 陳子龍, "中文意見探勘系統之句法分析" 淡江大學資訊工程學系資訊網路與通訊研究所碩士論文, 2012
[3] 簡立, "意見探勘系統設計" 淡江大學資訊工程研究所碩士論文, 2012.
[5] N. Kobayashi, K. Inui, and Y. Matsumoto "Extracting aspect-evaluation and aspect-of relations in opinion mining," 2007, pp. 1065-1074.

被引用紀錄


吳冠陞(2013)。中文句法規則搭配意見元素之研究〔碩士論文,淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2013.00462
陳子龍(2012)。中文意見探勘系統之句法分析〔碩士論文,淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2012.01149

延伸閱讀