透過您的圖書館登入
IP:54.196.27.122
  • 學位論文

應用文件探勘技術進行立法文本自動化分析

Automatic Content Analysis of Legislative Documents by Text Mining Techniques

指導教授 : 林福仁
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


在立法院國會圖書館網站裡,提供了一個公開且客觀的管道,讓公民可以追蹤了解立法院每天發生的事情,諸如立委的質詢等等。然而,這些公開的資訊量其實非常大,也非常凌亂,一般民眾可能無法有效消化這些資訊,或很難透過這些資訊去清楚了解立委的問政績效,因而浪費了此公開管道的美意,因此,為了克服這個困難,本研究目的就在於透過文件探勘技術去有效分辯每位立委立法表現的類別,然後展現出他們在各領域裡的問政績效。 此研究根據中山政治所專家所建構的立法分類架構為基礎,透過兩階段分群(two-stage clustering)去做特徵值擷取,再採用支持向量機(support vector machine)去建立模型來自動預測立委立法表現到最適合的分類。 為了讓此系統可以永續執行下去,此研究同時也對政治專家與一般民眾在分類標籤貢獻上的內容差別做了實驗驗證,呈現的結果沒有顯著差別,將支持未來系統可以直接透過網路讓一般民眾做維護與更新分類的動作。 本研究提出的自動預測分類方法,輔以視覺化雷達圖的呈現,希望幫助公民更能了解立法院活動與立委的問政績效,根據實驗的結果顯示,使用本方法可以有效自動分辨立法表現類別,進而可持續利用國會圖書館的公開立法資訊,有效做到監督立委在各種面向下的問政績效。

並列摘要


The Parliamentary Library of Taiwan’s Legislative Yuan website provides a fair and objective channel for the public to track daily activities of the Legislative Yuan and legislators’ inquiries. However the quantity of generated documents is so large that the general public may not be able to update of the legislative performance of each legislator from these contents. To mitigate the gap of legislative document generation and the sense making by the general public, this study proposed a text mining mechanism to automatically classify legislative documents referring to each legislator, and then represent the proportion of their legislative performance on certain categories. This study first initiated a basic legislative categorical structure by domain experts. Then a two-stage clustering was applied to perform feature selection for legislative documents. The SVM method was applied to build a model to classify the new document to the appropriate category. In order to maintain the classification categories up to date, in this study, we also evaluate the difference from labeling contents by domain experts and the general public. If the categories labeled by both do not have significant difference, we can call for the general public via internet to maintain the updated categories of newly generated legislative documents. Experimental results show the effectiveness of the proposed test mining mechanism, which automatically classifies legislative documents to reveal legislators’ performance accordingly. With this result, people can monitor legislators and track their legislative activities using the information from the Parliamentary Library of Legislative Yuan to update their perception on legislative performance in various categories.

參考文獻


Liao, Y. (2006). The Research of Voter Turnout: Case Study in Taiwan. The Journal of Chinese Public Administration, (3), 185-202.
Lin, J. J. (2006). The Study of Interpellation System of Legislative Yuan in R.O.C. Journal of TOKO, 1(1).
Liao, D. L., Lin, F. R., Huang, Y. C., Liu, Z. Y., & Lee, C. X. (2012). The Establishment of Taiwanese Legislators' Campaign Promise Database. Journal of Electoral Studies, 19(1), 129-158.
I. Political Science references:
II. Technical mechanism references:

被引用紀錄


薛仱芸(2014)。改善網路操弄評論分類績效之研究〔碩士論文,朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-0905201416542666

延伸閱讀