Long-term observation of the contents of patent documents can be used to analyze the development and evolution of technology. In this thesis, R.O.C. patent documents were automatically downloaded and collected from the R.O.C. patent website. Significant terms were then extracted with an external-memory approach, and the frequency distribution of each significant term over consecutive time periods (e.g., yearly) was computed to serve as the keyword history data for queries. Experimental results showed that, with the significant-term histories extracted from several decades of Chinese-language R.O.C. patent documents (1,086,653 records before October 10, 2009), users could query the histories of patent-related keywords. In addition, we provided a Web query interface and combined it with the cloud computing provided by the Google Chart API to generate statistical charts dynamically, so that users could observe trends in events related to R.O.C. patents.
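As a rough illustration of the per-period frequency distribution described above, the following minimal Python sketch counts how often each term occurs per year. It is not the thesis implementation: it runs entirely in memory rather than using an external-memory approach, the function names `term_history` and `yearly_series` are hypothetical, and the sample records are toy data invented for the example, not patent data from the study.

```python
from collections import Counter


def term_history(records, terms):
    """Count occurrences of each term per year.

    records: iterable of (year, text) pairs (hypothetical input format).
    terms:   list of significant terms, assumed to be already extracted.
    Returns a dict mapping term -> Counter mapping year -> occurrence count.
    """
    history = {term: Counter() for term in terms}
    for year, text in records:
        for term in terms:
            n = text.count(term)  # non-overlapping substring occurrences
            if n:
                history[term][year] += n
    return history


def yearly_series(history, term, first_year, last_year):
    """Return the counts for one term as a list, one entry per year.

    Years with no occurrence, and unknown terms, yield 0.
    """
    counts = history.get(term, Counter())
    return [counts[year] for year in range(first_year, last_year + 1)]


# Toy records invented for illustration only.
records = [
    (2007, "LED lamp with LED driver"),
    (2008, "solar cell module"),
    (2008, "LED backlight"),
    (2009, "solar cell and LED"),
]
history = term_history(records, ["LED", "solar cell"])
led_series = yearly_series(history, "LED", 2006, 2009)
solar_series = yearly_series(history, "solar cell", 2006, 2009)
```

A list such as `led_series` is the kind of per-year series that a charting service could plot as a trend line for a queried keyword.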