A Rule Set to Select Representative Nouns from a Noun Synonym Set for a Japanese Fishing Website

Japanese documents have noun synonyms. These use kanji notation, hiragana notation, and katakana notation for words. Sometimes words have alternate kanji expressions: alternate names for an object, different suffixes for kanji, etc. This is why noun synonym sets are formed for Japanese nouns. Thesauruses and dictionaries can be used to select a representative expression from a noun synonym set. However, these references do not consider the type of document. Representative nouns are often different depending on the type of articles. For example, in articles in newspapers, kanji is preferred. In contrast, in articles in encyclopedias, katakana is preferred. The problem is to form a rule set to select a representative noun from a noun synonym set, and the rule set must consider the type of document. We propose a rule set arranged for the WEB Fish Encyclopedia (in Japanese, Sakanazukan). We introduce a keyword category in the rule set to increase the correctness of the selected representative noun. As a result, most of the representative expressions were selected appropriately from noun synonyms. We expressed these noun synonyms as feature vectors. By using three numerical values and four Boolean values, all noun synonyms were expressed.

並列關鍵字

Noun Synonym ； Japanese Syntax Analysis ； Keyword Dictionary

參考文獻

Shinmura, I.(1991)。Kojien fourth edition (Japanese Dictionary)。Tokyo:Iwanami。

Google Scholar

Wikimedia Foundation, Inc. Wikipedia. Retrieved on January 31, 2012, from http://ja.wikipedia.org/

Google Scholar

Fishing-Forum, The WEB fish encyclopedia (in Japanese, Sakanazukan). Retrieved on January 31, 2012, from http://www.fishing-forum.org/zukan/

Google Scholar

Kawahara, D.,Kurohashi, S.(2006).Case frame compilation from the web using high-performance computing.Proceedings of the 5th International conference on Language Resource and Evaluation.(Proceedings of the 5th International conference on Language Resource and Evaluation).:

Google Scholar

Ravi, S.,Knight, K.(2009).Minimized models for unsupervised part-of-speech tagging.Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language.(Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language).:

Google Scholar

被引用紀錄

黃崧芥（2014）。磁性電容中不同方向與大小磁矩對介電性質提升之研究〔碩士論文，國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2014.00010

Chen, L. T. (2010). 同源重組蛋白酶的結構功能分析與合理設計之縮氨酸可調控同源重組蛋白酶 [doctoral dissertation, National Taiwan University]. Airiti Library. https://doi.org/10.6342/NTU.2010.01457

國際替代計量

A Rule Set to Select Representative Nouns from a Noun Synonym Set for a Japanese Fishing Website

全文下載

主題瀏覽