透過您的圖書館登入
IP:3.145.16.90
  • 期刊

Analyses on the Used Vocabulary in the Corpus of Taiwanese Learner of Japanese (CTLJ): Comparisons between CTLJ and Self-Constructed Natural Corpus

「台灣日語學習者語料庫」(CTLJ)之使用語彙分析-與自然語料庫之比較為本

摘要


本論文針對台灣日語學習者語料庫(CTLJ)之原文部分,先以詞素解析器MeCab將其中之語彙加以分割,針對解析錯誤,前後歷時三年並經兩次校正後,再進行使用語彙分析。為了凸顯學習者語彙之特徵,分析時透過與筆者自行建構之自然語料庫進行比較。經分析結果得知:CTLJ原文部分之詞素總數超過39萬詞,其中個別詞素約1萬3千詞,名詞最多,連7千4百餘詞(佔57.2%);其次為動詞,逾3千1百餘詞(佔24.2%)。此外,藉由比較CTLJ與筆者自行建構之自然語料庫,可以掌握學習者使用語彙之實際狀況與易錯語彙之使用情形,提供學習者強化學習之參考。

關鍵字

語料庫 詞素 出現頻率 易錯語彙

並列摘要


This paper presents an in-depth analysis of the use of vocabulary covered by the Corpus of Taiwanese Learner of Japanese. Our method consists, firstly, in applying the Japanese morphological analyzer, MeCab, to segment vocabularies of the original writings in Japanese in CTLJ, and then proceeding with morpheme-level analysis of errors in grammar and usage, which process has been repeated twice in the recent three years. In order to highlight the words characteristic of the Taiwanese Learners' Japanese, comparisons are made between CTLJ and a corpus of current Japanese, which have been constructed by the author. The result indicates that the number of morpheme tokens used in the original students' essays in Japanese in CTLJ is more than 390 thousand, or around 13 thousand morpheme types. The number of nouns amounts to 7,400, which accounts for 57.2% of morpheme types. The number of verbs is 3,100 (24.2%). In addition, comparisons between CTLJ and the above-mentioned natural corpus help the instructors to grasp the actual situations of how the learners use and reveal what sort of items are particularly prone to errors, thereby enabling them to provide apt and systematic instructions to the learners.

參考文獻


黃淑妙、山本卓司、関口要(2009)。『台湾人日本語学習者コーパス』(CTLJ)試行版の公開。台灣日本語文學報。25,269-292。
陳毓敏(2004)。台湾人日本語学習者の漢語の意味認知における難易度の階層性の検証。台灣日本語文學報。19,291-315。
陳淑娟(2006)。作文における語彙習得についての—考察—使用語数と語類の変化を中心に。東呉日語教育学報。29,29-63。
(2003)。第二言語習得研究への招待。????。
Krashen, S. D.(1982).Principles and practice in second language acquisition.Oxford:Pergamon.

延伸閱讀