Development and Testing of Transcription Software for a Southern Min Spoken Corpus

Abstract


The usual challenges of transcribing spoken language are compounded for Southern Min (Taiwanese) because it lacks a generally accepted orthography. This study reports the development and testing of software tools for assisting such transcription. Three tools are compared, each representing a different type of interface with our corpus-based Southern Min lexicon (Tsay, 2007): our original Chinese character-based tool (Segmentor), the first version of a romanization-based lexicon entry tool called Adult-Corpus Romanization Input Program (ACRIP 1.0), and a revised version of ACRIP that accepts both character and romanization inputs and integrates them with sound files (ACRIP 2.0). In two experiments, naive native speakers of Southern Min were asked to transcribe passages from our corpus of adult spoken Southern Min (Tsay and Myers, in progress), using one or more of these tools. Experiment 1 showed no disadvantage for romanization-based compared with character-based transcription even for untrained transcribers. Experiment 2 showed significant advantages of the new mixed-system tool (ACRIP 2.0) over both Segmentor and ACRIP 1.0, in both speed and accuracy of transcription. Experiment 2 also showed that only minimal additional training brought dramatic improvements in both speed and accuracy. These results suggest that the transcription of non-Mandarin Sinitic languages benefits from flexible, integrated software tools.

References


Academia Sinica. (2002). Southern Min archives: A database of historical change and language distribution. National Digital Archives Program. (Retrieved 2010/10/25) http://southernmin.sinica.edu.tw/
Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics using R. Cambridge, UK: Cambridge University Press.
Chui, K., & Lai, H. L. (2009). The NCCU corpus of spoken Chinese: Mandarin, Hakka, and Southern Min. Taiwan Journal of Linguistics, 6(2), 119-144.
Iunn, U. G. (2003a). Online Taiwanese syllable dictionary. (Retrieved 2010/10/25) http://iug.csie.dahan.edu.tw/TG/jitian/
Iunn, U. G. (2003b). Online Taiwanese concordancer system. (Retrieved 2010/10/25) http://iug.csie.dahan.edu.tw/TG/concordance/

Cited by


Yang, Y. H. (2015). 臺灣閩南語句法剖析器初探 [A preliminary study of a syntactic parser for Taiwanese Southern Min] [Master's thesis, National Chung Cheng University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0033-2110201614033820

Further reading