透過您的圖書館登入
IP:3.19.31.73
  • 期刊
  • OpenAccess

Mandarin Topic-oriented Conversations

並列摘要


This paper describes the collection and processing of a pilot speech corpus annotated in dialogue acts. The Mandarin Topic-oriented Conversational Corpus (MTCC) consists of annotated transcripts and sound files of conversations between two familiar persons. Particular features of spoken Mandarin, such as discourse particles and paralinguistic sounds, are taken into account in the orthographical transcription. In addition, the dialogue structure is annotated using an annotation scheme developed for topic-specific conversations. Using the annotated materials, we present the results of a preliminary analysis of dialogue structure and dialogue acts. Related transcription tools and web query applications are also introduced in this paper.

並列關鍵字

Taiwan Mandarin dialogue act speech corpus

參考文獻


Alexandersson,J.,B. Buschbeck-Wolf,T. Fujinanti,M. Kipp(1998).Dialogue Acts in VERBMOBIL-2.(Report no).
Anderson,A.,M. Bader,B. Bard,E. Boyle(1991).The HCRC Map Task Corpus.(Language and Speech).
Barns,C.,E. Geoffrois,Z. Wu,M. Liberman(2001).Transcriber Development and Use of a Tool for Assisting Speech Corpora Production.Speech Communication.33,5-22.
Chao,Y.-R.(1968).University of California Press.
Chen,K.-J.,C.-R. Huang(1996).SINICA CORPUS: Design Methodology for Balanced Corpora.(Proceedings of the Eleventh Pacific Asia Conference on Language, Information and Computation).

延伸閱讀