透過您的圖書館登入
IP:18.220.137.164
  • 期刊
  • OpenAccess

A Comparative Study of Four Language Identification Systems

並列摘要


In this paper, we compare four typical spoken language identification (LID) systems. We introduce a novel acoustic segment modeling approach for the LID system frontend. It is assumed that the overall sound characteristics of all spoken languages can be covered by a universal collection of acoustic segment models (ASMs) without imposing strict phonetic definitions. The ASM models are used to decode spoken utterances into strings of segment units in parallel phone recognition (PPR) and universal phone recognition (UPR) frontends. We also propose a novel approach to LID system backend design, where the statistics of ASMs and their co-occurrences are used to form ASM-derived feature vectors, in a vector space modeling (VSM) approach, as opposed to the traditional language modeling (LM) approach, in order to discriminate between individual spoken languages. Four LID systems are built to evaluate the effects of two different frontends and two different backends. We evaluate the four systems based on the 1996, 2003 and 2005 NIST Language Recognition Evaluation (LRE) tasks. The results show that the proposed ASM-based VSM framework reduces the LID error rate quite significantly when compared with the widely-used parallel PRLM method. Among the four configurations, the PPR-VSM system demonstrates the best performance across all of the tasks.

參考文獻


Adda-Decker, M.,F. Antoine,P.B. Mareuil,I. Vasilescu,L. Lamel,J. Vaissiere,E. Geoffrois,J.-S. Lienard(2003).Phonetic Knowledge, Phonotactics and Perceptual Validation for Automatic Language Identification.In Proceedings of the 15th International Congress of Phonetic Sciences.747-750.
Bellegarda, J.R.(2000).Exploiting Latent Semantic Information in Statistical Language Modeling.In Proceedings of IEEE.88(8),1279-1296.
Berkling, K.M.,E. Barnard(1994).Analysis of phoneme-based features for language identification.International Conference on Acoustics, Speech & Signal Processing.1,289-292.
Berkling, K.M.,E. Barnard(1994).Language identification of six languages based on a common set of broad phonemes.International Conference on Spoken Language Processing.1891-1894.
Corredor-Ardoy, C.,J.L. Gauvain,M. Adda-Decker,L. Lamel(1997).Language identification with language-independent acoustic models.5th European Conference on Speech Communication and Technology.1,55-58.

延伸閱讀