透過您的圖書館登入
IP:3.20.224.152
  • 期刊

Improvement in Speech to Text for Bahasa Indonesia Through Homophone Impairment Training

摘要


In this research, an approach for increasing accuracy in speech to text application is done using Mel Frequency Cepstral Coefficient (MFCC) trained by Backpropagation Neural Network (BPNN). A set of Bahasa Indonesia homophones data speech is used for training and validation. The record is taken from 6 native adults comprising 3 males and 3 females. Working in 16 KHz sampling mode, the data is stored in WAV format. A confusion matrix is used to validate the system with and without homophone locking learning. A significant improvement is observed from the experiment. The percentage of accuracy is increased from 53.33 to 93.4 from male samples. From females’ records, the increment is even higher. The accuracy percentage has risen from 36.8 to 93.33.

關鍵字

BPNN confusion matrix homophone MFCC speech to text

被引用紀錄


張敬昕(2009)。宋代景德鎮青白瓷研究--以湖田窯為中心〔碩士論文,國立臺灣師範大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0021-1610201315172863

延伸閱讀