透過您的圖書館登入
IP:3.144.161.116
  • 學位論文

音訊檔案之確認及分類方法研究

A Study of Audio File Verification and Classification

指導教授 : 蔡偉和
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


隨著資料儲存容量的迅速增長,自動分類資料檔案的技術已日益顯得重要。特別是影音等多媒體資料檔案,若要透過人工檢視或聆聽來判斷其內容,將是極耗時費力的事。本論文嘗試探討音訊檔案的自動確認與分類方法,目標是判斷未知內容的檔案是否為音訊,若是,則接著判斷其是否為語音、清唱、純音樂、含伴奏之歌唱或噪音等。並且,若該檔案為語音、清唱、或含伴奏之歌唱,則我們更進一步判斷其所含語者或歌手之性別。本論文發展一種階層式的識別系統,透過音高及音色的分析,將未知檔案分為九大類。實驗結果證實其可行性。

關鍵字

音訊檔案分類 音高 音色

並列摘要


With the rapid increase in the capability of data storage, it has become more and more important to develop automated techniques for data classification. In particular, multimedia material like audio is difficult to browse, because it takes time and energy to play and listen to the audio data. Recognizing this, our work tries to design a system for verifying whether or not an unknown file in a personal computer (PC) belongs to audio data and further identifying which acoustic class the file is, if the file belongs to audio. The acoustic classes we considered here encompass noise, speech, singing, instrument-sole music, and accompanied singing. For the case that a test file is speech, singing, or accompanied singing, we further identify the gender of the person involved in the audio data, e.g., male/female speaker or male/female singer. The proposed system is of a hierarchical structure that divides audio files into nine classes. Our experiments show that the system is feasible in PC file management.

參考文獻


[1] J. L. Hsu, C. C. Liu, A.L.P. Chen, “Discovering nontrivial repeating patterns in music data,” IEEE Trans. on Multimedia, pp. 311-325, Sep. 2001.
[2] W. Chai and B. Vercoe, “Music Thumbnailing via Structural Analysis,” in proc. ACM Multimedia Conference, Nov. 2003.
[3] M. Goto and Y. Muraoka, “Real-time beat tracking for drumless audio signals: chord change detection for musical decision,” Speech Communication, 1999.
[6] S. A. Abdallah and M. Plumbley, “An ICA approach to automatic music transcription,” in proc. 114th AES Convention, 2003.
[7] G. Tzanetakis and P. Cook, “Musical Genre Classification of Audio Signals,” IEEE Trans. on Speech and Audio Processing, pp. 293-302, 2002.

延伸閱讀