透過您的圖書館登入
IP:3.134.118.95
  • 學位論文

同時使用旋律與歌詞資訊之改良型哼唱檢索系統

An Improved Query by Singing/Humming System Using Melody and Lyrics Information

指導教授 : 張智星
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


本論文提出了一種改進的哼唱檢索系統,能同時使用旋律和歌詞的資訊,以實現更好的性能。首先會進行哼唱分辨,以將「唱」和「哼」分離開來。對於「哼」的查詢,我們套用了只含音高的旋律辨識方法,其在參加MIREX的哼唱檢索項目時被使用,並在該比賽中排名第一。對於「唱」的查詢,我們將旋律辨識和歌詞辨識的分數結合,以利用額外的歌詞資訊。歌詞辨識是基於一個改良過的樹狀網路,此技術常用在語音辨識上。系統整體效能,以錯誤減少率來看,在兩個不同的實驗參數下的前20名之結果中,分別達到 39.01%和23.53%,說明了此系統的可行性。

並列摘要


This paper proposes an improved query by singing/humming (QBSH) system using both melody and lyrics information for achieving better performance. Singing/humming discrimination (SHD) is first performed to distinguish singing from humming queries. For a humming query, we apply a pitch-only melody recognition method that has been used for QBSH task at MIREX with rank-1 per-formance. For a singing query, we combine the scores from melody recognition and lyrics recognition to take advantage of the extra lyrics information. Lyrics recognition is based on a modified tree lexicon that is commonly used in speech recognition. The performance of the overall QBSH system achieves 39.01% and 23.53% error reduction rates, respectively, for top-20 recognition under two experimental settings, indicating the feasibility of the proposed method.

並列關鍵字

無資料

參考文獻


[5] T. Wang, D.-J. Kim, K.-S. Hong, and J.-S. Youn, “Music Information Retrieval System using Lyrics and Melody Information,” Asia-Pacific Conference on Information Processing, pp. 601–604, 2009.
[7] J.-S. R. Jang, H.-R. Lee, M.-Y. Kao, “Content-based Music Retrieval Using Linear Scaling and Branch-and-Bound Tree search,” in Proc. of IEEE International Conference on Multimedia and Expo, August 2001.
[8] Cambridge University Engineering Department , HTK Web-Site, http://htk.eng.cam.ac.uk/, 2006
[13] M. Suzuki, T. Hosoya, A. Ito, and S. Makino, “Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information,” EURASIP Journal on Advances in Signal Processing, vol. 2007, Article ID 38727, 8 pages, 2007. doi:10.1155/2007/38727
[1] A. J. Ghias, D. C. Logan, and B. C. Smith, “Query by humming-musical information retrieval in an audio database,” in Proc. ACM Multimedia’95, San Francisco, 1995, pp. 216–221.

延伸閱讀