透過您的圖書館登入
IP:18.220.136.165
  • 學位論文

利用MPEG-7之音樂特徵值做歌曲檢索系統

Music Retrieval System Using MPEG-7 Audio Descriptor

指導教授 : 尤信程

摘要


本論文的主題是建構一個音樂檢索系統。主要的概念是透過MPEG-7音樂特徵值(Audio Descriptor)來判別某歌曲片段是否存在於歌曲資料庫當中。然而此系統能否達到實用的目標,端賴是否有高效率之搜尋方法。若歌曲片段比對花費太多時間,則會降低系統的實用性。我們以MPEG-7之聲音簽章特徵值(Audio Signature Descriptor)為基礎,提出降低資料維度的方法,並且利用多維度最近鄰居搜尋法(Multidimensional Nearest Neighbor Searching)加快搜尋速度,讓整體比對時間能大幅降低。我們也提出一改善誤警率(False Alarm Rate)的方法,進而降低錯誤接收率(FAR)與錯誤拒絕率(FRR),並以實驗探討其有效性。最後,我們利用多重解析度搜尋技巧實做本系統。

並列摘要


In this thesis, we propose a musical retrieval system. The main concept is to identify whether one piece of sound track is the same as another one in the song database by using MPEG-7 audio descriptor. However, the practicability of this system is based on whether it has some efficient searching method. If the comparison between query song and songs in database costs too much time, it will decrease system’s practicability. Based on Audio Signature Descriptor, we propose some methods about dimension reduction and the use of KD-tree for multidimensional nearest neighbor searching. It decreases the overall comparison time to increases practical value of our system. We also use some methods to improve system’s false alarm rate (i.e., decrease FAR and FRR) and benchmark those methods by ROC graph. Finally, we use multi-resolution search to implement our system.

參考文獻


[2] ISO/IEC, “Multimedia content description interface – part 4: Audio,” ISO/IEC, International Standard 15938-4, 2002.
[4] J. Lukasiak, D. Stirling, N. Harders, S. Perrow, “Performance of MPEG-7 low level audio descriptors with compressed data,” Proceedings of IEEE Multimedia and Expo, vol. 3, pp. III-237-6, July 2003.
[5] J. Herre, O. Hellmuth, M. Cremer, “Scalable robust audio fingerprinting using MPEG-7 content description,” Proceeding of IEEE Workshop on Multimedia Signal Processing, pp. 165-168, Dec. 2002.
[6] M. Sert, B. Baykal, A. Yazici, “A Robust and Time-efficient Fingerprinting Model for Musical Audio,” IEEE International Symposium on Consumer Electronics – ISCE’06, July 2006.
[7] ISO/IEC, “Multimedia content description interface – part 6: Reference Software,” ISO/IEC, International Standard 15938-6, 2003.

被引用紀錄


李育瑋(2014)。基於MPEG-7的歌者辨識與歌唱評分系統〔碩士論文,國立交通大學〕。華藝線上圖書館。https://doi.org/10.6842/NCTU.2014.00284
鄭凡寓(2008)。音訊編碼空間定位評估系統〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2008.00155
洪名人(2010)。利用獨立成分分析及因素分析對MPEG-7音訊特徵描述元資料降維進行歌曲辨識與檢索之研究〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-0302201015404500
蒲羿翰(2012)。利用立體聲資訊做歌曲檢索系統〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-2001201214353000
林祐竹(2014)。利用MPEG-7特徵值於手機錄音歌曲檢索之效能評估〔碩士論文,國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-2508201412595300

延伸閱讀