A Pitch Recognition Method Based on a Physical Cochlear Model

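This paper proposes a new pitch recognition method: a simple physical model of the cochlea lets a computer simulate how the human ear produces the sensation of hearing, and that simulation is used to identify pitch. Traditional pitch recognition methods fall into two main categories. The first applies a simple autocorrelation function: a segment of the sound waveform is fed into the function, and a few simple operations quickly yield the pitch of the dominant frequency. The second applies a Fourier transform to a segment of the time-domain waveform to obtain its frequency-domain spectrum, then analyzes the spectrum to obtain the pitch. Using a simple physical cochlear model, this paper drives the basilar membrane of the cochlea directly with the time-domain data and extracts the pitch by analyzing how the membrane vibrates. Unlike the first category, which can capture only one dominant frequency, our method can capture the magnitude of every frequency component at once, because the elasticity of the basilar membrane varies along its length. In addition, because it omits the spectral transform step, our method runs much faster than the second category.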
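As a rough illustration of the idea in the abstract, the Python sketch below treats the basilar membrane as a bank of independent damped mass-spring oscillators, each tuned to a different frequency, and drives them directly with time-domain samples. This is an assumption made for illustration, not the model used in the paper: the function names `cochlea_response` and `strongest_channels`, the quality factor, the semi-implicit Euler integrator, and the channel frequencies are all hypothetical.

```python
import math


def cochlea_response(samples, sample_rate, center_freqs, q_factor=20.0):
    """Drive a bank of damped oscillators with time-domain samples.

    Hypothetical stand-in for a cochlear model: each oscillator plays the
    role of one point on the membrane, with its own stiffness (and hence
    resonant frequency). Returns the mean squared displacement per channel.
    """
    dt = 1.0 / sample_rate
    zeta = 1.0 / (2.0 * q_factor)  # damping ratio derived from the Q factor
    energies = []
    for f0 in center_freqs:
        w0 = 2.0 * math.pi * f0
        x = 0.0  # displacement of this membrane segment
        v = 0.0  # velocity of this membrane segment
        acc = 0.0
        for s in samples:
            # Semi-implicit Euler step of x'' + 2*zeta*w0*x' + w0^2*x = w0^2*s.
            # The forcing is scaled by w0^2 so a constant input s would
            # settle at displacement s in every channel.
            v += dt * (w0 * w0 * (s - x) - 2.0 * zeta * w0 * v)
            x += dt * v
            acc += x * x
        energies.append(acc / len(samples))
    return energies


def strongest_channels(center_freqs, energies, count):
    """Return the `count` channel frequencies with the largest energy."""
    ranked = sorted(zip(energies, center_freqs), reverse=True)
    return [freq for _, freq in ranked[:count]]


if __name__ == "__main__":
    fs = 16000
    # Test tone with two components, at 220 Hz and 440 Hz.
    signal = [
        math.sin(2 * math.pi * 220 * n / fs)
        + 0.6 * math.sin(2 * math.pi * 440 * n / fs)
        for n in range(4000)
    ]
    freqs = [110.0, 165.0, 220.0, 330.0, 440.0, 660.0, 880.0]
    energies = cochlea_response(signal, fs, freqs)
    print(strongest_channels(freqs, energies, 2))
```

Because every channel sees the same input and responds most strongly near its own resonance, several frequency components can be read off at once, and no spectral transform is involved. A real cochlear model would also couple neighboring segments through the fluid, which this sketch omits.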

Keywords

pitch recognition; cochlea; cochlear model; basilar membrane

Parallel Abstract

This paper presents a pitch recognition algorithm based on a simplified cochlear model. Traditional methods fall into two main categories: the first analyzes the amplitude of the sound directly in the time domain; the second first transforms the sound into the frequency domain and then analyzes the resulting spectrum to recognize the pitch. Time-domain methods require relatively little computation, but they can usually detect only a single frequency. Frequency-domain methods must perform the transform first, so they are relatively slow. The proposed algorithm, called CM (Cochlear Model), combines the advantages of both categories. CM operates directly on the amplitude of the sound: in a simple physical model of the cochlea, the vibration pattern of the basilar membrane (BM) indicates the pitch. Because the elasticity of the BM is not uniform along its length, CM can identify more than one frequency at the same time.
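For comparison, the first category of traditional methods can be sketched with autocorrelation, the example the Chinese abstract names. The Python sketch below is a minimal baseline under stated assumptions, not the implementation evaluated in the paper: the function name `autocorrelation_pitch` and the 50 to 1000 Hz search range are hypothetical.

```python
import math


def autocorrelation_pitch(samples, sample_rate, min_freq=50.0, max_freq=1000.0):
    """Estimate one dominant pitch from the strongest autocorrelation lag.

    Hypothetical baseline: searches lags that correspond to frequencies
    between min_freq and max_freq and returns sample_rate / best_lag.
    """
    min_lag = int(sample_rate / max_freq)
    max_lag = int(sample_rate / min_freq)
    if len(samples) <= max_lag:
        raise ValueError("signal is too short for the requested frequency range")

    best_lag = min_lag
    best_value = float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        value = sum(
            samples[i] * samples[i + lag] for i in range(len(samples) - lag)
        )
        # Strict comparison keeps the shortest lag when two lags tie.
        if value > best_value:
            best_value = value
            best_lag = lag
    return sample_rate / best_lag


if __name__ == "__main__":
    fs = 8000
    tone = [math.sin(2 * math.pi * 200 * n / fs) for n in range(2000)]
    print(autocorrelation_pitch(tone, fs))
```

The function returns a single lag, which is why this family of methods reports only one dominant frequency per analysis window.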

Parallel Keywords

pitch; pitch tracking; pitch recognition; pitch determination algorithm; PDA; cochlea; cochlear model; basilar membrane; BM

Cited By

Lin, P. J. (2006). 利用相位編碼來做音訊隱藏 [Audio hiding using phase coding] [master's thesis, National Taiwan University]. Airiti Library. https://doi.org/10.6342/NTU.2006.10103
