透過您的圖書館登入
IP:216.73.216.60
  • 學位論文

應用於耳機之內容認知的聲學音場增強系統

Context-Aware Sound Field Enhancement for Headphones

指導教授 : 陳自強
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


本論文所提出之內容感知的聲學音場增強(acoustic field enhancement)技術,以耳機為主,架構分為主副音分離、聲學音場增強和低音增強三大部分。在主副音分離的部分,透過商業用之音樂及電影之混音規則相關的數學特徵,進行二位元時頻遮蔽,分離出主音源:歌唱聲、對話聲,副音源:配樂伴奏聲及環境音等;聲學音場增強為針對不同的內容提供適當的增強方法,配樂伴奏聲的部分,透過音場寬度增強降低相關性方法,形成頭部音場外側化(externalization),減低聽覺疲勞感,擴大感知音場,增加臨場感,在對話聲的部分,在總能量不超過配樂聲的範圍內,針對其子音頻帶透過參數型等化器(equalizer)予以增強,歌唱聲的部分則透過在主要的臨界頻帶(critical band)加入延遲脈衝、增加其空間感使其能夠和伴奏音場相匹配,又不失其主音源清晰度,最後再將加強完後之主音源和副音源合成,並考慮耳機之硬體特性,針對其低頻衰減處進行低音增強。實驗結果在電影的音訊檔部分,能夠有效的降低以往在聲學音場增強對主音源所造成的模糊感,總和上述三大架構,本論文之方法能夠帶給聽者絕佳的耳機聆聽感知。 關鍵詞:歌聲和伴奏聲分離,降低左右聲道相關性,頭部音場外側化,虛擬低音增強,語音清晰度增強

並列摘要


In this thesis, we proposed a technique of context-aware acoustic field enhancement under a listening condition using a stereo headphone. The architecture consists of three parts which are blind audio separation, acoustic field enhancement and virtual bass enhancement. In the blind audio separation system, we use commercial audio tracks’ mixture mathematical characteristics of left and right channels to carry out time-frequency binary mask, and then separate dominate sound (vocal and dialogues) and subordinate sound (incidental music and surrounding sound). In the acoustic field enhancement system, based on different audio property, we provide appropriate enhancers as follows: In the incidental music part, reduce left and right channels’ correlation to achieve externalization, decrease listening fatigue and expand the acoustic sound field as more as possible. In the speech part, we utilize an equalizer to enhance the consonants’ frequency bands, to achieve speech intelligibility. In the vocal part, the full search of two delay impulses at a time domain is conducted at difference critical bands, in order to make the vocal with the accompaniment more matchable. Finally, considering the physical characteristic of headphones, we adopt the missing fundamental phenomenon to enhance low frequency bands. The experimental results reveal that our method can make the dominate sound of movie audio more transparency compare with conventional acoustic field enhancement method. Summing of the above three systems, our method present excellent sound-field spaciousness, clear dialog, deep bass and immersive surrounding sense. Keywords:Blind Audio Separation, De-correlation, Externalization, Virtual Bass Enhancement, Speech Intelligibility Enhancement

參考文獻


[2] N. Iwanaga, W. Kobayashi, K. Furuya, T. Onoye, and I. Shirakawa, “Embedded implementation of acoustic field enhancement for stereo sound sources,” IEEE Transaction on Consumer Electronics, vol. 49, pp. 737-741, 2003.
[3] Naraji Sakamoto, Toshiyuki Gotoh, and Yoichi Kimura, “On out-of-head localization in headphone listening,” Journal of Audio Engineering Society, vol. 24, no.9, pp. 710-716, November 1976.
[4] L. Wang, F. Yin, and Z. Chen, “An “out of head” sound field enhancement system for headphone,” Proc. of IEEE International Conference on Neural Networks and Signal Processing, pp. 517-521, June 2008.
[8] M. R. Schroeder, D. Gottlob, and K. F. Siebrasse, “Comparative study of European concert halls: correlation of subjective preference with geometric and acoustic parameters,” Journal of the Acoustical Society of America, Volume 56, Issue 4, pp. 1195-1201, 1974.
[9] Jens Blauert, Spatial Hearing-The Psychophysics of Human Sound Localization, 1997.

延伸閱讀