Head-related transfer functions (HRTFs) serve the increasingly dominant role of implementation 3-D audio systems, which have been realized in some commercial applications. However, the cost of a 3-D audio system cannot be brought down because the efficiency of computation, the size of memory, and the synthesis of unmeasured HRTFs remain to be made better. This paper presents a way to moderate the memory requirement and computational complexity in order to reduce the cost of a 3-D audio system. We employ the library of HRTF measurements called Knowles Electronics Mannequin for Acoustic Research (KEMAR) as the original data [8]. First of all, each HRTF measurement has to be approximated in the minimum phase, and the length of the HRTF is limited by use of a modified Hamming window function, if necessary. Second, we propose an improved LBG-based clustering algorithm to lower the huge number of HRTFs. During the clustering, each HRTF is represented by its power cepstrum. Only portions of the HRTFs are reserved, and the others are neglected on condition that the minimal average mismatch distance between measured and synthesized HRTFs is achieved. Before applying localization of 3-D sounds, both unmeasured and removed HRTFs can be synthesized by linear interpolation and interaural time difference insertion. Experimental results reveal that the average and the maximum mismatch distances deriving from our improved LBG-based clustering method are less than those from the uniform clustering and LBG-based clustering methods [13].
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。