台語 前音節 後音節 的 分割及合成

在語音合成時，將前音後與後音前相同的母音做疊合的方法，稱為同母音疊合(same vowel overlap and add, SVOLA)，使用同母音疊合的方式，就可以利用基礎音節去合成出所有的聲調，但同母音疊合時，會遇到前音節的前音前與前音後該切在何處的問題。本論文的目標就是去尋找前音前與前音後的切割點，為了做音節的分段，我們使用四種不同的音節分段的方法：獨立音框的最大概似標籤、不用訓練資料的最大分段總概似的切割法、使用訓練資料的最大分段總概似的切割法或 VDS 數列切割法去對音節做分段，其中 VDS 數列切割法可以很有效的切割前音前與前音後。

關鍵字

同母音疊合；前音節；後音節；音節分段

並列摘要

Given two simple syllables with a common vowel in them, we find that a new quite intelligible syllable can be synthesized by splitting the two syllables at this common vowel, and then concatenate the first part of the first syllable and the second part of second syllable. We call these two simple syllables front syllable and back syllable respectively, and call this method SVOLA, or “same vowel overlap and add” syllable synthesis. With a somewhat minimal set of front and back syllables as basis, a Taiwanese syllable synthesis system can be built easily using SVOLA. In addition to implement such a synthesis system, this thesis also explores four methods of splitting of the basis syllables. The first is independent frame maximum log-likelihood. The second is maximum segmental total likelihood without training data. The third is maximum segmental total likelihood with training data. For the fourth method we transform the feature vectors into a DS matrix (smoothing and then taking difference), and we split the basis syllable using the VDS sequence, or the sequence of the column variances of the DS matrix. The VDS approach is more effective in splitting the front syllables.

並列關鍵字

same vowel overlap and add ； front syllabels ； back syllabels ； segmentation of syllabels

參考文獻

[2] 吳德祥 (2009). 台華語音節雙拼合成. 清華大學統計學研究所學位論文, 2009 年, 1-42. 新竹:清華大學.

[3] 陳雅婷 (2012). 使用擴展修剪演算法決定語音音週標記及在台語語音合成的應用. 清華大學統計學研究所學位論文, (2012 年), 1-40. 新竹:清華大學.

[4] Kumar, N., & Andreou, A. G. (1998). Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition. Speech communication, 26(4), 283-297.

[5] Kortekaas, R. W., & Kohlrausch, A. (1997). Psychoacoustical evaluation of the pitch-synchronous overlap-and-add speech-waveform manipulation technique using single-formant stimuli. The Journal of the Acoustical Society of America, 101, 2202.

[6] Verhelst, W., & Roelands, M. (1993). An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech. In Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on (Vol. 2, pp. 554-557).

國際替代計量

台語前音節後音節的分割及合成

全文下載

主題瀏覽