A Literature Review of Expressive Speech Synthesis

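In recent years, the quality of speech produced by text-to-speech synthesis has improved markedly, yet its naturalness still leaves considerable room for improvement. The main reason is that synthesized speech is mostly neutral in tone and lacks individual speaking characteristics or natural emotional expression. This has opened up research on expressive speech synthesis, which aims to make synthesized speech more natural. A substantial body of literature now explores how to develop text-to-speech systems based on expressive speech synthesis. This paper provides a complete review and identifies five research topics: speaking styles, emotion categories, corpus construction, expressive speech synthesis methods, and the evaluation of synthesized speech.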

Keywords

Text-to-speech; expressive speech synthesis; speaking styles; emotion categories

Parallel Abstract

In recent years, the quality of speech generated by text-to-speech synthesis has improved dramatically. However, the naturalness of synthesized speech can still be improved, since it rarely conveys the emotion found in human speech. This has motivated research on expressive speech synthesis (ESS) to improve naturalness. Many papers now address the development of ESS-based text-to-speech systems. This study gives an overview of ESS research and summarizes it under five topics: speaking styles, emotion categories, corpus construction, ESS approaches, and the evaluation of synthetic speech.

Parallel Keywords

Text-to-Speech; Expressive Speech Synthesis; Speaking Styles; Emotion Categories

Cited By

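蔡昀庭 (2009). 基於隱藏式馬可夫模型之中文語音合成系統 [A Chinese speech synthesis system based on hidden Markov models] [Master's thesis, National Tsing Hua University]. Airiti Library. https://doi.org/10.6843/NTHU.2009.00722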

馬汶汶 (2011). 電子書語音閱讀輔助裝置之設計 [Design of a speech-reading assistive device for e-books] [Master's thesis, Tatung University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0081-3001201315111854
