As multi-functional telephony devices grow in popularity, a traditional audio conference may now involve heterogeneous teleconferencing devices, including POTS phones, dual-mode smartphones, pocket PCs, and so on. Some of these devices can access IP networks and support video conferencing with peer devices in the audio conference, providing a better conferencing experience. In this scenario, it becomes necessary to synchronize the audio streams, which traverse the PSTN, with the video streams, which traverse the IP network. While related work has investigated audio/video synchronization, it is limited to synchronization within a homogeneous network and hence cannot be applied to the target scenario. Therefore, in this thesis we propose an end-to-end framework for audio/video synchronization, and we reduce the problem to one that requires only synchronizing the PSTN audio stream with the IP audio stream. We first employ a time-domain algorithm based on cross-correlation and show that it fails to synchronize audio streams distorted by noise or packet loss. Hence, we extract distortion-tolerant audio features for synchronization using Digital Speech Processing (DSP) techniques. We apply Mel-frequency cepstral coefficients (MFCCs) in the synchronization algorithm and obtain respectable performance on audio streams distorted by codecs and packet loss. However, MFCCs are inherently vulnerable to overlapping speakers. Therefore, we leverage the sparsity of speech in spectrograms to design a spectrogram-based synchronization algorithm, which achieves favorable performance on speech mixtures and noisy speech. Evaluation results show that DSP techniques improve the accuracy and robustness of synchronization between PSTN audio streams and IP video streams.
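
To make the time-domain baseline concrete, the following is a minimal sketch of cross-correlation-based offset estimation between two audio streams. It is illustrative rather than the thesis's actual implementation: it assumes both streams are mono, float-valued, and already resampled to a common rate, and the function name estimate_offset is a placeholder.

import numpy as np

def estimate_offset(pstn_audio, ip_audio, sample_rate):
    """Estimate the lag (in seconds) of the PSTN stream relative to the
    IP stream via full cross-correlation of two mono, same-rate signals."""
    # Remove DC and normalize energy so level differences between the
    # PSTN and IP paths do not bias the correlation peak.
    a = pstn_audio - np.mean(pstn_audio)
    b = ip_audio - np.mean(ip_audio)
    a = a / (np.linalg.norm(a) + 1e-12)
    b = b / (np.linalg.norm(b) + 1e-12)
    # For mode="full", output index i corresponds to sample lag
    # i - (len(b) - 1); the peak gives the best alignment.
    xcorr = np.correlate(a, b, mode="full")
    lag = int(np.argmax(xcorr)) - (len(b) - 1)
    # Positive lag: the PSTN stream is delayed relative to the IP stream.
    return lag / sample_rate

The feature-based variants described above would presumably correlate sequences of distortion-tolerant features (e.g., per-frame MFCC vectors or spectrogram columns) rather than raw samples, which is what confers robustness to codec distortion, packet loss, and overlapping speakers.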