學術產出-學位論文

文章檢視/開啟

書目匯出

Google ScholarTM

政大圖書館

引文資訊

TAIR相關學術產出

題名 大型網路語音會談中回音消除方法
Echo Cancellation In Large-Scale VoIP Conferencing
作者 祁立誠
Chi, Li-Chen
貢獻者 連耀南
Lien, Yao-Nan
祁立誠
Chi, Li-Chen
關鍵詞 回音消除
網路語音會談
語音動態偵測
echo cancellation
VoIP
VAD
日期 2009
上傳時間 9-四月-2010 13:24:02 (UTC+8)
摘要 隨著網路技術的發展,目前網路電話(VoIP)已有逐漸取代傳統電話的趨勢。尤其能夠允許多人同時在線上進行會談是其最大的優勢之一。但在多人參與網路會談時,因為聲音在空間中傳遞或反射等因素,使得由喇叭發出的聲音再次被麥克風收回,造成回音的產生。會談中只要有一位使用者的裝置發生回音時,回音訊號就會在與會者之間擴散,使得所有使用者均會受到影響,進而嚴重影響網路通話的進行。此狀況在參與會談人數越多時,發生機率越高,且對通話品質影響越嚴重。
傳統電話在一對一通話時,通常使用遠端回音消除機制(Near End Echo Canceller),由接收端在接收聲音後先暫存在記憶體中再播放,再將麥克風擷取的聲音與事先暫存的訊號反向後混合,以抵銷回音。網路會談的環境下,由於沒有標準的聽筒設備,使得回音發生的時間難以預估。且多人參與的網路會談中,由於收聽者所聽到的聲音可能混合多個使用者說話的聲音與回音,使得回音訊號難以偵測。另外,由於網路傳輸的特性,回音訊號到達的時間與順序都難以預估,這使得回音消除機制在多人網路回談中經常失效。
本研究提出藉由語音動態偵測(Voice Activity Detection-VAD)的方式分辨回音訊號,藉由本研究所提出的語音能量VAD判定機制,能夠有效區別正常語音與回音的差異,即可有效的消除回音,同時發揮靜音抑制(Slience Suppression)的效果,阻擋不含語音內容的封包,降低網路頻寬耗用。本研究以自行開發的VoIP軟體進行實地測試實驗,實驗中顯示,我們的方法能消除85%以上的回音。
With the prosperous development of Internet technology, traditional phone service is being replaced gradually byVoice-over-IP (VoIP) technology. One of the critical problem that is yet to be improved is the echo problem. Due to the difference in working environment, conventional echo cancellation technology may not work well on VoIP system. The echo problem is becoming more critical as the number of participants in a talk session increases. As long as one user fails to depress echos, every other participant in the conference will be infected. The more participant, the higher probability of echo infection.

We propose an energy based Voice Activity Detection (VAD) mechnism that effectively differentiate echo from speech signal. Our VAD algrouthm records a user’s speech volume, and based on this information to determine whether the frame is echo or not. By applying this mechnism to network conference, we can filter out echo frames and suppress slience at same time to save bandwidth consumption. We experimented on a self-developed VoIP software platform, the experiment result shows that our method can eliminate more than 85% of the echo.
參考文獻 [1] U.I. Choudhry, JongWon Kim, and Hong Kook Kim, "A Highly Adaptive Acoustic Echo Cancellation Solution for VoIP Conferencing Systems", IEEE International Conference on Computer Systems and Applications, 2006, pp. 433-436.
[2] G. S. Fang, "Voice Channel Echo Cancellation", IEEE Communications Magazine, Vol. 21, Issue 9, Dec. 1983, pp.11-14.
[3] Xiaohui Gu, Zhen Wen, Philip S. Yu, and Zon-Yin Shae, "peerTalk: A Peer-to-Peer Multi-Party Voice-Over-IP System", Parallel and Distributed Systems, IEEE Trans. on Publication, Vol. 19, No. 4, April 2008.
[4] Perry P. He, Roman A. Dyba, and Lucio F.C. Pessoa, "Network Echo Cancellers: Requirements, Applications and Solutions", AnalogZONE, 2004.
[5] Brant M. Helf, "Far end echo cancellation method and apparatus”, U.S. Patent 4,995,030, Feb. 19, 1991.
[6] M. Hiraguchi, "Full duplex modem having two echo cancellers for a near end echo and a far end echo", U.S. Patent 4,935,919, 19 Jun. 19, 1990.
[7] Yao-Nan Lien, Li-Cheng Chi and Yuh-Sheng Shaw, "A Walkie-Talkie-Like Emergency Communication System for Catastrophic Natural Disasters", Proc. of 10th International Symposium on Pervasive Systems, Algorithms and Networks (ISPAN09), Dec. 14-16, 2009.
[8] P. Marques, F. Sousa, and J. Leitao, "A DSP Based Long Distance Echo Canceller using Short Length Centered Adaptive Filters", Proc. of ICASSP, 1997.
[9] B. S. Nollet, and D. L. Jones, "Nonlinear Echo Cancellation For Hands-Free Speakerphones", Proc. of NSIP`97, Michigan USA, Sep. 1997.
[10] K. Ochiai, T. Araseki, and T. Ogihara, "Echo canceller with two echo path models", IEEE Trans. on Commun., Vol. COM-25, No. 6, June 1977, pp. 589-595.
[11] G. Periakarruppan, and H. A. Abdul-Rashid, "Packet based echo cancellation for VoIP networks", Computers and Electrical Engineering, Vol. 33, No. 2, 2007, pp. 139-148.
[12] Petr Pollak, Pavel Sovka, and Jan Uhlir, "Noise Sup-pression System for a Car", Proc. of the Third European Conference on Speech, Communication and Technology -EUROSPEECH`93, Berlin, Sep. 1993, pp. 1073-1076.
[13] R. V. Prasad, A. Sangwan, H. S. Jamadagni, and M. C. Chiranth, "Comparison of voice activity detection algorithms for voip”, Proc. of IEEE Symposium on Computer and Communications, July 2002, pp. 530-535.
[14] R. V. Prasad, R. Muralishhankar, S. Vijay, H. N. Shankar, P. Pawelczak, and I. Miemegeers, “Voice activity detection for VoIP-an information theoretic approach”, Proc. of IEEE Global Telecommunications Conference, 2006, pp. 1-6.
[15] I. Rassameeroj, and S. Tangwongsan, "Echo Cancellation in Voice over IP", Proc of 5th International Conference on Information Technology and Applications (ICITA), 2008, pp. 570-575.
[16] P. Renevey, and A. Drygajlo, “Entropy based voice activity detection in very noisy conditions”, Proc. of European Conference on Speech Communication and Technology (ISCA EUROSPEECH ’01), Sep. 2001, pp. 1887-1890.
[17] B. Widrow, and M. E. Hoff, "Adaptive switching circuits", IRE WESCON Convention Record part 4, 1960, pp. 96-104.
[18] B. Widrow, and SD Stearns, “Adaptive Signal Processing”, Prentice-Hall, Nglewood Cliffs, NJ, 1985.
[19] Echo cancellation, http://en.wikipedia.org/wiki/Echo_cancellation, Retrieved at November 11, 2009.
[20]Echo suppressor, http://en.wikipedia.org/wiki/Echo_suppressor, Retrieved at November 11, 2009.
[21] Mean Opinion Score, http://en.wikipedia.org/wiki/Mean_Opinion_Score, Retrieved at July 09, 2009.
描述 碩士
國立政治大學
資訊科學學系
96753013
98
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0096753013
資料類型 thesis
dc.contributor.advisor 連耀南zh_TW
dc.contributor.advisor Lien, Yao-Nanen_US
dc.contributor.author (作者) 祁立誠zh_TW
dc.contributor.author (作者) Chi, Li-Chenen_US
dc.creator (作者) 祁立誠zh_TW
dc.creator (作者) Chi, Li-Chenen_US
dc.date (日期) 2009en_US
dc.date.accessioned 9-四月-2010 13:24:02 (UTC+8)-
dc.date.available 9-四月-2010 13:24:02 (UTC+8)-
dc.date.issued (上傳時間) 9-四月-2010 13:24:02 (UTC+8)-
dc.identifier (其他 識別碼) G0096753013en_US
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/38543-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊科學學系zh_TW
dc.description (描述) 96753013zh_TW
dc.description (描述) 98zh_TW
dc.description.abstract (摘要) 隨著網路技術的發展,目前網路電話(VoIP)已有逐漸取代傳統電話的趨勢。尤其能夠允許多人同時在線上進行會談是其最大的優勢之一。但在多人參與網路會談時,因為聲音在空間中傳遞或反射等因素,使得由喇叭發出的聲音再次被麥克風收回,造成回音的產生。會談中只要有一位使用者的裝置發生回音時,回音訊號就會在與會者之間擴散,使得所有使用者均會受到影響,進而嚴重影響網路通話的進行。此狀況在參與會談人數越多時,發生機率越高,且對通話品質影響越嚴重。
傳統電話在一對一通話時,通常使用遠端回音消除機制(Near End Echo Canceller),由接收端在接收聲音後先暫存在記憶體中再播放,再將麥克風擷取的聲音與事先暫存的訊號反向後混合,以抵銷回音。網路會談的環境下,由於沒有標準的聽筒設備,使得回音發生的時間難以預估。且多人參與的網路會談中,由於收聽者所聽到的聲音可能混合多個使用者說話的聲音與回音,使得回音訊號難以偵測。另外,由於網路傳輸的特性,回音訊號到達的時間與順序都難以預估,這使得回音消除機制在多人網路回談中經常失效。
本研究提出藉由語音動態偵測(Voice Activity Detection-VAD)的方式分辨回音訊號,藉由本研究所提出的語音能量VAD判定機制,能夠有效區別正常語音與回音的差異,即可有效的消除回音,同時發揮靜音抑制(Slience Suppression)的效果,阻擋不含語音內容的封包,降低網路頻寬耗用。本研究以自行開發的VoIP軟體進行實地測試實驗,實驗中顯示,我們的方法能消除85%以上的回音。
zh_TW
dc.description.abstract (摘要) With the prosperous development of Internet technology, traditional phone service is being replaced gradually byVoice-over-IP (VoIP) technology. One of the critical problem that is yet to be improved is the echo problem. Due to the difference in working environment, conventional echo cancellation technology may not work well on VoIP system. The echo problem is becoming more critical as the number of participants in a talk session increases. As long as one user fails to depress echos, every other participant in the conference will be infected. The more participant, the higher probability of echo infection.

We propose an energy based Voice Activity Detection (VAD) mechnism that effectively differentiate echo from speech signal. Our VAD algrouthm records a user’s speech volume, and based on this information to determine whether the frame is echo or not. By applying this mechnism to network conference, we can filter out echo frames and suppress slience at same time to save bandwidth consumption. We experimented on a self-developed VoIP software platform, the experiment result shows that our method can eliminate more than 85% of the echo.
en_US
dc.description.tableofcontents 目錄
摘要 i
Abstract ii
誌謝辭 iii
目錄 iv
圖目錄 vi
表目錄 viii
第一章 簡介 1
1.1多人/大型網路語音會談 1
1.2網路會談的常見問題 1
1.3回音現象 4
1.4回音消除基本原理 6
1.5 VoIP中的回音 6
1.5.1單一回音產生者 8
1.5.2多個回音產生者 9
1.5.2 Proximity Problem造成的回音 11
1.6名詞定義 11
1.7研究動機與目的 11
第二章 背景與相關研究 13
2.1回音消除技術演進 13
2.2回音消除原理 13
2.2.1回音消除演算法 14
2.3回音消除方法分類 16
2.3.1 Listener Echo Cancellation 16
2.3.2 Listener Echo Cancellation失效原因 16
2.3.3 Talker Echo Cancellation 17
2.3.4 Talker Echo Cancellection的挑戰 17
2.4一對一VoIP回音消除機制 21
2.5總結 21
第三章 MET VAD靜音及回音消除機制 23
3.1需求分析及研究目標 23
3.2解決方法 23
3.3 VAD語音動態偵測 23
3.4系統架構 24
3.5細部設計 27
3.5.1聲音能量紀錄 28
3.5.2 LED VAD演算法 29
3.5.3 MET VAD演算法 30
第四章 效能分析 34
4.1實驗目的 34
4.2實驗設計 34
4.2.1以聲音樣本評比各種VAD演算法 34
4.2.2以網路會談實測MET VAD之效能 34
4.2.3 Proximity Problem的回音消除測試 34
4.3評估指標 34
4.3.1誤判率 35
4.3.2 MOS 35
4.4實驗一:以聲音樣本評比各種VAD 35
4.4.1實驗目標 35
4.4.2實驗環境 36
4.4.3實驗流程 37
4.4.4實驗結果分析 37
4.5實驗二:網路會談實測 47
4.5.1實驗環境 47
4.5.2實驗流程 47
4.5.3實驗結果分析 48
4.6實驗三:Proximity Problem的回音消除測試 53
4.6.1實驗目標 53
4.6.2實驗環境 53
4.6.3實驗結果分析 54
第五章 結論與未來研究方向 56
zh_TW
dc.format.extent 1061479 bytes-
dc.format.mimetype application/pdf-
dc.language.iso en_US-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0096753013en_US
dc.subject (關鍵詞) 回音消除zh_TW
dc.subject (關鍵詞) 網路語音會談zh_TW
dc.subject (關鍵詞) 語音動態偵測zh_TW
dc.subject (關鍵詞) echo cancellationen_US
dc.subject (關鍵詞) VoIPen_US
dc.subject (關鍵詞) VADen_US
dc.title (題名) 大型網路語音會談中回音消除方法zh_TW
dc.title (題名) Echo Cancellation In Large-Scale VoIP Conferencingen_US
dc.type (資料類型) thesisen
dc.relation.reference (參考文獻) [1] U.I. Choudhry, JongWon Kim, and Hong Kook Kim, "A Highly Adaptive Acoustic Echo Cancellation Solution for VoIP Conferencing Systems", IEEE International Conference on Computer Systems and Applications, 2006, pp. 433-436.zh_TW
dc.relation.reference (參考文獻) [2] G. S. Fang, "Voice Channel Echo Cancellation", IEEE Communications Magazine, Vol. 21, Issue 9, Dec. 1983, pp.11-14.zh_TW
dc.relation.reference (參考文獻) [3] Xiaohui Gu, Zhen Wen, Philip S. Yu, and Zon-Yin Shae, "peerTalk: A Peer-to-Peer Multi-Party Voice-Over-IP System", Parallel and Distributed Systems, IEEE Trans. on Publication, Vol. 19, No. 4, April 2008.zh_TW
dc.relation.reference (參考文獻) [4] Perry P. He, Roman A. Dyba, and Lucio F.C. Pessoa, "Network Echo Cancellers: Requirements, Applications and Solutions", AnalogZONE, 2004.zh_TW
dc.relation.reference (參考文獻) [5] Brant M. Helf, "Far end echo cancellation method and apparatus”, U.S. Patent 4,995,030, Feb. 19, 1991.zh_TW
dc.relation.reference (參考文獻) [6] M. Hiraguchi, "Full duplex modem having two echo cancellers for a near end echo and a far end echo", U.S. Patent 4,935,919, 19 Jun. 19, 1990.zh_TW
dc.relation.reference (參考文獻) [7] Yao-Nan Lien, Li-Cheng Chi and Yuh-Sheng Shaw, "A Walkie-Talkie-Like Emergency Communication System for Catastrophic Natural Disasters", Proc. of 10th International Symposium on Pervasive Systems, Algorithms and Networks (ISPAN09), Dec. 14-16, 2009.zh_TW
dc.relation.reference (參考文獻) [8] P. Marques, F. Sousa, and J. Leitao, "A DSP Based Long Distance Echo Canceller using Short Length Centered Adaptive Filters", Proc. of ICASSP, 1997.zh_TW
dc.relation.reference (參考文獻) [9] B. S. Nollet, and D. L. Jones, "Nonlinear Echo Cancellation For Hands-Free Speakerphones", Proc. of NSIP`97, Michigan USA, Sep. 1997.zh_TW
dc.relation.reference (參考文獻) [10] K. Ochiai, T. Araseki, and T. Ogihara, "Echo canceller with two echo path models", IEEE Trans. on Commun., Vol. COM-25, No. 6, June 1977, pp. 589-595.zh_TW
dc.relation.reference (參考文獻) [11] G. Periakarruppan, and H. A. Abdul-Rashid, "Packet based echo cancellation for VoIP networks", Computers and Electrical Engineering, Vol. 33, No. 2, 2007, pp. 139-148.zh_TW
dc.relation.reference (參考文獻) [12] Petr Pollak, Pavel Sovka, and Jan Uhlir, "Noise Sup-pression System for a Car", Proc. of the Third European Conference on Speech, Communication and Technology -EUROSPEECH`93, Berlin, Sep. 1993, pp. 1073-1076.zh_TW
dc.relation.reference (參考文獻) [13] R. V. Prasad, A. Sangwan, H. S. Jamadagni, and M. C. Chiranth, "Comparison of voice activity detection algorithms for voip”, Proc. of IEEE Symposium on Computer and Communications, July 2002, pp. 530-535.zh_TW
dc.relation.reference (參考文獻) [14] R. V. Prasad, R. Muralishhankar, S. Vijay, H. N. Shankar, P. Pawelczak, and I. Miemegeers, “Voice activity detection for VoIP-an information theoretic approach”, Proc. of IEEE Global Telecommunications Conference, 2006, pp. 1-6.zh_TW
dc.relation.reference (參考文獻) [15] I. Rassameeroj, and S. Tangwongsan, "Echo Cancellation in Voice over IP", Proc of 5th International Conference on Information Technology and Applications (ICITA), 2008, pp. 570-575.zh_TW
dc.relation.reference (參考文獻) [16] P. Renevey, and A. Drygajlo, “Entropy based voice activity detection in very noisy conditions”, Proc. of European Conference on Speech Communication and Technology (ISCA EUROSPEECH ’01), Sep. 2001, pp. 1887-1890.zh_TW
dc.relation.reference (參考文獻) [17] B. Widrow, and M. E. Hoff, "Adaptive switching circuits", IRE WESCON Convention Record part 4, 1960, pp. 96-104.zh_TW
dc.relation.reference (參考文獻) [18] B. Widrow, and SD Stearns, “Adaptive Signal Processing”, Prentice-Hall, Nglewood Cliffs, NJ, 1985.zh_TW
dc.relation.reference (參考文獻) [19] Echo cancellation, http://en.wikipedia.org/wiki/Echo_cancellation, Retrieved at November 11, 2009.zh_TW
dc.relation.reference (參考文獻) [20]Echo suppressor, http://en.wikipedia.org/wiki/Echo_suppressor, Retrieved at November 11, 2009.zh_TW
dc.relation.reference (參考文獻) [21] Mean Opinion Score, http://en.wikipedia.org/wiki/Mean_Opinion_Score, Retrieved at July 09, 2009.zh_TW