學術產出-學位論文
題名 | 大型網路語音會談中回音消除方法 Echo Cancellation In Large-Scale VoIP Conferencing |
作者 | 祁立誠 Chi, Li-Chen |
貢獻者 | 連耀南 Lien, Yao-Nan 祁立誠 Chi, Li-Chen |
關鍵詞 | 回音消除 網路語音會談 語音動態偵測 echo cancellation VoIP VAD |
日期 | 2009 |
上傳時間 | 9-四月-2010 13:24:02 (UTC+8) |
摘要 | 隨著網路技術的發展,目前網路電話(VoIP)已有逐漸取代傳統電話的趨勢。尤其能夠允許多人同時在線上進行會談是其最大的優勢之一。但在多人參與網路會談時,因為聲音在空間中傳遞或反射等因素,使得由喇叭發出的聲音再次被麥克風收回,造成回音的產生。會談中只要有一位使用者的裝置發生回音時,回音訊號就會在與會者之間擴散,使得所有使用者均會受到影響,進而嚴重影響網路通話的進行。此狀況在參與會談人數越多時,發生機率越高,且對通話品質影響越嚴重。 傳統電話在一對一通話時,通常使用遠端回音消除機制(Near End Echo Canceller),由接收端在接收聲音後先暫存在記憶體中再播放,再將麥克風擷取的聲音與事先暫存的訊號反向後混合,以抵銷回音。網路會談的環境下,由於沒有標準的聽筒設備,使得回音發生的時間難以預估。且多人參與的網路會談中,由於收聽者所聽到的聲音可能混合多個使用者說話的聲音與回音,使得回音訊號難以偵測。另外,由於網路傳輸的特性,回音訊號到達的時間與順序都難以預估,這使得回音消除機制在多人網路回談中經常失效。 本研究提出藉由語音動態偵測(Voice Activity Detection-VAD)的方式分辨回音訊號,藉由本研究所提出的語音能量VAD判定機制,能夠有效區別正常語音與回音的差異,即可有效的消除回音,同時發揮靜音抑制(Slience Suppression)的效果,阻擋不含語音內容的封包,降低網路頻寬耗用。本研究以自行開發的VoIP軟體進行實地測試實驗,實驗中顯示,我們的方法能消除85%以上的回音。 With the prosperous development of Internet technology, traditional phone service is being replaced gradually byVoice-over-IP (VoIP) technology. One of the critical problem that is yet to be improved is the echo problem. Due to the difference in working environment, conventional echo cancellation technology may not work well on VoIP system. The echo problem is becoming more critical as the number of participants in a talk session increases. As long as one user fails to depress echos, every other participant in the conference will be infected. The more participant, the higher probability of echo infection. We propose an energy based Voice Activity Detection (VAD) mechnism that effectively differentiate echo from speech signal. Our VAD algrouthm records a user’s speech volume, and based on this information to determine whether the frame is echo or not. By applying this mechnism to network conference, we can filter out echo frames and suppress slience at same time to save bandwidth consumption. We experimented on a self-developed VoIP software platform, the experiment result shows that our method can eliminate more than 85% of the echo. |
參考文獻 | [1] U.I. Choudhry, JongWon Kim, and Hong Kook Kim, "A Highly Adaptive Acoustic Echo Cancellation Solution for VoIP Conferencing Systems", IEEE International Conference on Computer Systems and Applications, 2006, pp. 433-436. [2] G. S. Fang, "Voice Channel Echo Cancellation", IEEE Communications Magazine, Vol. 21, Issue 9, Dec. 1983, pp.11-14. [3] Xiaohui Gu, Zhen Wen, Philip S. Yu, and Zon-Yin Shae, "peerTalk: A Peer-to-Peer Multi-Party Voice-Over-IP System", Parallel and Distributed Systems, IEEE Trans. on Publication, Vol. 19, No. 4, April 2008. [4] Perry P. He, Roman A. Dyba, and Lucio F.C. Pessoa, "Network Echo Cancellers: Requirements, Applications and Solutions", AnalogZONE, 2004. [5] Brant M. Helf, "Far end echo cancellation method and apparatus”, U.S. Patent 4,995,030, Feb. 19, 1991. [6] M. Hiraguchi, "Full duplex modem having two echo cancellers for a near end echo and a far end echo", U.S. Patent 4,935,919, 19 Jun. 19, 1990. [7] Yao-Nan Lien, Li-Cheng Chi and Yuh-Sheng Shaw, "A Walkie-Talkie-Like Emergency Communication System for Catastrophic Natural Disasters", Proc. of 10th International Symposium on Pervasive Systems, Algorithms and Networks (ISPAN09), Dec. 14-16, 2009. [8] P. Marques, F. Sousa, and J. Leitao, "A DSP Based Long Distance Echo Canceller using Short Length Centered Adaptive Filters", Proc. of ICASSP, 1997. [9] B. S. Nollet, and D. L. Jones, "Nonlinear Echo Cancellation For Hands-Free Speakerphones", Proc. of NSIP`97, Michigan USA, Sep. 1997. [10] K. Ochiai, T. Araseki, and T. Ogihara, "Echo canceller with two echo path models", IEEE Trans. on Commun., Vol. COM-25, No. 6, June 1977, pp. 589-595. [11] G. Periakarruppan, and H. A. Abdul-Rashid, "Packet based echo cancellation for VoIP networks", Computers and Electrical Engineering, Vol. 33, No. 2, 2007, pp. 139-148. [12] Petr Pollak, Pavel Sovka, and Jan Uhlir, "Noise Sup-pression System for a Car", Proc. of the Third European Conference on Speech, Communication and Technology -EUROSPEECH`93, Berlin, Sep. 1993, pp. 1073-1076. [13] R. V. Prasad, A. Sangwan, H. S. Jamadagni, and M. C. Chiranth, "Comparison of voice activity detection algorithms for voip”, Proc. of IEEE Symposium on Computer and Communications, July 2002, pp. 530-535. [14] R. V. Prasad, R. Muralishhankar, S. Vijay, H. N. Shankar, P. Pawelczak, and I. Miemegeers, “Voice activity detection for VoIP-an information theoretic approach”, Proc. of IEEE Global Telecommunications Conference, 2006, pp. 1-6. [15] I. Rassameeroj, and S. Tangwongsan, "Echo Cancellation in Voice over IP", Proc of 5th International Conference on Information Technology and Applications (ICITA), 2008, pp. 570-575. [16] P. Renevey, and A. Drygajlo, “Entropy based voice activity detection in very noisy conditions”, Proc. of European Conference on Speech Communication and Technology (ISCA EUROSPEECH ’01), Sep. 2001, pp. 1887-1890. [17] B. Widrow, and M. E. Hoff, "Adaptive switching circuits", IRE WESCON Convention Record part 4, 1960, pp. 96-104. [18] B. Widrow, and SD Stearns, “Adaptive Signal Processing”, Prentice-Hall, Nglewood Cliffs, NJ, 1985. [19] Echo cancellation, http://en.wikipedia.org/wiki/Echo_cancellation, Retrieved at November 11, 2009. [20]Echo suppressor, http://en.wikipedia.org/wiki/Echo_suppressor, Retrieved at November 11, 2009. [21] Mean Opinion Score, http://en.wikipedia.org/wiki/Mean_Opinion_Score, Retrieved at July 09, 2009. |
描述 | 碩士 國立政治大學 資訊科學學系 96753013 98 |
資料來源 | http://thesis.lib.nccu.edu.tw/record/#G0096753013 |
資料類型 | thesis |
dc.contributor.advisor | 連耀南 | zh_TW |
dc.contributor.advisor | Lien, Yao-Nan | en_US |
dc.contributor.author (作者) | 祁立誠 | zh_TW |
dc.contributor.author (作者) | Chi, Li-Chen | en_US |
dc.creator (作者) | 祁立誠 | zh_TW |
dc.creator (作者) | Chi, Li-Chen | en_US |
dc.date (日期) | 2009 | en_US |
dc.date.accessioned | 9-四月-2010 13:24:02 (UTC+8) | - |
dc.date.available | 9-四月-2010 13:24:02 (UTC+8) | - |
dc.date.issued (上傳時間) | 9-四月-2010 13:24:02 (UTC+8) | - |
dc.identifier (其他 識別碼) | G0096753013 | en_US |
dc.identifier.uri (URI) | http://nccur.lib.nccu.edu.tw/handle/140.119/38543 | - |
dc.description (描述) | 碩士 | zh_TW |
dc.description (描述) | 國立政治大學 | zh_TW |
dc.description (描述) | 資訊科學學系 | zh_TW |
dc.description (描述) | 96753013 | zh_TW |
dc.description (描述) | 98 | zh_TW |
dc.description.abstract (摘要) | 隨著網路技術的發展,目前網路電話(VoIP)已有逐漸取代傳統電話的趨勢。尤其能夠允許多人同時在線上進行會談是其最大的優勢之一。但在多人參與網路會談時,因為聲音在空間中傳遞或反射等因素,使得由喇叭發出的聲音再次被麥克風收回,造成回音的產生。會談中只要有一位使用者的裝置發生回音時,回音訊號就會在與會者之間擴散,使得所有使用者均會受到影響,進而嚴重影響網路通話的進行。此狀況在參與會談人數越多時,發生機率越高,且對通話品質影響越嚴重。 傳統電話在一對一通話時,通常使用遠端回音消除機制(Near End Echo Canceller),由接收端在接收聲音後先暫存在記憶體中再播放,再將麥克風擷取的聲音與事先暫存的訊號反向後混合,以抵銷回音。網路會談的環境下,由於沒有標準的聽筒設備,使得回音發生的時間難以預估。且多人參與的網路會談中,由於收聽者所聽到的聲音可能混合多個使用者說話的聲音與回音,使得回音訊號難以偵測。另外,由於網路傳輸的特性,回音訊號到達的時間與順序都難以預估,這使得回音消除機制在多人網路回談中經常失效。 本研究提出藉由語音動態偵測(Voice Activity Detection-VAD)的方式分辨回音訊號,藉由本研究所提出的語音能量VAD判定機制,能夠有效區別正常語音與回音的差異,即可有效的消除回音,同時發揮靜音抑制(Slience Suppression)的效果,阻擋不含語音內容的封包,降低網路頻寬耗用。本研究以自行開發的VoIP軟體進行實地測試實驗,實驗中顯示,我們的方法能消除85%以上的回音。 | zh_TW |
dc.description.abstract (摘要) | With the prosperous development of Internet technology, traditional phone service is being replaced gradually byVoice-over-IP (VoIP) technology. One of the critical problem that is yet to be improved is the echo problem. Due to the difference in working environment, conventional echo cancellation technology may not work well on VoIP system. The echo problem is becoming more critical as the number of participants in a talk session increases. As long as one user fails to depress echos, every other participant in the conference will be infected. The more participant, the higher probability of echo infection. We propose an energy based Voice Activity Detection (VAD) mechnism that effectively differentiate echo from speech signal. Our VAD algrouthm records a user’s speech volume, and based on this information to determine whether the frame is echo or not. By applying this mechnism to network conference, we can filter out echo frames and suppress slience at same time to save bandwidth consumption. We experimented on a self-developed VoIP software platform, the experiment result shows that our method can eliminate more than 85% of the echo. | en_US |
dc.description.tableofcontents | 目錄 摘要 i Abstract ii 誌謝辭 iii 目錄 iv 圖目錄 vi 表目錄 viii 第一章 簡介 1 1.1多人/大型網路語音會談 1 1.2網路會談的常見問題 1 1.3回音現象 4 1.4回音消除基本原理 6 1.5 VoIP中的回音 6 1.5.1單一回音產生者 8 1.5.2多個回音產生者 9 1.5.2 Proximity Problem造成的回音 11 1.6名詞定義 11 1.7研究動機與目的 11 第二章 背景與相關研究 13 2.1回音消除技術演進 13 2.2回音消除原理 13 2.2.1回音消除演算法 14 2.3回音消除方法分類 16 2.3.1 Listener Echo Cancellation 16 2.3.2 Listener Echo Cancellation失效原因 16 2.3.3 Talker Echo Cancellation 17 2.3.4 Talker Echo Cancellection的挑戰 17 2.4一對一VoIP回音消除機制 21 2.5總結 21 第三章 MET VAD靜音及回音消除機制 23 3.1需求分析及研究目標 23 3.2解決方法 23 3.3 VAD語音動態偵測 23 3.4系統架構 24 3.5細部設計 27 3.5.1聲音能量紀錄 28 3.5.2 LED VAD演算法 29 3.5.3 MET VAD演算法 30 第四章 效能分析 34 4.1實驗目的 34 4.2實驗設計 34 4.2.1以聲音樣本評比各種VAD演算法 34 4.2.2以網路會談實測MET VAD之效能 34 4.2.3 Proximity Problem的回音消除測試 34 4.3評估指標 34 4.3.1誤判率 35 4.3.2 MOS 35 4.4實驗一:以聲音樣本評比各種VAD 35 4.4.1實驗目標 35 4.4.2實驗環境 36 4.4.3實驗流程 37 4.4.4實驗結果分析 37 4.5實驗二:網路會談實測 47 4.5.1實驗環境 47 4.5.2實驗流程 47 4.5.3實驗結果分析 48 4.6實驗三:Proximity Problem的回音消除測試 53 4.6.1實驗目標 53 4.6.2實驗環境 53 4.6.3實驗結果分析 54 第五章 結論與未來研究方向 56 | zh_TW |
dc.format.extent | 1061479 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en_US | - |
dc.source.uri (資料來源) | http://thesis.lib.nccu.edu.tw/record/#G0096753013 | en_US |
dc.subject (關鍵詞) | 回音消除 | zh_TW |
dc.subject (關鍵詞) | 網路語音會談 | zh_TW |
dc.subject (關鍵詞) | 語音動態偵測 | zh_TW |
dc.subject (關鍵詞) | echo cancellation | en_US |
dc.subject (關鍵詞) | VoIP | en_US |
dc.subject (關鍵詞) | VAD | en_US |
dc.title (題名) | 大型網路語音會談中回音消除方法 | zh_TW |
dc.title (題名) | Echo Cancellation In Large-Scale VoIP Conferencing | en_US |
dc.type (資料類型) | thesis | en |
dc.relation.reference (參考文獻) | [1] U.I. Choudhry, JongWon Kim, and Hong Kook Kim, "A Highly Adaptive Acoustic Echo Cancellation Solution for VoIP Conferencing Systems", IEEE International Conference on Computer Systems and Applications, 2006, pp. 433-436. | zh_TW |
dc.relation.reference (參考文獻) | [2] G. S. Fang, "Voice Channel Echo Cancellation", IEEE Communications Magazine, Vol. 21, Issue 9, Dec. 1983, pp.11-14. | zh_TW |
dc.relation.reference (參考文獻) | [3] Xiaohui Gu, Zhen Wen, Philip S. Yu, and Zon-Yin Shae, "peerTalk: A Peer-to-Peer Multi-Party Voice-Over-IP System", Parallel and Distributed Systems, IEEE Trans. on Publication, Vol. 19, No. 4, April 2008. | zh_TW |
dc.relation.reference (參考文獻) | [4] Perry P. He, Roman A. Dyba, and Lucio F.C. Pessoa, "Network Echo Cancellers: Requirements, Applications and Solutions", AnalogZONE, 2004. | zh_TW |
dc.relation.reference (參考文獻) | [5] Brant M. Helf, "Far end echo cancellation method and apparatus”, U.S. Patent 4,995,030, Feb. 19, 1991. | zh_TW |
dc.relation.reference (參考文獻) | [6] M. Hiraguchi, "Full duplex modem having two echo cancellers for a near end echo and a far end echo", U.S. Patent 4,935,919, 19 Jun. 19, 1990. | zh_TW |
dc.relation.reference (參考文獻) | [7] Yao-Nan Lien, Li-Cheng Chi and Yuh-Sheng Shaw, "A Walkie-Talkie-Like Emergency Communication System for Catastrophic Natural Disasters", Proc. of 10th International Symposium on Pervasive Systems, Algorithms and Networks (ISPAN09), Dec. 14-16, 2009. | zh_TW |
dc.relation.reference (參考文獻) | [8] P. Marques, F. Sousa, and J. Leitao, "A DSP Based Long Distance Echo Canceller using Short Length Centered Adaptive Filters", Proc. of ICASSP, 1997. | zh_TW |
dc.relation.reference (參考文獻) | [9] B. S. Nollet, and D. L. Jones, "Nonlinear Echo Cancellation For Hands-Free Speakerphones", Proc. of NSIP`97, Michigan USA, Sep. 1997. | zh_TW |
dc.relation.reference (參考文獻) | [10] K. Ochiai, T. Araseki, and T. Ogihara, "Echo canceller with two echo path models", IEEE Trans. on Commun., Vol. COM-25, No. 6, June 1977, pp. 589-595. | zh_TW |
dc.relation.reference (參考文獻) | [11] G. Periakarruppan, and H. A. Abdul-Rashid, "Packet based echo cancellation for VoIP networks", Computers and Electrical Engineering, Vol. 33, No. 2, 2007, pp. 139-148. | zh_TW |
dc.relation.reference (參考文獻) | [12] Petr Pollak, Pavel Sovka, and Jan Uhlir, "Noise Sup-pression System for a Car", Proc. of the Third European Conference on Speech, Communication and Technology -EUROSPEECH`93, Berlin, Sep. 1993, pp. 1073-1076. | zh_TW |
dc.relation.reference (參考文獻) | [13] R. V. Prasad, A. Sangwan, H. S. Jamadagni, and M. C. Chiranth, "Comparison of voice activity detection algorithms for voip”, Proc. of IEEE Symposium on Computer and Communications, July 2002, pp. 530-535. | zh_TW |
dc.relation.reference (參考文獻) | [14] R. V. Prasad, R. Muralishhankar, S. Vijay, H. N. Shankar, P. Pawelczak, and I. Miemegeers, “Voice activity detection for VoIP-an information theoretic approach”, Proc. of IEEE Global Telecommunications Conference, 2006, pp. 1-6. | zh_TW |
dc.relation.reference (參考文獻) | [15] I. Rassameeroj, and S. Tangwongsan, "Echo Cancellation in Voice over IP", Proc of 5th International Conference on Information Technology and Applications (ICITA), 2008, pp. 570-575. | zh_TW |
dc.relation.reference (參考文獻) | [16] P. Renevey, and A. Drygajlo, “Entropy based voice activity detection in very noisy conditions”, Proc. of European Conference on Speech Communication and Technology (ISCA EUROSPEECH ’01), Sep. 2001, pp. 1887-1890. | zh_TW |
dc.relation.reference (參考文獻) | [17] B. Widrow, and M. E. Hoff, "Adaptive switching circuits", IRE WESCON Convention Record part 4, 1960, pp. 96-104. | zh_TW |
dc.relation.reference (參考文獻) | [18] B. Widrow, and SD Stearns, “Adaptive Signal Processing”, Prentice-Hall, Nglewood Cliffs, NJ, 1985. | zh_TW |
dc.relation.reference (參考文獻) | [19] Echo cancellation, http://en.wikipedia.org/wiki/Echo_cancellation, Retrieved at November 11, 2009. | zh_TW |
dc.relation.reference (參考文獻) | [20]Echo suppressor, http://en.wikipedia.org/wiki/Echo_suppressor, Retrieved at November 11, 2009. | zh_TW |
dc.relation.reference (參考文獻) | [21] Mean Opinion Score, http://en.wikipedia.org/wiki/Mean_Opinion_Score, Retrieved at July 09, 2009. | zh_TW |