dc.contributor.advisor | 李蔡彥 | zh_TW |
dc.contributor.advisor | Li, Tsai-Yen | en_US |
dc.contributor.author (Authors) | 廖峻鋒 | zh_TW |
dc.contributor.author (Authors) | Liao, Chun-Feng | en_US |
dc.creator (Author) | 廖峻鋒 | zh_TW |
dc.creator (Author) | Liao, Chun-Feng | en_US |
dc.date (Date) | 2003 | en_US |
dc.date.accessioned | 17-Sep-2009 14:06:12 (UTC+8) | - |
dc.date.available | 17-Sep-2009 14:06:12 (UTC+8) | - |
dc.date.issued (Upload Time) | 17-Sep-2009 14:06:12 (UTC+8) | - |
dc.identifier (Other Identifiers) | G0917530041 | en_US |
dc.identifier.uri (URI) | https://nccur.lib.nccu.edu.tw/handle/140.119/32707 | - |
dc.description (Description) | Master's | zh_TW |
dc.description (Description) | National Chengchi University | zh_TW |
dc.description (Description) | Department of Computer Science | zh_TW |
dc.description (Description) | 91753004 | zh_TW |
dc.description (Description) | 92 | zh_TW |
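dc.description.abstract (Abstract) | In recent years, applications of 3D virtual environments and voice user interfaces (VUIs) on personal computers have received growing attention. Speech is the most natural form of human communication, so adding a voice interface to a virtual environment makes interaction between characters more fluid. Although much recent research has been devoted to integrating 3D virtual environments with voice interfaces, effective solutions to related problems such as dialog management in multi-user environments have been lacking. The main purpose of this research is to solve the problems of voice interface integration and dialog management and to realize a voice interaction mechanism for multi-user virtual environments. Addressing the synchronization of speech and animation in virtual environments, the dialog management mechanism, and speech processing in multi-user environments, we design XAML-V (eXtensible Animation Markup Language – Voice Extension), a language based on VoiceXML, and verify the feasibility and effectiveness of its implementation in a multi-user virtual environment system. | zh_TW |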
dc.description.abstract (Abstract) | Applications of 3D virtual environments and voice user interfaces (VUIs) on personal computers have received significant attention in recent years. Because speech is the most natural means of human communication, incorporating a VUI into a virtual environment can enhance user interaction and immersion. Although many studies have addressed the integration of VUIs with 3D virtual environments, most of the proposed solutions do not provide an effective mechanism for multi-user dialog management. The objective of this research is to provide a solution for VUI integration and dialog management and to realize such a mechanism in a multi-user virtual environment. We have designed a dialog scripting language called XAML-V (eXtensible Animation Markup Language – Voice Extension), based on the VoiceXML standard, to address synchronization between the VUI and animation as well as dialog management for multi-user interaction. We have also implemented this language in a multi-user virtual environment to evaluate the effectiveness of the design. | en_US |
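dc.description.tableofcontents | Chapter 1 Introduction; 1.1 Research Motivation; 1.1.1 Synchronizing the Speech System with Character Animation; 1.1.2 The Need for an Appropriate Dialog Management Mechanism; 1.1.3 Thread Control and Performance; 1.1.4 Dialog and Speech Propagation in Multi-user Virtual Environments; 1.2 Research Objectives; 1.3 Contributions of This Thesis; 1.4 Research Limitations; 1.5 Thesis Organization; Chapter 2 Related Work; 2.1 Multi-user Virtual Environment Systems; 2.2 Scripted Animation Languages; 2.3 Speech Synthesis and Speech Recognition Technologies; 2.3.1 Speech Synthesis; 2.3.2 Speech Recognition; 2.4 Integrating Speech with Virtual Environments; 2.4.1 Types of Integration Problems; 2.4.2 Research on Integration Problems; 2.4.3 Considerations When Integrating Virtual Environments with Voice Interfaces; 2.5 VoiceXML and Interactive Dialog Management; 2.5.1 VoiceXML and the FIA; 2.5.2 Research on Dialog Control Mechanisms; 2.6 Software Patterns; Chapter 3 Character Interaction and Dialog Management in Multi-user Virtual Environments; 3.1 Propagation of Animation and Speech Messages in the Virtual Environment; 3.1.1 Propagation of Scripts Between Clients; 3.1.2 Propagation of Speech Scripts; 3.2 Dialog Management in Multi-user Virtual Environments; 3.2.1 Dialog Management Architecture for Multi-user Virtual Environments; 3.2.2 Dialog Lock; 3.2.3 Protocol for Starting a Dialog; 3.2.4 Synchronization Issues in Dialog Negotiation in Multi-user Environments; 3.2.5 Protocol for Ending a Dialog; 3.3 Dialog Propagation in Multi-user Virtual Environments; 3.3.1 How Dialogs Proceed in Multi-user Virtual Environments; Chapter 4 XAML-V: A Dialog Management Scripting Language for Interactive Voice Interfaces; 4.1 The XAML Animation Scripting Language; 4.2 Overview of XAML-V (XAML Voice Extension); 4.2.1 Animation Integration Mechanism; 4.2.2 Dialog Negotiation Mechanism; 4.2.3 Dialog Broadcast Mechanism; 4.2.4 Proxy Request for Dialog Script Retrieval; 4.3 Dialog Management and Animation Integration in XAML-V; 4.4 The XAML-V Dialog Negotiation Mechanism in Multi-user Virtual Environments; 4.4.1 Format of Messages Exchanged Between Client and Server; 4.4.2 The Dialog Negotiation Mechanism in XAML-V; 4.5 XAML-V Proxy Request for Dialog Script Retrieval; Chapter 5 System Implementation and Discussion; 5.1 XAML-V Platform System Architecture Design; 5.2 XAML-V Platform Component Design; 5.2.1 Integrating the Speech System with the Virtual Environment; 5.2.2 Protocol Framework; 5.2.3 XAML-V Interpreter; 5.2.4 Implementation of DialogLock; 5.2.5 Implementation of the User Input Mechanism (Input Device); 5.2.6 Using Software Patterns to Solve XAML-V Design Problems; 5.3 Main Components and Deployment of the XAML-V Platform; 5.4 Implementation Results of the XAML-V Platform; Chapter 6 Conclusions and Future Work; 6.1 Conclusions; 6.2 Future Research Directions; References; Appendix: XAML-V Document Type Definition (xaml-v.dtd) | zh_TW |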
dc.format.extent | 80928 bytes | - |
dc.format.extent | 107965 bytes | - |
dc.format.extent | 110536 bytes | - |
dc.format.extent | 156961 bytes | - |
dc.format.extent | 179197 bytes | - |
dc.format.extent | 835554 bytes | - |
dc.format.extent | 538240 bytes | - |
dc.format.extent | 667892 bytes | - |
dc.format.extent | 1986978 bytes | - |
dc.format.extent | 136600 bytes | - |
dc.format.extent | 147495 bytes | - |
dc.format.extent | 95873 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en_US | - |
dc.source.uri (Source) | http://thesis.lib.nccu.edu.tw/record/#G0917530041 | en_US |
dc.subject (Keywords) | Virtual Environment (虛擬環境) | zh_TW |
dc.subject (Keywords) | Dialog Management (對話管理) | zh_TW |
dc.subject (Keywords) | Speech (語音) | zh_TW |
dc.subject (Keywords) | Virtual Environment | en_US |
dc.subject (Keywords) | VoiceXML | en_US |
dc.subject (Keywords) | Dialog Management | en_US |
dc.subject (Keywords) | Speech | en_US |
dc.title (Title) | 多人虛擬環境中互動式語音界面的實現 | zh_TW |
dc.title (Title) | Realizing the Interactive Speech Interface in a Multi-user Virtual Environment | en_US |
dc.type (Type) | thesis | en |