dc.contributor.advisor | 劉吉軒 | zh_TW |
dc.contributor.advisor | Liu, Jyi Shane | en_US |
dc.contributor.author (Authors) | 洪培鈞 | zh_TW |
dc.contributor.author (Authors) | Hung, Pei Chun | en_US |
dc.creator (作者) | 洪培鈞 | zh_TW |
dc.creator (作者) | Hung, Pei Chun | en_US |
dc.date (日期) | 2006 | en_US |
dc.date.accessioned | 17-Sep-2009 14:03:28 (UTC+8) | - |
dc.date.available | 17-Sep-2009 14:03:28 (UTC+8) | - |
dc.date.issued (上傳時間) | 17-Sep-2009 14:03:28 (UTC+8) | - |
dc.identifier (Other Identifiers) | G0094753032 | en_US |
dc.identifier.uri (URI) | https://nccur.lib.nccu.edu.tw/handle/140.119/32685 | - |
dc.description (描述) | 碩士 | zh_TW |
dc.description (描述) | 國立政治大學 | zh_TW |
dc.description (描述) | 資訊科學學系 | zh_TW |
dc.description (描述) | 94753032 | zh_TW |
dc.description (描述) | 95 | zh_TW |
dc.description.abstract (摘要) | 對於大部分的人而言,寫作是一種深度的表達過程,需要詳盡而深入的描述能力及精準而嚴謹的語意呈現。對於非英語為母語的學習者(ESL/EFL)來說,英語寫作尤其是一個困難的過程,常常會因為單字、搭配字(collocation)、句型結構等方面的認知不足,或是受到母語認知的牽制影響,而造成用詞、語法、甚至語意的錯誤。 近年來,語料庫的發展帶來了豐富的語言資源,不僅可以對語言的使用提供許多統計分析上的資訊,也可以做為語言學習者在特定詞語使用上的參照對象。目前語料庫的詞語使用參照以concordance技術為主,提供以特定字詞為基準的上下文例句排列,這種功能對ESL/EFL學習者於寫作過程所提供的參照仍然相當有限;而對於不同的查詢條件,若沒有良善的輸入和比對機制,往往搜尋上的數量和例句品質無法滿足學習者的參照需求;除此之外,對於檢索結果語料庫也沒有評估與排序的概念,學習者往往需要在大量的資訊中篩選出有用的資訊;綜觀以上幾點,目前語料庫技術無法滿足不同程度的ESL/EFL寫作者在參照協助上的需求。 本研究提出一種文句推薦方法,針對ESL/EFL寫作者在寫作過程上的認知不足或不確定,從語料庫中尋找出可用的參照例句,進而提供針對性的協助。我們的文句推薦方法包含三個模組,第一模組是一個彈性的表達元素模組,能針對寫作過程中的語言資訊需求,以規定的表達元素呈現作者的認知需求。第二模組是一個檢索模組,指從語料庫中比對尋找符合作者語意需求的例句。第三模組是一個排序的模組,針對找出的例句,評估其符合作者語意需求的程度,並將之排序,以提升例句參照的使用效率。 本研究依據文句推薦方法實做雛形系統,以British National Corpus和科學人雜誌語料庫(Science America)當例句來源,並依據論文的客觀評估和學習者的問卷調查兩種評估方式來評量雛形系統的成效。經由上述兩種評估方式,本研究驗證雛形系統能針對ESL/EFL於寫作過程中給予一定程度的協助成效,證明本研究的文句推薦方法確實能達成寫作協助的既定目標。 | zh_TW |
dc.description.abstract (摘要) | For most people, writing is a process of profundity. It requires well description of capabilities and excellent semantic of representations. For ESL/EFL (English as a Second Language/English as a Foreign Language) learners, English writing is a particularly difficult process. They have error in words, syntax or semantics because their expressions are constrained by their mother tongue or by lacking of the awareness of vocabularies, collocation, or sentence-structures. In recent years, according to the development of corpus technique, we have rich resource in language. The resource can not only provide many statistical analyses of informations in language research but also can be used as a reference in the use of specific words for the learners. Current corpus technique use concordance which displays the words with their surrounding text to present sentences, but that approach provides very limited references in the writing process for ESL/EFL learners. Also considering the different inquired queries, if we have no well-defined input and retrieval module, the quantity and quality of results could not meet the demand for the learners. In addition, corpus technique doesn’t provide the ranking and evaluating skills to the retrieval results. As a result of that, learners often require filtering a large number of information to find something useful. In view of the above points, current corpus technique is unable to satisfy different degrees of ESL/EFL learners in the writing process. This paper presents a Sentence Recommendation Approach technique. With regard to the inadequate knowledge in the writing process for ESL/EFL learners, we search for useful sentences from the corpus and provide them to learners. Our Sentence Recommendation Approach technique has three modules: one is Expression Element Module. It allows the learners to use some expression elements to represent their demand. Another is Retrieval Module. It means the process of searching the corresponding sentences based on the expression elements in Expression Element Module. The last is Sort Module. It means the process of ranking the results derived from the Retrieval Module. Our research establishes the experimental system to verify the performance of our Sentence Recommendation Approach technique. We use British National Corpus and Science America Corpus for the source of sentences. The evaluation is divided into our objective evaluation and the questionnaire evaluation. Both of them prove that our experimental system does some favor in the writing process for ESL/EFL learners. In other words, our Sentence Recommendation Approach technique really helps the learners in the writing process. | en_US |
dc.description.tableofcontents | 第一章 緒論…11.1 英語寫作…11.1.1 辭典工具…21.1.2 語料庫協助工具…31.2 研究動機和目的…51.3 研究貢獻和論文結構…6第二章 背景知識和相關技術…7 2.1 相關研究…72.2 語料庫 (Corpus/Corpora)…82.3 語料庫語言學…92.4 詞性標籤 (Part-of-Speech)…92.5 Concordance…102.6 詞語搭配 (Collocation)…112.7 推薦式系統 (Recommender System)…152.8 多重序列排列 (Multiple sequence alignment (MSA))…162.8.1 Needleman-Wunsch algorithm…182.8.2 The Center Star algorithm…19第三章 文句推薦方法和實驗系統…213.1 文句推薦方法…213.2 實驗系統…233.2.1 表達元素模組…243.2.2 檢索模組…263.2.3 排序模組…273.2.4 案例流程…29 3.2.5 SAW系統查詢圖…30第四章 實驗分析討論…344.1 實驗測試資料…344.1.1 實驗語料庫…344.1.2 實驗比對系統…354.1.3 實驗案例來源…364.2 實驗評估方法…374.3 實驗指標…384.4 試題案例評估…414.4.1 量上的評估…414.4.2 單字片語的評估…444.4.3 子句和認知錯誤的評估…494.4.4 試題案例小結…524.5 問卷案例評估…524.5.1 受測者對於寫作協助的認知調查…564.5.2 有無使用系統的區別…574.5.3 使用系統前後的區別…634.5.4 問卷小結…664.6 學生作文範例…664.7 實驗分析小結…71第五章 結論和未來方向…725.1 結論…725.2 未來方向…73參考文獻…76附錄一 BNC POS…80附錄二 SA POS…83附錄三 SAW系統中POS和BNC、SA中POS置換規則…85附錄四 SAW系統中的MSA置換矩陣參數…86附錄五 SAW系統測試的關鍵字總攬…87附錄六 量上的評估實驗數據…88附錄七 單字片語的評估實驗數據…90附錄八 子句和認知錯誤的評估實驗數據…92附錄九 問卷調查 (英語字彙程度測試)…94附錄十 問卷調查 (認知調查部分)…95附錄十一 問卷調查 (所有試題部分)…96附錄十二 問卷調查 (SAW系統提供的參考例句(高符合度))…98附錄十三 問卷調查 (SAW系統提供的參考例句(低符合度))…119 | zh_TW |
dc.format.extent | 48508 bytes | - |
dc.format.extent | 67485 bytes | - |
dc.format.extent | 82282 bytes | - |
dc.format.extent | 94473 bytes | - |
dc.format.extent | 1144330 bytes | - |
dc.format.extent | 296202 bytes | - |
dc.format.extent | 563452 bytes | - |
dc.format.extent | 734552 bytes | - |
dc.format.extent | 112624 bytes | - |
dc.format.extent | 65897 bytes | - |
dc.format.extent | 599193 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en_US | - |
dc.source.uri (資料來源) | http://thesis.lib.nccu.edu.tw/record/#G0094753032 | en_US |
dc.subject (關鍵詞) | 寫作協助 | zh_TW |
dc.subject (關鍵詞) | 語料庫 | zh_TW |
dc.subject (關鍵詞) | 資訊檢索 | zh_TW |
dc.subject (關鍵詞) | 語索引 | zh_TW |
dc.subject (關鍵詞) | 推薦式系統 | zh_TW |
dc.subject (關鍵詞) | Writing Assistance | en_US |
dc.subject (關鍵詞) | Corpora | en_US |
dc.subject (關鍵詞) | Information Retrieval | en_US |
dc.subject (關鍵詞) | Concordance | en_US |
dc.subject (關鍵詞) | Recommender System | en_US |
dc.title (題名) | 以進階的文句推薦方式使用語料庫做為英文寫作之輔助 | zh_TW |
dc.title (題名) | A Sentence Recommendation Approach to Using Corpora for English Writing Assistance | en_US |
dc.type (資料類型) | thesis | en |
dc.relation.reference (參考文獻) | [1]http://tw.dictionary.yahoo.com/ 奇摩線上辭典 | zh_TW |
dc.relation.reference (參考文獻) | [2]http://www.dreye.com/tw/index.php/ 譯點通辭典 | zh_TW |
dc.relation.reference (參考文獻) | [3]http://www.kingsoft.com/ 金山詞霸 | zh_TW |
dc.relation.reference (參考文獻) | [4]http://www.collins.co.uk/ Collins Cobuild English | zh_TW |
dc.relation.reference (參考文獻) | Dictionary | zh_TW |
dc.relation.reference (參考文獻) | [5]http://www.collins.co.uk/corpus/CorpusSearch.aspx/ | zh_TW |
dc.relation.reference (參考文獻) | Collins Free Corpus | zh_TW |
dc.relation.reference (參考文獻) | [6]http://view.byu.edu/ VIEW system | zh_TW |
dc.relation.reference (參考文獻) | [7]http://elex.amu.edu.pl/~przemka/corpora/TOSCA- | zh_TW |
dc.relation.reference (參考文獻) | ICLE_tagset.htm TOSCAL | zh_TW |
dc.relation.reference (參考文獻) | [8]http://icame.uib.no/brown/bcm.html brown corpus | zh_TW |
dc.relation.reference (參考文獻) | [9]http://www.proftrans.com/TOEEL/ TOFEL 學習網 | zh_TW |
dc.relation.reference (參考文獻) | [10]http://www.lextutor.ca/ 1000 English Word Test | zh_TW |
dc.relation.reference (參考文獻) | [11] A. A. Aghbar, “Fixed Expressions in Written Texts: Implications for Assessing Sophistication,” East Lancing, MI: National Center for Research on Teacher Learning, ERIC Document Reproduction Service No. ED 352808, 1990. | zh_TW |
dc.relation.reference (參考文獻) | [12] A. B. M. Tsui, “ESL Teachers’ Questions and Corpus Evidence,” International Journal of Corpus Linguistics 10(3):335-356, 2005. | zh_TW |
dc.relation.reference (參考文獻) | [13] Adwait Ratnaparkhi, “A Maximum Entropy Part-Of-Speech Tagger,” Proceedings of Empirical Methods in Natural Language Processing Conference, Pennsylvania, USA, 1996. | zh_TW |
dc.relation.reference (參考文獻) | [14] Alexander Cruden, Cruden’s Concordance, Location:Lodon, 1769. | zh_TW |
dc.relation.reference (參考文獻) | [15] Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl, “Analysis of Recommendation Algorithms for E-Commerce,” Proceedings of ACM Conference on Electronic Commerce, Minneapolis, Minnesota, 2000. | zh_TW |
dc.relation.reference (參考文獻) | [16] Bengt Altenberg and Sylviane Granger,“The Grammatical and Lexical Pattern of MAKE in Native and Non-native Student Writing,” Applied Linguistics 22(2):173-195, 2001. | zh_TW |
dc.relation.reference (參考文獻) | [17] C. C. Shei and Helen Pain, “An ESL Writer’s Collocational Aid,” Computer Assisted Language Learning 13(2):167-182, 2000. | zh_TW |
dc.relation.reference (參考文獻) | [18] Chia-Chuan Li, “A Study of Collocational Error in ESL/EFL College Learners’ Writing,” Doctoral Dissertation, Department of Art, Ming Chuan University, Taiwan, 2005. | zh_TW |
dc.relation.reference (參考文獻) | [19] David Goldberg, David Nichols, B. M. Oki, and Douglas Terry, “Using Collaborative Filtering to Weave an Information Tapestry,” Communications of the ACM 35(12):61-71, 2000. | zh_TW |
dc.relation.reference (參考文獻) | [20] Helmut Schmid, “Probabilistic Part-of-Speech Tagging Using Decision Tree,” Proceedings of International Conference on New Methods in Language Processing, Manchester, UK, 1994. | zh_TW |
dc.relation.reference (參考文獻) | [21] Hyunsook Yoon, “An Investigation of Students’ Experiences with Corpus Technology in Second Language Academic Writing,” Doctoral Dissertation, Department of Philosophy, Ohio State University, USA, 2005. | zh_TW |
dc.relation.reference (參考文獻) | [22] I.de O’Sullivan and Angela Chambers, “Learners’ Writing Skills in French: Corpus Consultation and Learner Evaluation,” Journal of Second Language Writing 15(1):49-68, 2006. | zh_TW |
dc.relation.reference (參考文獻) | [23] J. Nattinger and J. DeCarrico, “Lexical Phrases and Language Teaching,” ELT Journal 49:87-90, 1995. | zh_TW |
dc.relation.reference (參考文獻) | [24] Jean-Jacques Weber, “A Concordance and Genre-informed Approach to ESP Essay Writing,” ELT Journal 55(1):14-20, 2001. | zh_TW |
dc.relation.reference (參考文獻) | [25] Johnnie-Johnson Hafernik, Dorothy Messerschmitt, and Stephanie Vandrick, “ESL in the Academy Today,” Education 116, 1996. | zh_TW |
dc.relation.reference (參考文獻) | [26] L. H. F. Lin, “Applying Information Technology to a Corpus of Student Report Writing to Help Students Write Better Reports,” Proceedings of ITMELT Conference, Kowloon, Hongkong, 1999. | zh_TW |
dc.relation.reference (參考文獻) | [27] M. Heilman, K. Collins-Thompson, J. Callan, and M. Eskenazi, “Classroom Success of an Intelligent Tutoring System for Lexical Practice and Reading Comprehension,” Proceedings of the Ninth International Conference on Spoken | zh_TW |
dc.relation.reference (參考文獻) | Language Processing, Pittsburgh, USA, 2006. | zh_TW |
dc.relation.reference (參考文獻) | [28] Mark Davies, “The Advantage of Using Relational Databases for Large Corpora,” International Journal of Corpus Linguistics 10(3):307-334, 2005. | zh_TW |
dc.relation.reference (參考文獻) | [29] Paul Nation, “Measuring Readiness for Simplified Material: A Test of the First 1000 Words of English,” In Simplification: Theory and Application RELC 31:193-203, 1993. | zh_TW |
dc.relation.reference (參考文獻) | [30] R. H. Shih, J. Y. Chiang, and J. F. Tien, “Part-of-speech Sequences and Distribution in a Learner Corpus of English,” Proceedings of Computational Linguistics Conference XIII (ROCLING XIII), Taipei, Taiwan, 2000. | zh_TW |
dc.relation.reference (參考文獻) | [31] Robert Ilson, Benson Morton, and Benson Evelyn, The BBI Dictionary of English Word Combinations, Amsterdam: John Benjamins Publishing Company, 1997. | zh_TW |
dc.relation.reference (參考文獻) | [32] S. B. Needleman and C. D. Wunsh, “A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins,” Journal of Molecular Biology 48:443-453, 1970. | zh_TW |
dc.relation.reference (參考文獻) | [33] S. M. Conrad, “The Importance of Corpus-based Research for Language Teachers,” System 27:1-18, 1999. | zh_TW |
dc.relation.reference (參考文獻) | [34] S. M. Conrad, “Will Corpus Linguistics Revolutionize Grammar Teaching in the 21st Century?” TESOL Quarterly 34(3):548-560, 2000. | zh_TW |
dc.relation.reference (參考文獻) | [35] Steffan Corley, Martin Corley, Frank Keller, Matthew Crocker, and Shari Trewin,“Finding Synyactic Structure in Unparsed Corpora: The Gsearch Corpus Query System,” Computers and the Humanities 35(2):81-94, 2001. | zh_TW |
dc.relation.reference (參考文獻) | [36] T. McEnery and A. Wilson, Corpus Linguistics, Edinburgh University Press, Edinburgh, 1996. | zh_TW |
dc.relation.reference (參考文獻) | [37] T. Saracevic, P. Kantor, A. Y. Chamis, and D. Tirvison, “A Study of Information Seeking and Retrieving: I. Background and Methodology,” Journal of the American Society for Information Science 39(3):161-176, 1988. | zh_TW |
dc.relation.reference (參考文獻) | [38] T. Saracevic, P. Kantor, A. Y. Chamis, and D. Tirvison, “A Study of Information Seeking and Retrieving: II. Users, Questions and Effectiveness,” Journal of the American Society for Information Science 39(3):177-196, 1988. | zh_TW |
dc.relation.reference (參考文獻) | [39] T. Saracevic, P. Kantor, A. Y. Chamis, and D. Tirvison, “A Study of Information Seeking and Retrieving: III. Searchers, Searches, and Overlap,” Journal of the American Society for Information Science 39(3):197-216, 1988. | zh_TW |
dc.relation.reference (參考文獻) | [40] Paul Resnick and H. R. Varian, “Recommender Systems,” Communication of the ACM 40(3):56-58, 1997. | zh_TW |
dc.relation.reference (參考文獻) | [41] Y. L. Francis, N. L. Ho, T. W. Lam, W. H. Wong, and M. Y. Chan, “Efficient Constrained Multiple Sequence Alignment with Performance Guarantee,” IEEE Computational Systems Bioinformatics 2:337-346, 2003. | zh_TW |
dc.relation.reference (參考文獻) | [42] Yu-Chih Sun, “Learning Process, Strategies and Web-based Concordancers: a Case Study,” British Journal of Educational Technology 34(5):601-613, 2003. | zh_TW |
dc.relation.reference (參考文獻) | [43] 林武聰,「線上英語學習環境」,碩士論文,電子與資訊工程研究所,國立雲林科技大學,2003。 | zh_TW |
dc.relation.reference (參考文獻) | [44] 陳相言,「多重序列排序應用於線上英語輔助學習—以Concordancer為例」,碩士論文,電子與資訊工程研究所,國立雲林科技大學,2003。 | zh_TW |