學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 Capturing errors in written Chinese words
作者 劉昭麟
Liu, Chao-Lin
貢獻者 ACL
國立政治大學資訊科學系
關鍵詞 Capturing errors;written Chinese words
algorithms; design; languages; modeling methodologies; natural language processing; performance
日期 2009-08
上傳時間 27-May-2010 16:47:37 (UTC+8)
摘要 A collection of 3208 reported errors of Chinese words were analyzed. Among which, 7.2% involved rarely used character, and 98.4% were assigned common classifications of their causes by human subjects. In particular, 80% of the errors observed in writings of middle school students were related to the pronunciations and 30% were related to the compositions of words. Experimental results show that using intuitive Web-based statistics helped us capture only about 75% of these errors. In a related task, the Web-based statistics are useful for recommending incorrect characters for composing test items for "incorrect character identification" tests about 93% of the time.
關聯 Proceedings of the Forty Seventh Annual Meeting of the Association for Computational Linguistics
資料類型 conference
dc.contributor ACLen_US
dc.contributor 國立政治大學資訊科學系en_US
dc.creator (作者) 劉昭麟zh_TW
dc.creator (作者) Liu, Chao-Lin-
dc.date (日期) 2009-08en_US
dc.date.accessioned 27-May-2010 16:47:37 (UTC+8)-
dc.date.available 27-May-2010 16:47:37 (UTC+8)-
dc.date.issued (上傳時間) 27-May-2010 16:47:37 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/39641-
dc.description.abstract (摘要) A collection of 3208 reported errors of Chinese words were analyzed. Among which, 7.2% involved rarely used character, and 98.4% were assigned common classifications of their causes by human subjects. In particular, 80% of the errors observed in writings of middle school students were related to the pronunciations and 30% were related to the compositions of words. Experimental results show that using intuitive Web-based statistics helped us capture only about 75% of these errors. In a related task, the Web-based statistics are useful for recommending incorrect characters for composing test items for "incorrect character identification" tests about 93% of the time.-
dc.language en-USen_US
dc.language.iso en_US-
dc.relation (關聯) Proceedings of the Forty Seventh Annual Meeting of the Association for Computational Linguisticsen_US
dc.subject (關鍵詞) Capturing errors;written Chinese wordsen_US
dc.subject (關鍵詞) algorithms; design; languages; modeling methodologies; natural language processing; performance-
dc.title (題名) Capturing errors in written Chinese wordsen_US
dc.type (資料類型) conferenceen