Visually and phonologically similar characters in incorrect Chinese words: Analyses, identification, and ... | Publication

Publications-Periodical Articles

Article View/Open

pdf(1654)

Publication Export

Google Scholar^TM

NCCU Library

Discovery System

Citation Infomation

Related Publications in TAIR

Simple Record
Full Record

題名	Visually and phonologically similar characters in incorrect Chinese words: Analyses, identification, and applications
作者	Liu, Chao-Lin Lai, Min-Hua Chuang, Yi-Hsuan Lee, Chia-Ying 劉昭麟
貢獻者	政大資訊科學系
關鍵詞	Error analysis of written Chinese text; student modeling; traditional Chinese; simplified Chinese; computer-assisted language learning; psycholinguistics
日期	2011-06
上傳時間	2-Nov-2012 15:31:03 (UTC+8)
摘要	Visually and phonologically similar characters are major contributing factors for errors in Chinese text. By defining appropriate similarity measures that consider extended Cangjie codes, we can identify visually similar characters within a fraction of a second. Relying on the pronunciation information noted for individual characters in Chinese lexicons, we can compute a list of characters that are phonologically similar to a given character. We collected 621 incorrect Chinese words reported on the Internet, and analyzed the causes of these errors. 83% of these errors were related to phonological similarity, and 48% of them were related to visual similarity between the involved characters. Generating the lists of phonologically and visually similar characters, our programs were able to contain more than 90% of the incorrect characters in the reported errors.
關聯	ACM Transactions on Asian Language Information Processing, 10(2), 1-39
資料類型	article
DOI	http://dx.doi.org/10.1145/1967293.1967297

dc.contributor	政大資訊科學系	en
dc.creator (作者)	Liu, Chao-Lin	en
dc.creator (作者)	Lai, Min-Hua	en
dc.creator (作者)	Chuang, Yi-Hsuan	en
dc.creator (作者)	Lee, Chia-Ying	en
dc.creator (作者)	劉昭麟	zh_TW
dc.date (日期)	2011-06	-
dc.date.accessioned	2-Nov-2012 15:31:03 (UTC+8)	-
dc.date.available	2-Nov-2012 15:31:03 (UTC+8)	-
dc.date.issued (上傳時間)	2-Nov-2012 15:31:03 (UTC+8)	-
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/55171	-
dc.description.abstract (摘要)	Visually and phonologically similar characters are major contributing factors for errors in Chinese text. By defining appropriate similarity measures that consider extended Cangjie codes, we can identify visually similar characters within a fraction of a second. Relying on the pronunciation information noted for individual characters in Chinese lexicons, we can compute a list of characters that are phonologically similar to a given character. We collected 621 incorrect Chinese words reported on the Internet, and analyzed the causes of these errors. 83% of these errors were related to phonological similarity, and 48% of them were related to visual similarity between the involved characters. Generating the lists of phonologically and visually similar characters, our programs were able to contain more than 90% of the incorrect characters in the reported errors.	en
dc.format.extent	213300 bytes	-
dc.format.mimetype	application/pdf	-
dc.language	zh_TW	en
dc.language.iso	en_US	-
dc.relation (關聯)	ACM Transactions on Asian Language Information Processing, 10(2), 1-39	en
dc.subject (關鍵詞)	Error analysis of written Chinese text; student modeling; traditional Chinese; simplified Chinese; computer-assisted language learning; psycholinguistics	-
dc.title (題名)	Visually and phonologically similar characters in incorrect Chinese words: Analyses, identification, and applications	en
dc.type (資料類型)	article	en
dc.identifier.doi (DOI)	10.1145/1967293.1967297	-
dc.doi.uri (DOI)	http://dx.doi.org/10.1145/1967293.1967297	-

Publications-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM