Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/76129
題名: Using linguistic features to classify texts for reading comprehension tests at the high school levels
作者: Huang, Chao-Shainn;Kuo, Wei-Ti;Li, Chia Ling;Tsai, Chia Chi;Liu, Chao-Lin
黃昭憲;郭韋狄;李嘉玲;蔡家琦;劉昭麟
貢獻者: 資科系
關鍵詞: We investigate the issue of classifying short essays based their linguistic issues, for English at the high school levels. A good selection of appropriate essays is crucial for the language learners and for the reading comprehension tests, which is an important type of tests for language competence examinations. Although the text alone does not allow us to judge the difficulty of reading comprehension tests, the capability to identify the levels of high school students for whom the texts were used in the reading comprehension can be an important step toward computer assisted selection of reading comprehension test items. We employed word-level statistics, sentence-level statistics, and syntactic-level information of the text, and applied several machine learning techniques for this text classification problem. Experimental results show that, with the best performing combination of features and learning method, we achieved 53.6% in accuracy.
日期: 2010
上傳時間: 29-Jun-2015
關聯: Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing, ROCLING 2010, 2010, Pages 98-112, 22nd Conference on Computational Linguistics and Speech Processing, ROCLING 2010; Nantou; Taiwan; 1 September 2010 到 2 September 2010; 代碼 98580
資料類型: conference
DOI: Computer assisted; High school students; Linguistic features; Machine learning techniques; Reading comprehension; Reading comprehension tests; Text classification; Word-level statistics; Classification (of information); Computational linguistics; Learning systems; Speech processing; Text processing; Testing
Appears in Collections:會議論文

Show full item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.