基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較

林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞; Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng

Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/110402

題名:	基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較
其他題名:	Comparing Chinese word associations based on big data and experimental norms
作者:	林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞 Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng
貢獻者:	國立政治大學邁向頂尖大學計畫創新研究團隊
日期:	2016
上傳時間:	19-Jun-2017
摘要:	本研究旨在比較基於大數據語料所自動生成之中文詞彙聯想（或稱詞彙共現）與基於真人實驗所建構之聯想常模。我們將Pecina（2010）中的57種詞彙共現強度計算法應用於巨量文本中，產生八萬五千多個常見中文詞彙兩兩間的共現強度（或稱聯想強度）。This study aims to compare two types of word association – the lexical collocations automatically generated using very large corpora and the association norms established in human experiments. Using very large text corpora, we computed the lexical association (or also called collocation) strengths between 85,346 Chinese words using the 57 word association measures described in Pecina (2010). Henceforth, we call the word association thus generated as the collocation dictionary. In order to validate the psychological reality of the automatically-generated word association, the Chinese word association norms established by Chen (1999) was used as a benchmark. The Chen word association norms consist of 1,200 stimulus words. In the free association experiment, each stimulus word was presented to 200 college students who were asked to write down the first word they came up with. For each stimulus word, the number of associate tokens is thus 200, but the average number of associate types is 86.
關聯:	2016創新研究國際學術研討會: 以人為本的在地創新之跨領域與跨界的對話 2016 International conference on innovation studies- human-centered indigenous innovation: trans-disciplinary dialogue 會議日期:2016.11.12-13
資料類型:	conference
Appears in Collections:	會議論文

Files in This Item:

File	Description	Size	Format
index.html		112 B	HTML2	View/Open

Show full item record

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Google Scholar^TM