Publications-Conference papers

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較
其他題名 Comparing Chinese word associations based on big data and experimental norms
作者 林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞
Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng
貢獻者 國立政治大學邁向頂尖大學計畫創新研究團隊
日期 2016
上傳時間 19-Jun-2017 17:31:49 (UTC+8)
摘要 本研究旨在比較基於大數據語料所自動生成之中文詞彙聯想(或稱詞彙共現)與基於真人實驗所建構之聯想常模。我們將Pecina(2010)中的57種詞彙共現強度計算法應用於巨量文本中,產生八萬五千多個常見中文詞彙兩兩間的共現強度(或稱聯想強度)。This study aims to compare two types of word association – the lexical collocations automatically generated using very large corpora and the association norms established in human experiments. Using very large text corpora, we computed the lexical association (or also called collocation) strengths between 85,346 Chinese words using the 57 word association measures described in Pecina (2010). Henceforth, we call the word association thus generated as the collocation dictionary. In order to validate the psychological reality of the automatically-generated word association, the Chinese word association norms established by Chen (1999) was used as a benchmark. The Chen word association norms consist of 1,200 stimulus words. In the free association experiment, each stimulus word was presented to 200 college students who were asked to write down the first word they came up with. For each stimulus word, the number of associate tokens is thus 200, but the average number of associate types is 86.
關聯 2016創新研究國際學術研討會: 以人為本的在地創新之跨領域與跨界的對話 2016 International conference on innovation studies- human-centered indigenous innovation: trans-disciplinary dialogue
會議日期:2016.11.12-13
資料類型 conference
dc.contributor 國立政治大學邁向頂尖大學計畫創新研究團隊
dc.creator (作者) 林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞zh_TW
dc.creator (作者) Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng
dc.date (日期) 2016
dc.date.accessioned 19-Jun-2017 17:31:49 (UTC+8)-
dc.date.available 19-Jun-2017 17:31:49 (UTC+8)-
dc.date.issued (上傳時間) 19-Jun-2017 17:31:49 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/110402-
dc.description.abstract (摘要) 本研究旨在比較基於大數據語料所自動生成之中文詞彙聯想(或稱詞彙共現)與基於真人實驗所建構之聯想常模。我們將Pecina(2010)中的57種詞彙共現強度計算法應用於巨量文本中,產生八萬五千多個常見中文詞彙兩兩間的共現強度(或稱聯想強度)。This study aims to compare two types of word association – the lexical collocations automatically generated using very large corpora and the association norms established in human experiments. Using very large text corpora, we computed the lexical association (or also called collocation) strengths between 85,346 Chinese words using the 57 word association measures described in Pecina (2010). Henceforth, we call the word association thus generated as the collocation dictionary. In order to validate the psychological reality of the automatically-generated word association, the Chinese word association norms established by Chen (1999) was used as a benchmark. The Chen word association norms consist of 1,200 stimulus words. In the free association experiment, each stimulus word was presented to 200 college students who were asked to write down the first word they came up with. For each stimulus word, the number of associate tokens is thus 200, but the average number of associate types is 86.
dc.format.extent 112 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) 2016創新研究國際學術研討會: 以人為本的在地創新之跨領域與跨界的對話 2016 International conference on innovation studies- human-centered indigenous innovation: trans-disciplinary dialogue
dc.relation (關聯) 會議日期:2016.11.12-13
dc.title (題名) 基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較zh_TW
dc.title.alternative (其他題名) Comparing Chinese word associations based on big data and experimental norms
dc.type (資料類型) conference