基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較 | NCCU Academic Hub

Publications-Conference papers

Article View/Open

html(696)

Publication Export

Google Scholar^TM

NCCU Library

Discovery System

Citation Infomation

No doi shows Citation Infomation

Related Publications in TAIR

Simple Record
Full Record

題名	基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較
其他題名	Comparing Chinese word associations based on big data and experimental norms
作者	林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞 Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng
貢獻者	國立政治大學邁向頂尖大學計畫創新研究團隊
日期	2016
上傳時間	19-Jun-2017 17:31:49 (UTC+8)
摘要	本研究旨在比較基於大數據語料所自動生成之中文詞彙聯想（或稱詞彙共現）與基於真人實驗所建構之聯想常模。我們將Pecina（2010）中的57種詞彙共現強度計算法應用於巨量文本中，產生八萬五千多個常見中文詞彙兩兩間的共現強度（或稱聯想強度）。This study aims to compare two types of word association – the lexical collocations automatically generated using very large corpora and the association norms established in human experiments. Using very large text corpora, we computed the lexical association (or also called collocation) strengths between 85,346 Chinese words using the 57 word association measures described in Pecina (2010). Henceforth, we call the word association thus generated as the collocation dictionary. In order to validate the psychological reality of the automatically-generated word association, the Chinese word association norms established by Chen (1999) was used as a benchmark. The Chen word association norms consist of 1,200 stimulus words. In the free association experiment, each stimulus word was presented to 200 college students who were asked to write down the first word they came up with. For each stimulus word, the number of associate tokens is thus 200, but the average number of associate types is 86.
關聯	2016創新研究國際學術研討會: 以人為本的在地創新之跨領域與跨界的對話 2016 International conference on innovation studies- human-centered indigenous innovation: trans-disciplinary dialogue 會議日期:2016.11.12-13
資料類型	conference

dc.contributor	國立政治大學邁向頂尖大學計畫創新研究團隊
dc.creator (作者)	林淑晏;宋曜廷;陳學志;張雨霖;陳彥丞	zh_TW
dc.creator (作者)	Lin, Shu-Yen;Sung, Yao-Ting;Chen, Hsueh-Chih;Chang, Yu-Lin;Chen, Yen-Cheng
dc.date (日期)	2016
dc.date.accessioned	19-Jun-2017 17:31:49 (UTC+8)	-
dc.date.available	19-Jun-2017 17:31:49 (UTC+8)	-
dc.date.issued (上傳時間)	19-Jun-2017 17:31:49 (UTC+8)	-
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/110402	-
dc.description.abstract (摘要)	本研究旨在比較基於大數據語料所自動生成之中文詞彙聯想（或稱詞彙共現）與基於真人實驗所建構之聯想常模。我們將Pecina（2010）中的57種詞彙共現強度計算法應用於巨量文本中，產生八萬五千多個常見中文詞彙兩兩間的共現強度（或稱聯想強度）。This study aims to compare two types of word association – the lexical collocations automatically generated using very large corpora and the association norms established in human experiments. Using very large text corpora, we computed the lexical association (or also called collocation) strengths between 85,346 Chinese words using the 57 word association measures described in Pecina (2010). Henceforth, we call the word association thus generated as the collocation dictionary. In order to validate the psychological reality of the automatically-generated word association, the Chinese word association norms established by Chen (1999) was used as a benchmark. The Chen word association norms consist of 1,200 stimulus words. In the free association experiment, each stimulus word was presented to 200 college students who were asked to write down the first word they came up with. For each stimulus word, the number of associate tokens is thus 200, but the average number of associate types is 86.
dc.format.extent	112 bytes	-
dc.format.mimetype	text/html	-
dc.relation (關聯)	2016創新研究國際學術研討會: 以人為本的在地創新之跨領域與跨界的對話 2016 International conference on innovation studies- human-centered indigenous innovation: trans-disciplinary dialogue
dc.relation (關聯)	會議日期:2016.11.12-13
dc.title (題名)	基於大數據語料自動生成之中文詞彙聯想與實驗常模之比較	zh_TW
dc.title.alternative (其他題名)	Comparing Chinese word associations based on big data and experimental norms
dc.type (資料類型)	conference