A study on word similarity with social network analysis (詞義相似度的社會網路分析研究) | Publication

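Title	詞義相似度的社會網路分析研究 (A study on word similarity with social network analysis)
Author	溫文喆
Contributors	劉吉軒; 溫文喆
Keywords	Social Network Analysis (社會網路分析); Word Similarity (詞義相似度)
Date	2008
Uploaded	9-Apr-2010 13:29:59 (UTC+8)
Abstract	Social network analysis represents social relationships in the form of networks. Originally a tool purely for analyzing social interaction, it has in recent years been widely applied in sociology, organizational studies, information science, biology, linguistics, and other fields. By drawing on mathematical graph theory and steadily improving computing power, social network analysis can uncover regularities in the behavior of individuals from a perspective different from earlier approaches. Word similarity is one of the fundamental topics underlying technologies such as information retrieval, and many methods for measuring it have been proposed in recent years. This study applies social network analysis to English words. Using a corpus as the data source, it proposes different ways of constructing networks, defines the network nodes and links, and takes co-occurrence networks as the basis. By varying the generation and filtering conditions, it examines whether existing properties or indicators from social network analysis, suitably adjusted, can provide another way of measuring word similarity. The resulting networks and computed properties are validated against existing standard synonym benchmarks used in word similarity research, and the applicability of social network analysis to word similarity research is further discussed.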
References	[1] Wasserman, S. and Faust, K. (1994). Social network analysis: Methods and applications. New York: Cambridge University Press.
	[2] Freeman, L.C. (2004). The development of social network analysis. Vancouver, Canada: Empirical Press.
	[3] Batagelj, V. and Mrvar, A. (1998). Pajek: Program for large network analysis. Connections, 21(2), 47-57.
	[4] 熊瑞梅 (1995). 社會網路的資料搜集、測量及分析 [Data collection, measurement, and analysis of social networks]. In 社會調查與分析 [Social Survey and Analysis], Taipei, June 1995, pp. 313-356.
	[5] Rada, R., Mili, H., Bicknell, E. and Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 19(1), 17-30.
	[6] Mika, P. (2007). Ontologies are us: A unified model of social networks and semantics. Web Semantics: Science, Services and Agents on the World Wide Web, 5(1), 5-15.
	[7] Hull, D.A. (1996). Stemming algorithms: A case study for detailed evaluation. Journal of the American Society for Information Science, 47(1), 70-84.
	[8] Xu, J. and Croft, W.B. (1998). Corpus-based stemming using cooccurrence of word variants. ACM Transactions on Information Systems, 16(1), 61-79.
	[9] Turney, P. (2001). Mining the web for synonyms: PMI-IR versus LSA on TOEFL. ECML 2001, 491-502.
	[10] Landauer, T. and Dumais, S. (1997). A solution to Plato's problem: A latent semantic analysis theory of the acquisition, induction and representation of knowledge. Psychological Review, 104, 211-240.
	[11] Adamic, L.A. and Adar, E. (2003). Friends and neighbors on the Web. Social Networks, 25(3), 211-230.
Description	Master's (碩士); National Chengchi University (國立政治大學); Department of Computer Science (資訊科學學系); 96753033; 97
Source	http://thesis.lib.nccu.edu.tw/record/#G0096753033
Type	thesis
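
The abstract describes building co-occurrence networks from a corpus and asking whether network properties can serve as a word similarity measure. The sketch below illustrates that general idea only; it is not the thesis's method. The window size, the toy sentences, and the use of Jaccard overlap between neighbor sets as the similarity score are all assumptions chosen for illustration, and the function names are hypothetical.

```python
from collections import defaultdict


def build_cooccurrence_network(sentences, window=2):
    """Build an undirected co-occurrence network.

    Nodes are words; two words are linked when they occur within
    `window` tokens of each other in the same sentence.
    (The window size is an illustrative assumption.)
    """
    neighbors = defaultdict(set)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            # Look ahead at most `window` tokens; links are symmetric.
            for other in tokens[i + 1 : i + 1 + window]:
                if other != word:
                    neighbors[word].add(other)
                    neighbors[other].add(word)
    return neighbors


def jaccard_similarity(neighbors, a, b):
    """Score two words by the overlap of their neighbor sets.

    Jaccard overlap is an assumed stand-in measure, not necessarily
    the indicator used in the thesis.
    """
    na = neighbors.get(a, set()) - {b}
    nb = neighbors.get(b, set()) - {a}
    union = na | nb
    if not union:
        return 0.0
    return len(na & nb) / len(union)


# Toy, hand-made sentences (not taken from any real corpus).
sentences = [
    ["the", "big", "dog", "barked"],
    ["the", "large", "dog", "barked"],
    ["a", "small", "cat", "slept"],
]
network = build_cooccurrence_network(sentences, window=2)
print(jaccard_similarity(network, "big", "large"))
print(jaccard_similarity(network, "big", "small"))
```

In this toy data "big" and "large" share all of their neighbors while "big" and "small" share none, so the first score is higher. A real experiment would need part-of-speech filtering, stemming or lemmatisation, and a benchmark of synonym questions, as the table of contents indicates.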

dc.contributor.advisor	劉吉軒	zh_TW
dc.contributor.author (Authors)	溫文喆	zh_TW
dc.creator (Author)	溫文喆	zh_TW
dc.date (Date)	2008	en_US
dc.date.accessioned	9-Apr-2010 13:29:59 (UTC+8)	-
dc.date.available	9-Apr-2010 13:29:59 (UTC+8)	-
dc.date.issued (Uploaded)	9-Apr-2010 13:29:59 (UTC+8)	-
dc.identifier (Other Identifiers)	G0096753033	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/38547	-
dc.description (Description)	Master's (碩士)	zh_TW
dc.description (Description)	National Chengchi University (國立政治大學)	zh_TW
dc.description (Description)	Department of Computer Science (資訊科學學系)	zh_TW
dc.description (Description)	96753033	zh_TW
dc.description (Description)	97	zh_TW
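dc.description.tableofcontents	Chapter 1 Introduction: 1.1 Social network analysis; 1.2 Word similarity; 1.3 Network analysis research on word meaning; 1.4 Research motivation and objectives; 1.5 Thesis structure
	Chapter 2 Background knowledge and related techniques: 2.1 Social network analysis; 2.2 Social network models; 2.2.1 Social network properties; 2.2 Word similarity; 2.3 Information retrieval and natural language processing techniques; 2.3.1 Corpora; 2.3.2 Part-of-speech tagging; 2.3.3 Natural language parser; 2.3.4 Stemming and lemmatisation; 2.3.5 Term frequency-inverse document frequency (TF-IDF)
	Chapter 3 Word usage and social network models: 3.1 Research framework and workflow; 3.2 Data preprocessing; 3.2.1 Part-of-speech tagging; 3.2.2 Stemming, lemmatisation, and TF-IDF computation; 3.2.3 The BNC corpus; 3.3 Network construction; 3.3.1 Building the whole network; 3.3.2 Building word-specific networks; 3.3.3 Undirected co-occurrence networks; 3.3.3.1 Network sampling and link relations; 3.3.3.2 Network property indicators; 3.3.4 Directed co-occurrence networks; 3.3.4.1 Network sampling and link relations; 3.3.4.2 Network property indicators; 3.4 Evaluation against standard benchmarks
	Chapter 4 Whole-word network model: 4.1 Stepwise filtering of the whole network; 4.2 Adjusting the filtering method for the whole network; 4.3 Similarity computation results in the whole network; 4.4 Summary
	Chapter 5 Local networks for specific words: 5.1 Undirected co-occurrence networks; 5.1.1 Adjusting word filtering conditions; 5.1.2 Adjusting the TF or IDF filtering conditions for word selection; 5.1.3 Adjusting the co-occurrence distance from the keyword; 5.1.4 Adjusting whether words sharing a stem are treated as one node; 5.1.5 Adjusting the similarity computation; 5.1.6 Improving part-of-speech filtering; 5.1.7 Adjusting with lemmatisation; 5.1.8 Filtering with context information around specific words; 5.2 Directed co-occurrence networks; 5.3 Discussion; 5.4 Summary
	Chapter 6 Conclusions and future directions: 6.1 Conclusions; 6.2 Future research directions
	References	zh_TW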
dc.format.extent	120844 bytes	-
dc.format.extent	118896 bytes	-
dc.format.extent	150026 bytes	-
dc.format.extent	159923 bytes	-
dc.format.extent	211282 bytes	-
dc.format.extent	259055 bytes	-
dc.format.extent	308231 bytes	-
dc.format.extent	251209 bytes	-
dc.format.extent	291822 bytes	-
dc.format.extent	154819 bytes	-
dc.format.extent	165848 bytes	-
dc.format.mimetype	application/pdf	-
dc.language.iso	en_US	-
dc.source.uri (Source)	http://thesis.lib.nccu.edu.tw/record/#G0096753033	en_US
dc.subject (Keywords)	社會網路分析	zh_TW
dc.subject (Keywords)	詞義相似度	zh_TW
dc.subject (Keywords)	Social Network Analysis	en_US
dc.subject (Keywords)	Word Similarity	en_US
dc.title (Title)	詞義相似度的社會網路分析研究	zh_TW
dc.title (Title)	A study on word similarity with social network analysis	en_US
dc.type (Type)	thesis	en
