Publications-Theses

題名 建構以語意社會網路為主的部落格入口網站
Building Semantic Social Network-Based Blog Portal
作者 余承遠
Yu,Cheng-Yuan
貢獻者 胡毓忠
Hu,Yuh-Jong
余承遠
Yu,Cheng-Yuan
關鍵詞 語意網
社會網路分析
本體論
入口網站
部落格
Semantic Web
Social Network Analysis
Ontology
Portal
Blog
Web 2.0
日期 2006
上傳時間 17-Sep-2009 13:55:40 (UTC+8)
摘要 Web 2.0的提出,主要的概念是以Web為平台,以「個人」為中心,透過群體智慧的方式來共享與產生知識,例如維基百科、部落格等。部落格提供了個人自由創作與發表文章空間,主要以RSS、Trackback為共有標準,服務提供者可另外加上自訂功能。然而部落格每天所產生的文章量相當龐大,我們是否有辦法在這些文章中,找出符合使用者想看的文章。本研究期望建構一個部落格入口網站,分析目前部落格使用的特徵,比較與目前Web環境差異;引入語意網技術,針對Metadata處理資訊,設計本體論(Ontology)來描述人、文章與標籤之間的關係並建立簡單分類;導入大眾既有經驗與人脈網路建立,觀察社會網路所能提供的貢獻;實作上將透過特徵分析來設計Crawler,自動抓取並解析文章,並建置入口網站,進行資料的分析與驗證,探討加入語意網與社會網路分析的結合所能帶來的效益。
The Web 2.0 is based on the main concept "individuals" as the center, through the collaborative wisdom to share and to generate knowledge on the Web, such as the Wikipedia, Blog, etc. Blog provides a space for the free creativity and posting articles from individuals. Based on RSS and Trackback service providers can set an additional function. However, the daily amount of articles issued from the Blog is enormous. How can we provide methods for users to find their interesting articles? This study hopes to build the Blog portal and analysis of the current Blog features compared with the web environment. We use semantic web technology and focus on metadata processing. The ontology describes the relationship among persons, articles, tags and a simple categorization. Folks experience and relationship are established and observed with the benefits from social network analysis. In this study, we implement a crawler, and automatically grab and analysis articles. With constructing the portal, we extract information and discuss the benefits of using combination semantic web and social network analysis
參考文獻 [1] Andreas Harth, Jurgen Umbrich, and Stefan Decker, MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data. The fifth International Semantic Web Conference, Athens, GA, USA, November 5-9, 2006.
[2] Arvid Arasu, Junghoo Cho, Hector Garcia-Molina, et al., Searching the Web. ACM Transactions on Internet Technology, Vol. 1, No. 1, August 2001, Pages 2–43.
[3] Atanas Kiryakov, Borislav Popov, Ivan Terziev,et al., Semantic annotation, indexing, and retrieval. Web Semantics: Science, Services and Agents on the World Wide Web 2 (2004) 49–79.
[4] Blo.gs, http://blo.gs/
[5] BlogPulse, http://www.blogpulse.com/index.html
[6] Christopher H. Brooks and Vancy Montanez, Improve Annotation of the Blogosphere via Autotagging and Hierarchical Clustering. WWW 2006, May 23-26, 2006, Edinburgh, Scotland.
[7] D. Gruhl, R. Guha, David Liben-Nowell and A. Tomkins, Information Diffusion Through Blogspace. WWW 2004, May 17–22, 2004, New York, USA.
[8] David R. Karger and Dennis Quan, 2004, What Would It Mean to Blog on the Semantic Web?. The Third International Semantic Web Conference(ISWC 2004), Springer-Verlag Berlin Heidelberg 2004.
[9] Feedster, http://www.feedster.com/
[10] Friend of a Friend (FOAF) Vocabulary: http://xmlns.com/foaf/0.1/
[11] Google Blog Search (BETA), http://blogsearch.google.com/
[12] Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd, 1999. The PageRank citation ranking: Bringing order to the Web. Tech. Rep. Computer Systems Laboratory, Stanford University, Stanford, CA.
[13] Li Ding, Lina Zhou, Tim Finin, and Anupam Joshi, How the Semantic Web is Being Used: An Analysis of FOAF Documents. Proceedings of the 38th Hawaii International Conference on System Sciences 2005.
[14] OWL Web Ontology Language, http://www.w3.org/TR/owl-features/
[15] Peter Mika, Flink: SemanticWeb Technology for the Extraction and Analysis of Social Networks. Journal of Web Semantics, 3(2), 2005.
[16] R. Kosala and H. Blockeel, Web mining research: A survey. SIGKDD Esplorations, Volume 2, Issue 1, July 2000, Pages 1–15.
[17] Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkin, Structure and evolution of blogspace. COMMUNICATIONS OF THE ACM December 2004/Vol. 47, No. 12
[18] RDF Vocabulary Description Language 1.0: RDF Schema, http://www.w3.org/TR/rdf-schema/
[19] Ronald S. Burt, Structural Holes : The Social Structure of Competition. Cambridge, Mass.: Harvard University Press, 1992
[20] RSS 1.0 Specification, http://web.resource.org/rss/1.0/spec
[21] RSS 2.0 Specification, http://www.rssboard.org/rss-specification
[22] Scott, J., Social Network Analysis: A Handbook, Sage Publications, London (2000).
[23] Stephen Dill, Nadav Eiron, David Gibson, et al., A case for automated large-scale semantic annotation. Web Semantics: Science, Services and Agents on the World Wide Web 1 (2003) 115–132.
[24] Steve Cayzer, Semantic Blogging and Decentralized Knowledge Management. COMMUNICATIONS OF THE ACM, December 2004/Vol. 47, No. 12.
[25] Susan C. Herring, Inna Kouper, et al., Conversations in the Blogosphere: An Analysis “From the Bottom Up”, The Thirty-Eighth Hawaii’s International Conference on System Sciences (HICSS-38).
[26] Tim Berners-Lee, et al., The Semantic Web. Scientific American, May 2001.
[27] Tim Berners-Lee new layer cake, http://www.w3.org/2005/Talks/0511-keynote-tbl/
[28] Technorati, http://www.technorati.com/
[29] Victoria Uren, Philipp Cimiano, Jose Iria, et al., Semantic annotation for knowledge management: Requirements and a survey of the state of the art. Web Semantics: Science, Services and Agents on the World Wide Web 4 (2006) 14–28
[30] What is Web 2.0, http://www.oreillynet.com/pub/a/oreilly/tim/news/
2005/09/30/what-is-web-20.html
[31] Wouter de Nooy, Andrej Mrvar, and Vladimir Batageli, Exploratory Social Network Analysis with Pajek, American (2005).
描述 碩士
國立政治大學
資訊科學學系
93753002
95
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0093753002
資料類型 thesis
dc.contributor.advisor 胡毓忠zh_TW
dc.contributor.advisor Hu,Yuh-Jongen_US
dc.contributor.author (Authors) 余承遠zh_TW
dc.contributor.author (Authors) Yu,Cheng-Yuanen_US
dc.creator (作者) 余承遠zh_TW
dc.creator (作者) Yu,Cheng-Yuanen_US
dc.date (日期) 2006en_US
dc.date.accessioned 17-Sep-2009 13:55:40 (UTC+8)-
dc.date.available 17-Sep-2009 13:55:40 (UTC+8)-
dc.date.issued (上傳時間) 17-Sep-2009 13:55:40 (UTC+8)-
dc.identifier (Other Identifiers) G0093753002en_US
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/32646-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊科學學系zh_TW
dc.description (描述) 93753002zh_TW
dc.description (描述) 95zh_TW
dc.description.abstract (摘要) Web 2.0的提出,主要的概念是以Web為平台,以「個人」為中心,透過群體智慧的方式來共享與產生知識,例如維基百科、部落格等。部落格提供了個人自由創作與發表文章空間,主要以RSS、Trackback為共有標準,服務提供者可另外加上自訂功能。然而部落格每天所產生的文章量相當龐大,我們是否有辦法在這些文章中,找出符合使用者想看的文章。本研究期望建構一個部落格入口網站,分析目前部落格使用的特徵,比較與目前Web環境差異;引入語意網技術,針對Metadata處理資訊,設計本體論(Ontology)來描述人、文章與標籤之間的關係並建立簡單分類;導入大眾既有經驗與人脈網路建立,觀察社會網路所能提供的貢獻;實作上將透過特徵分析來設計Crawler,自動抓取並解析文章,並建置入口網站,進行資料的分析與驗證,探討加入語意網與社會網路分析的結合所能帶來的效益。zh_TW
dc.description.abstract (摘要) The Web 2.0 is based on the main concept "individuals" as the center, through the collaborative wisdom to share and to generate knowledge on the Web, such as the Wikipedia, Blog, etc. Blog provides a space for the free creativity and posting articles from individuals. Based on RSS and Trackback service providers can set an additional function. However, the daily amount of articles issued from the Blog is enormous. How can we provide methods for users to find their interesting articles? This study hopes to build the Blog portal and analysis of the current Blog features compared with the web environment. We use semantic web technology and focus on metadata processing. The ontology describes the relationship among persons, articles, tags and a simple categorization. Folks experience and relationship are established and observed with the benefits from social network analysis. In this study, we implement a crawler, and automatically grab and analysis articles. With constructing the portal, we extract information and discuss the benefits of using combination semantic web and social network analysisen_US
dc.description.tableofcontents 第一章 導論 1
1.1 研究背景 1
1.2 研究目的 2
1.3 各章節概述 3
第二章 相關研究 4
2.1 部落格介紹與相關研究 4
2.2 語意部落格 6
2.3 分類與標籤系統 10
2.4 部落格入口網站 12
2.4.1入口網站設計 12
2.4.2部落格入口網站發展現況 13
第三章 語意部落格入口網站設計 15
3.1 來源格式分析 16
3.1.1 網頁資料交換和匯集標記(RSS) 16
3.1.2 人際關係標記(FOAF) 18
3.1.3 超本文標記語言(HTML) 19
3.2 部落格中的社會網路 20
3.2.1 社會網路指標 20
3.2.2 人與人社群關係 23
3.2.3 文章與文章社群關係 24
3.3 本體論設計 24
3.3.1 部落格領域的本體論 24
3.3.2 文章主題的本體論 27
3.4 自動語意標記 29
3.5 查詢與展示 31
第四章 語意部落格架構與實作 33
4.1 資料蒐集 33
4.2 系統架構 34
4.3 搜尋系統實做 35
4.3.1 網頁抓取設計(Crawler Design) 35
4.3.2 儲存媒介設計(Repository Design) 36
4.4 主題本體論設計 37
4.4.1 上層頻道與標籤關係產生 37
4.4.2 標籤與標籤關係產生 40
第五章 社會網路分析 42
5.1 全體使用者分析 42
5.2 群體使用者分析 48
5.3 文章的社會網路分析 50
5.4 社會網路分析與入口網站 51
第六章 語意部落格入口網站呈現 53
6.1 基本功能查詢 53
6.1.1 首頁資訊匯集 53
6.1.2 查詢標籤相關性 55
6.1.3 分類概念 + 關鍵字查詢 56
6.2結合社會網路與串聯機制 59
第七章 結論與未來展望 63
7.1 結論 63
7.2 未來展望 64
參考文獻 65
zh_TW
dc.format.extent 47996 bytes-
dc.format.extent 62951 bytes-
dc.format.extent 219202 bytes-
dc.format.extent 349532 bytes-
dc.format.extent 226108 bytes-
dc.format.extent 452600 bytes-
dc.format.extent 822446 bytes-
dc.format.extent 385177 bytes-
dc.format.extent 802971 bytes-
dc.format.extent 2251891 bytes-
dc.format.extent 221862 bytes-
dc.format.extent 237880 bytes-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.language.iso en_US-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0093753002en_US
dc.subject (關鍵詞) 語意網zh_TW
dc.subject (關鍵詞) 社會網路分析zh_TW
dc.subject (關鍵詞) 本體論zh_TW
dc.subject (關鍵詞) 入口網站zh_TW
dc.subject (關鍵詞) 部落格zh_TW
dc.subject (關鍵詞) Semantic Weben_US
dc.subject (關鍵詞) Social Network Analysisen_US
dc.subject (關鍵詞) Ontologyen_US
dc.subject (關鍵詞) Portalen_US
dc.subject (關鍵詞) Blogen_US
dc.subject (關鍵詞) Web 2.0en_US
dc.title (題名) 建構以語意社會網路為主的部落格入口網站zh_TW
dc.title (題名) Building Semantic Social Network-Based Blog Portalen_US
dc.type (資料類型) thesisen
dc.relation.reference (參考文獻) [1] Andreas Harth, Jurgen Umbrich, and Stefan Decker, MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data. The fifth International Semantic Web Conference, Athens, GA, USA, November 5-9, 2006.zh_TW
dc.relation.reference (參考文獻) [2] Arvid Arasu, Junghoo Cho, Hector Garcia-Molina, et al., Searching the Web. ACM Transactions on Internet Technology, Vol. 1, No. 1, August 2001, Pages 2–43.zh_TW
dc.relation.reference (參考文獻) [3] Atanas Kiryakov, Borislav Popov, Ivan Terziev,et al., Semantic annotation, indexing, and retrieval. Web Semantics: Science, Services and Agents on the World Wide Web 2 (2004) 49–79.zh_TW
dc.relation.reference (參考文獻) [4] Blo.gs, http://blo.gs/zh_TW
dc.relation.reference (參考文獻) [5] BlogPulse, http://www.blogpulse.com/index.htmlzh_TW
dc.relation.reference (參考文獻) [6] Christopher H. Brooks and Vancy Montanez, Improve Annotation of the Blogosphere via Autotagging and Hierarchical Clustering. WWW 2006, May 23-26, 2006, Edinburgh, Scotland.zh_TW
dc.relation.reference (參考文獻) [7] D. Gruhl, R. Guha, David Liben-Nowell and A. Tomkins, Information Diffusion Through Blogspace. WWW 2004, May 17–22, 2004, New York, USA.zh_TW
dc.relation.reference (參考文獻) [8] David R. Karger and Dennis Quan, 2004, What Would It Mean to Blog on the Semantic Web?. The Third International Semantic Web Conference(ISWC 2004), Springer-Verlag Berlin Heidelberg 2004.zh_TW
dc.relation.reference (參考文獻) [9] Feedster, http://www.feedster.com/zh_TW
dc.relation.reference (參考文獻) [10] Friend of a Friend (FOAF) Vocabulary: http://xmlns.com/foaf/0.1/zh_TW
dc.relation.reference (參考文獻) [11] Google Blog Search (BETA), http://blogsearch.google.com/zh_TW
dc.relation.reference (參考文獻) [12] Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd, 1999. The PageRank citation ranking: Bringing order to the Web. Tech. Rep. Computer Systems Laboratory, Stanford University, Stanford, CA.zh_TW
dc.relation.reference (參考文獻) [13] Li Ding, Lina Zhou, Tim Finin, and Anupam Joshi, How the Semantic Web is Being Used: An Analysis of FOAF Documents. Proceedings of the 38th Hawaii International Conference on System Sciences 2005.zh_TW
dc.relation.reference (參考文獻) [14] OWL Web Ontology Language, http://www.w3.org/TR/owl-features/zh_TW
dc.relation.reference (參考文獻) [15] Peter Mika, Flink: SemanticWeb Technology for the Extraction and Analysis of Social Networks. Journal of Web Semantics, 3(2), 2005.zh_TW
dc.relation.reference (參考文獻) [16] R. Kosala and H. Blockeel, Web mining research: A survey. SIGKDD Esplorations, Volume 2, Issue 1, July 2000, Pages 1–15.zh_TW
dc.relation.reference (參考文獻) [17] Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkin, Structure and evolution of blogspace. COMMUNICATIONS OF THE ACM December 2004/Vol. 47, No. 12zh_TW
dc.relation.reference (參考文獻) [18] RDF Vocabulary Description Language 1.0: RDF Schema, http://www.w3.org/TR/rdf-schema/zh_TW
dc.relation.reference (參考文獻) [19] Ronald S. Burt, Structural Holes : The Social Structure of Competition. Cambridge, Mass.: Harvard University Press, 1992zh_TW
dc.relation.reference (參考文獻) [20] RSS 1.0 Specification, http://web.resource.org/rss/1.0/speczh_TW
dc.relation.reference (參考文獻) [21] RSS 2.0 Specification, http://www.rssboard.org/rss-specificationzh_TW
dc.relation.reference (參考文獻) [22] Scott, J., Social Network Analysis: A Handbook, Sage Publications, London (2000).zh_TW
dc.relation.reference (參考文獻) [23] Stephen Dill, Nadav Eiron, David Gibson, et al., A case for automated large-scale semantic annotation. Web Semantics: Science, Services and Agents on the World Wide Web 1 (2003) 115–132.zh_TW
dc.relation.reference (參考文獻) [24] Steve Cayzer, Semantic Blogging and Decentralized Knowledge Management. COMMUNICATIONS OF THE ACM, December 2004/Vol. 47, No. 12.zh_TW
dc.relation.reference (參考文獻) [25] Susan C. Herring, Inna Kouper, et al., Conversations in the Blogosphere: An Analysis “From the Bottom Up”, The Thirty-Eighth Hawaii’s International Conference on System Sciences (HICSS-38).zh_TW
dc.relation.reference (參考文獻) [26] Tim Berners-Lee, et al., The Semantic Web. Scientific American, May 2001.zh_TW
dc.relation.reference (參考文獻) [27] Tim Berners-Lee new layer cake, http://www.w3.org/2005/Talks/0511-keynote-tbl/zh_TW
dc.relation.reference (參考文獻) [28] Technorati, http://www.technorati.com/zh_TW
dc.relation.reference (參考文獻) [29] Victoria Uren, Philipp Cimiano, Jose Iria, et al., Semantic annotation for knowledge management: Requirements and a survey of the state of the art. Web Semantics: Science, Services and Agents on the World Wide Web 4 (2006) 14–28zh_TW
dc.relation.reference (參考文獻) [30] What is Web 2.0, http://www.oreillynet.com/pub/a/oreilly/tim/news/zh_TW
dc.relation.reference (參考文獻) 2005/09/30/what-is-web-20.htmlzh_TW
dc.relation.reference (參考文獻) [31] Wouter de Nooy, Andrej Mrvar, and Vladimir Batageli, Exploratory Social Network Analysis with Pajek, American (2005).zh_TW