學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 Textual analysis for studying Chinese historical documents and literary Novels
作者 劉昭麟
Liu, Chao-Lin
Jin, Guan-Tao
Wang, Hongsu
Liu, Qing-Feng
Cheng, Wen-Huei
Chiu, Wei-Yun
Tsai, Richard Tzong-Han
Wang, Yu-Chun
貢獻者 資訊科學系
關鍵詞 Character recognition; Computational linguistics; Data mining; History; Natural language processing systems; Text processing; 228 incident in Taiwan; Digital humanities; Geographical analysis; Historical documents; Keyword collocation; Name disambiguation; Named entity recognition; Temporal analysis; Text mining; Textual analysis; Computer aided analysis
日期 2015-10
上傳時間 10-Aug-2017 15:15:38 (UTC+8)
摘要 We analyzed historical and literary documents in Chinese to gain insights into research issues, and overview1 our studies which utilized four different sources of text materials in this paper. We investigated the history of concepts and transliterated words in China with the Database for the Study of Modern China Thought and Literature, which contains historical documents about China between 1830 and 1930. We also attempted to disambiguate names that were shared by multiple government officers who served between 618 and 1912 and were recorded in Chinese local gazetteers (/di4 fang1 zhi4/). To showcase the potentials and challenges of computer-assisted analysis of Chinese literatures, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (/xi1 you2 ji4/, JTTW), which was published in the 16th century, (2) Which was the most powerful monster in JTTW, and (3) Which major role smiled the most in the Dream of the Red Chamber (/hong2 lou2 meng4/), which was published in the 18th century. Similar approaches can be applied to the analysis and study of modern documents, such as the newspaper articles published about the 228 incident that occurred in 1947 in Taiwan. Copyright is held by the owner/author(s). Publication rights licensed to ACM.
關聯 ACM International Conference Proceeding Series, Volume 07-09-Ocobert-2015, 7 October 2015, 論文編號 a30
ASE BigData and SocialInformatics, ASE BD and SI 2015; Kaohsiung; Taiwan; 7 October 2015 到 9 October 2015; 代碼 118806
資料類型 conference
DOI https://arxiv.org/abs/1510.03021
dc.contributor 資訊科學系zh_Tw
dc.creator (作者) 劉昭麟zh_TW
dc.creator (作者) Liu, Chao-Linen_US
dc.creator (作者) Jin, Guan-Taoen_US
dc.creator (作者) Wang, Hongsuen_US
dc.creator (作者) Liu, Qing-Fengen_US
dc.creator (作者) Cheng, Wen-Hueien_US
dc.creator (作者) Chiu, Wei-Yunen_US
dc.creator (作者) Tsai, Richard Tzong-Hanen_US
dc.creator (作者) Wang, Yu-Chunen_US
dc.date (日期) 2015-10en_US
dc.date.accessioned 10-Aug-2017 15:15:38 (UTC+8)-
dc.date.available 10-Aug-2017 15:15:38 (UTC+8)-
dc.date.issued (上傳時間) 10-Aug-2017 15:15:38 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/111899-
dc.description.abstract (摘要) We analyzed historical and literary documents in Chinese to gain insights into research issues, and overview1 our studies which utilized four different sources of text materials in this paper. We investigated the history of concepts and transliterated words in China with the Database for the Study of Modern China Thought and Literature, which contains historical documents about China between 1830 and 1930. We also attempted to disambiguate names that were shared by multiple government officers who served between 618 and 1912 and were recorded in Chinese local gazetteers (/di4 fang1 zhi4/). To showcase the potentials and challenges of computer-assisted analysis of Chinese literatures, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (/xi1 you2 ji4/, JTTW), which was published in the 16th century, (2) Which was the most powerful monster in JTTW, and (3) Which major role smiled the most in the Dream of the Red Chamber (/hong2 lou2 meng4/), which was published in the 18th century. Similar approaches can be applied to the analysis and study of modern documents, such as the newspaper articles published about the 228 incident that occurred in 1947 in Taiwan. Copyright is held by the owner/author(s). Publication rights licensed to ACM.en_US
dc.format.extent 662386 bytes-
dc.format.mimetype application/pdf-
dc.relation (關聯) ACM International Conference Proceeding Series, Volume 07-09-Ocobert-2015, 7 October 2015, 論文編號 a30en_US
dc.relation (關聯) ASE BigData and SocialInformatics, ASE BD and SI 2015; Kaohsiung; Taiwan; 7 October 2015 到 9 October 2015; 代碼 118806en_US
dc.subject (關鍵詞) Character recognition; Computational linguistics; Data mining; History; Natural language processing systems; Text processing; 228 incident in Taiwan; Digital humanities; Geographical analysis; Historical documents; Keyword collocation; Name disambiguation; Named entity recognition; Temporal analysis; Text mining; Textual analysis; Computer aided analysisen_US
dc.title (題名) Textual analysis for studying Chinese historical documents and literary Novelsen_US
dc.type (資料類型) conference
dc.identifier.doi (DOI) 10.1145/2818869.2818912
dc.doi.uri (DOI) https://arxiv.org/abs/1510.03021