學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 Sentiment classification of short Chinese sentences
作者 Sun, Ying-Tse;Chen, C.-L.;Liu, C.-C.;Liu, Chaolin;Soo, V.-W.
劉昭麟
貢獻者 資科系
關鍵詞 Chinese sentence; Classification methods; Classification tasks; Probabilistic modeling; Sentiment classification; Statistical information; Text classification; Classification (of information); Computational linguistics; Speech processing; Text processing; Internet
日期 2010
上傳時間 20-May-2015 17:03:26 (UTC+8)
摘要 We explore an extreme case of text classification. The short statements in micro-blogs were collected, and were associated by a category based on the sentiment indicated by the associated icons. We evaluated different methods that assigned the categories with just the wordings in the short statements. Short statements in micro-blogs are harder to classify because of the shortage of context, yet it is not rare for the statements to include words that may be linked to sentiments directly. In this work, we considered two polarities of sentiments: negative and positive. We employed the statistical information about the word usage, a dictionary for Chinese synonyms, and an emotional phrases dictionary to convert short statements into vectors, and applied techniques of support vector machines and probabilistic modeling for the classification task. The results of classification varied with the classification methods and experimental setups. The best one exceeded 80%, but the lowest just made 55%.
關聯 Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing, ROCLING 2010
資料類型 conference
dc.contributor 資科系
dc.creator (作者) Sun, Ying-Tse;Chen, C.-L.;Liu, C.-C.;Liu, Chaolin;Soo, V.-W.
dc.creator (作者) 劉昭麟zh_TW
dc.date (日期) 2010
dc.date.accessioned 20-May-2015 17:03:26 (UTC+8)-
dc.date.available 20-May-2015 17:03:26 (UTC+8)-
dc.date.issued (上傳時間) 20-May-2015 17:03:26 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/75199-
dc.description.abstract (摘要) We explore an extreme case of text classification. The short statements in micro-blogs were collected, and were associated by a category based on the sentiment indicated by the associated icons. We evaluated different methods that assigned the categories with just the wordings in the short statements. Short statements in micro-blogs are harder to classify because of the shortage of context, yet it is not rare for the statements to include words that may be linked to sentiments directly. In this work, we considered two polarities of sentiments: negative and positive. We employed the statistical information about the word usage, a dictionary for Chinese synonyms, and an emotional phrases dictionary to convert short statements into vectors, and applied techniques of support vector machines and probabilistic modeling for the classification task. The results of classification varied with the classification methods and experimental setups. The best one exceeded 80%, but the lowest just made 55%.
dc.format.extent 176 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing, ROCLING 2010
dc.subject (關鍵詞) Chinese sentence; Classification methods; Classification tasks; Probabilistic modeling; Sentiment classification; Statistical information; Text classification; Classification (of information); Computational linguistics; Speech processing; Text processing; Internet
dc.title (題名) Sentiment classification of short Chinese sentences
dc.type (資料類型) conferenceen