Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/133561
DC FieldValueLanguage
dc.contributor語言所
dc.creator張瑜芸
dc.creatorChang, Yu-Yun
dc.creatorMagistry, Pierre
dc.creatorHsieh, Shu-Kai
dc.date2016-12
dc.date.accessioned2021-01-18T05:21:25Z-
dc.date.available2021-01-18T05:21:25Z-
dc.date.issued2021-01-18T05:21:25Z-
dc.identifier.urihttp://nccur.lib.nccu.edu.tw/handle/140.119/133561-
dc.description.abstractIn this paper, we present a proposed system designed for sentiment detection for micro-blog data in Chinese. Our system surprisingly benefits from the lack of word boundary in Chinese writing system and shifts the focus directly to larger and more relevant chunks. We use an unsupervised Chinese word segmentation system and binomial test to extract specific and endogenous lexicon chunks from the training corpus. We combine the lexicon chunks with other external resources to train a maximum entropy model for document classification. With this method, we obtained an averaged F1 score of 87.2 which outperforms the state-of-the-art approach based on the released data in the second SocialNLP shared task.
dc.format.extent1137118 bytes-
dc.format.mimetypeapplication/pdf-
dc.relationLingua Sinica, Vol.2, No.1, pp.1-10
dc.subjectSentiment analysis;Emotion lexicon;Unsupervised learning
dc.titleSentiment detection in micro-blogs using unsupervised chunk extraction
dc.typearticle
dc.identifier.doi10.1186/s40655-015-0010-8
dc.doi.urihttps://doi.org/10.1186/s40655-015-0010-8
item.openairetypearticle-
item.fulltextWith Fulltext-
item.grantfulltextrestricted-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
Appears in Collections:期刊論文
Files in This Item:
File Description SizeFormat
19.pdf1.11 MBAdobe PDF2View/Open
Show simple item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.