dc.contributor | 資科系 | |
dc.creator (作者) | 黃瀚萱 | |
dc.creator (作者) | Huang, Hen-Hsen | |
dc.creator (作者) | Chen, Chung-Chi | |
dc.creator (作者) | Chen, Hsin-Hsi | |
dc.date (日期) | 2020-05 | |
dc.date.accessioned | 4-六月-2021 14:39:38 (UTC+8) | - |
dc.date.available | 4-六月-2021 14:39:38 (UTC+8) | - |
dc.date.issued (上傳時間) | 4-六月-2021 14:39:38 (UTC+8) | - |
dc.identifier.uri (URI) | http://nccur.lib.nccu.edu.tw/handle/140.119/135523 | - |
dc.description.abstract (摘要) | In this paper, we investigate the annotation of financial social media data from several angles. We present Fin-SoMe, a dataset with 10,000 labeled financial tweets annotated by experts from both the front desk and the middle desk in a bank’s treasury. These annotated results reveal that (1) writer-labeled market sentiment may be a misleading label; (2) writer’s sentiment and market sentiment of an investor may be different; (3) most financial tweets provide unfounded analysis results; and (4) almost no investors write down the gain/loss results for their positions, which would otherwise greatly facilitate detailed evaluation of their performance. Based on these results, we address various open problems and suggest possible directions for future work on financial social media data. We also provide an experiment on the key snippet extraction task to compare the performance of using a general sentiment dictionary and using the domain-specific dictionary. The results echo our findings from the experts’ annotations. | |
dc.format.extent | 208652 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.relation (關聯) | Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), European Language Resources Association, pp.6106–6110 | |
dc.subject (關鍵詞) | financial social media ; sentiment analysis ; market sentiment | |
dc.title (題名) | Issues and Perspectives from 10,000 Annotated Financial Social Media Data | |
dc.type (資料類型) | conference | |