學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 Issues and Perspectives from 10,000 Annotated Financial Social Media Data
作者 黃瀚萱
Huang, Hen-Hsen
Chen, Chung-Chi
Chen, Hsin-Hsi
貢獻者 資科系
關鍵詞 financial social media ; sentiment analysis ; market sentiment
日期 2020-05
上傳時間 4-Jun-2021 14:39:38 (UTC+8)
摘要 In this paper, we investigate the annotation of financial social media data from several angles. We present Fin-SoMe, a dataset with 10,000 labeled financial tweets annotated by experts from both the front desk and the middle desk in a bank’s treasury. These annotated results reveal that (1) writer-labeled market sentiment may be a misleading label; (2) writer’s sentiment and market sentiment of an investor may be different; (3) most financial tweets provide unfounded analysis results; and (4) almost no investors write down the gain/loss results for their positions, which would otherwise greatly facilitate detailed evaluation of their performance. Based on these results, we address various open problems and suggest possible directions for future work on financial social media data. We also provide an experiment on the key snippet extraction task to compare the performance of using a general sentiment dictionary and using the domain-specific dictionary. The results echo our findings from the experts’ annotations.
關聯 Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), European Language Resources Association, pp.6106–6110
資料類型 conference
dc.contributor 資科系
dc.creator (作者) 黃瀚萱
dc.creator (作者) Huang, Hen-Hsen
dc.creator (作者) Chen, Chung-Chi
dc.creator (作者) Chen, Hsin-Hsi
dc.date (日期) 2020-05
dc.date.accessioned 4-Jun-2021 14:39:38 (UTC+8)-
dc.date.available 4-Jun-2021 14:39:38 (UTC+8)-
dc.date.issued (上傳時間) 4-Jun-2021 14:39:38 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/135523-
dc.description.abstract (摘要) In this paper, we investigate the annotation of financial social media data from several angles. We present Fin-SoMe, a dataset with 10,000 labeled financial tweets annotated by experts from both the front desk and the middle desk in a bank’s treasury. These annotated results reveal that (1) writer-labeled market sentiment may be a misleading label; (2) writer’s sentiment and market sentiment of an investor may be different; (3) most financial tweets provide unfounded analysis results; and (4) almost no investors write down the gain/loss results for their positions, which would otherwise greatly facilitate detailed evaluation of their performance. Based on these results, we address various open problems and suggest possible directions for future work on financial social media data. We also provide an experiment on the key snippet extraction task to compare the performance of using a general sentiment dictionary and using the domain-specific dictionary. The results echo our findings from the experts’ annotations.
dc.format.extent 208652 bytes-
dc.format.mimetype application/pdf-
dc.relation (關聯) Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), European Language Resources Association, pp.6106–6110
dc.subject (關鍵詞) financial social media ; sentiment analysis ; market sentiment
dc.title (題名) Issues and Perspectives from 10,000 Annotated Financial Social Media Data
dc.type (資料類型) conference