學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 Sentimental Analysis on Big Data – On case of Financial Document Text Mining to Predict Sub-Index Trend
作者 姜國輝
Chiang, Johannes K.
Chen, Chun-Cheng
貢獻者 資管系
關鍵詞 Sentimental analysis; Big Data; LDA; SVM; Taiwan Electronic Sub-Index Trend
日期 2016-12
上傳時間 9-Jul-2018 14:52:59 (UTC+8)
摘要 This research analyzed the potential emotion by sentimental analysis in large volume of financial text documents about Taiwan electronic industry to predict the stock trend. In recent researches about sentimental analysis, supervised method was proven to be able to reach high accuracy, but the training set of supervised method should be classified by manpower and couldn’t discover the unknown category. So this research put forward a solution which mixed supervised and unsupervised methods. First, we introduce unsupervised method to find out the topics of documents. Then we calculated the sentimental index to judge the document’s emotional direction. After that, we find out which theme documents’ sentiment index are leading indicators in Taiwan electronic sub-index (TE). Finally, we used supervised method by integrating the sentimental index of leading indicators with other 24 indirect sentimental indexes to build the prediction model of TE. By result, we found that LDA model has better cluster performance than TFIDF-Kmeans model, and also has higher accuracy than NPMI-Concor model on classification. By comparing sentimental index with MACD, we proved that the trend of sentimental index and TE to each other is more similar than MACD line and TE to each other. We also discovered that the sentiment indexes from enterprise operation and macro-economics topics are leading indicators and found that the prediction model of TE which includes the sentiment index is better than which only includes the technical indicators.
關聯 Proceedings of 5th ICCSAE, ICCSAE, IEEE, pp.423-428
資料類型 conference
DOI http://dx.doi.org/10.2991/iccsae-15.2016.81
dc.contributor 資管系
dc.creator (作者) 姜國輝zh_TW
dc.creator (作者) Chiang, Johannes K.en_US
dc.creator (作者) Chen, Chun-Chengen_US
dc.date (日期) 2016-12
dc.date.accessioned 9-Jul-2018 14:52:59 (UTC+8)-
dc.date.available 9-Jul-2018 14:52:59 (UTC+8)-
dc.date.issued (上傳時間) 9-Jul-2018 14:52:59 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/118485-
dc.description.abstract (摘要) This research analyzed the potential emotion by sentimental analysis in large volume of financial text documents about Taiwan electronic industry to predict the stock trend. In recent researches about sentimental analysis, supervised method was proven to be able to reach high accuracy, but the training set of supervised method should be classified by manpower and couldn’t discover the unknown category. So this research put forward a solution which mixed supervised and unsupervised methods. First, we introduce unsupervised method to find out the topics of documents. Then we calculated the sentimental index to judge the document’s emotional direction. After that, we find out which theme documents’ sentiment index are leading indicators in Taiwan electronic sub-index (TE). Finally, we used supervised method by integrating the sentimental index of leading indicators with other 24 indirect sentimental indexes to build the prediction model of TE. By result, we found that LDA model has better cluster performance than TFIDF-Kmeans model, and also has higher accuracy than NPMI-Concor model on classification. By comparing sentimental index with MACD, we proved that the trend of sentimental index and TE to each other is more similar than MACD line and TE to each other. We also discovered that the sentiment indexes from enterprise operation and macro-economics topics are leading indicators and found that the prediction model of TE which includes the sentiment index is better than which only includes the technical indicators.en_US
dc.format.extent 1653959 bytes-
dc.format.mimetype application/pdf-
dc.relation (關聯) Proceedings of 5th ICCSAE, ICCSAE, IEEE, pp.423-428
dc.subject (關鍵詞) Sentimental analysis; Big Data; LDA; SVM; Taiwan Electronic Sub-Index Trenden_US
dc.title (題名) Sentimental Analysis on Big Data – On case of Financial Document Text Mining to Predict Sub-Index Trenden_US
dc.type (資料類型) conference
dc.identifier.doi (DOI) 10.2991/iccsae-15.2016.81
dc.doi.uri (DOI) http://dx.doi.org/10.2991/iccsae-15.2016.81