DSpace Community: Department of Statistics (統計系)
https://ah.lib.nccu.edu.tw/handle/140.119/71
Department of Statistics
2024-03-29T01:06:21Z

Semiparametric Estimation for Error-prone Partial Linear Single-index Models
https://ah.lib.nccu.edu.tw/handle/140.119/147491
Title: Semiparametric Estimation for Error-prone Partial Linear Single-index Models
2023-09-08T08:22:24Z

Measurement Error Correction for EWMA p-Charts
https://ah.lib.nccu.edu.tw/handle/140.119/147490
Title: Measurement Error Correction for EWMA p-Charts
2023-09-08T08:22:20Z

An Exact Control Chart for the Median of Birnbaum-Saunders Distribution
https://ah.lib.nccu.edu.tw/handle/140.119/147489
Title: An Exact Control Chart for the Median of Birnbaum-Saunders Distribution
2023-09-08T08:22:16Z

Predicting Stock Returns via Greedy Algorithm with Taiwanese News Data
https://ah.lib.nccu.edu.tw/handle/140.119/146908
Title: Predicting Stock Returns via Greedy Algorithm with Taiwanese News Data (在臺灣新聞資料下透過貪婪演算法預測股票報酬)
Authors: 程長磊; Cheng, Chang-Lei
Abstract: Advances in big data and natural language processing have given unstructured data, and textual data in particular, considerable research value. Many studies examine how textual information affects asset returns, which has made this an important topic in finance. Textual data are high-dimensional, however (the number of distinct words in news articles far exceeds the number of articles), so correctly analyzing the relationship between text and returns is a central problem for this line of research. News articles are the textual data that investors most commonly encounter when trading, yet unlike financial statements they provide no directly quantified information on which to base investment decisions. This study therefore applies two high-dimensional model selection methods, the Orthogonal Greedy Algorithm (OGA) of Ing and Lai (2011) and the Chebyshev Greedy Algorithm (CGA), a refinement by Chen, Dai, Ing, and Lai (2019), to select frequently used words from news articles and thereby quantify article sentiment scores. We estimate the relationship between the news sentiment factor and firm returns after excluding firm return factors, and compare the returns of portfolios constructed from the sentiment scores when the dependent variable (returns) is assumed to be linear versus nonlinear.
Under the linear assumption, with returns treated as a continuous variable, we use OGA and extend it into an OGA Predict model; under the nonlinear assumption we use CGA and extend it into a CGA Predict model. Both are novel applications of these selection methods to financial text analysis. We find that the CGA Predict model yields higher excess returns than the OGA Predict model. Performance evaluation further shows that the effect of news sentiment in the Taiwanese market, which is dominated by retail investors, differs significantly from its effect in the US market, which is dominated by institutional investors. This result is consistent with our economic intuition about the Taiwanese stock market.
Description: Master's; National Chengchi University; Department of Statistics; 110354030
2023-09-01T06:58:16Z
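
The greedy selection step that the abstract above attributes to OGA can be sketched roughly as follows. This is a minimal illustration and not the thesis's implementation: it assumes a hypothetical matrix `X` whose columns are word-frequency features and a vector `y` of returns, the function name `oga_select` and the fixed step count `n_steps` are invented for the example, and no stopping criterion or prediction stage (the "OGA Predict" extension) is included.

```python
import numpy as np


def oga_select(X, y, n_steps):
    """Greedy forward selection in the spirit of an orthogonal greedy algorithm.

    At each step, pick the column most correlated with the current residual,
    then refit least squares on all columns selected so far.
    Hypothetical helper for illustration only.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)

    # Center and scale the columns so inner products behave like correlations.
    Xc = X - X.mean(axis=0)
    norms = np.linalg.norm(Xc, axis=0)
    norms[norms == 0] = 1.0  # guard against constant columns
    Xs = Xc / norms

    yc = y - y.mean()
    residual = yc.copy()
    selected = []

    for _ in range(n_steps):
        scores = np.abs(Xs.T @ residual)
        if selected:
            scores[selected] = -np.inf  # never pick the same column twice
        j = int(np.argmax(scores))
        selected.append(j)

        # Refit on every selected column, which makes the new residual
        # orthogonal to all of them.
        coef, *_ = np.linalg.lstsq(Xs[:, selected], yc, rcond=None)
        residual = yc - Xs[:, selected] @ coef

    return selected
```

In the setting the abstract describes, the selected column indices would correspond to the words retained for scoring article sentiment; how those words are turned into a sentiment score is not specified in this record.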