深度強化學習之模型比較: 以股票自動交易系統為例

Publications-Theses

Article View/Open

pdf(0)

Publication Export

Google Scholar^TM

NCCU Library

Discovery System

Citation Infomation

Related Publications in TAIR

Simple Record
Full Record

題名	深度強化學習之模型比較: 以股票自動交易系統為例 A Comparison of Deep Reinforcement Learning Models: The Case of Stock Automated Trading System
作者	黃瑜萍 Huang, Yu-Ping
貢獻者	蔡炎龍<br>蕭明福 Tsai, Yen-Lung<br>Shaw, Ming-Fu 黃瑜萍 Huang, Yu-Ping
關鍵詞	深度學習強化學習深度 Q 學習匯率股票交易 Deep learning Reinforcement learning Deep Q learning Exchange rate Stock trading
日期	2021
上傳時間	4-Aug-2021 15:58:49 (UTC+8)
摘要	本研究引入深度 Q 學習方法，建構一個自動化股票交易系統，研究範圍包含台灣股票市場 14 家科技業公司。研究期間為 2016 年 1 月 4 日至 2020年 12 月 31 日。本研究數據資料有兩種型態 (1) 股票資訊，(2) 股票資訊加上匯率參數。我們將深度 Q 學習的模型，與不同模型和其他策略相比較，以檢測深度 Q 學習是否更適用於股票交易。實證結果發現支持向量機與神經網路在實務面上難以進行股票交易操作，而深度 Q 學習的模型則具有相對好的成效。尤其，加入匯率參數的深度 Q 學習，獲得的報酬皆優於買入持有策略和台灣加權股價指數。 This research introduces the Deep Q learning model to construct an automated stock trading system. Our samples are 14 Taiwanese technology companies. Specifically, we include two types of data, (1) stock information and (2) stock information and exchange rate parameters, which are collected from the Taiwan stock market. The sampling period is from Jan 4, 2016 to Dec 31, 2020. We compare our main model, Deep Q learning, with different models and strategies to examine whether Deep Q learning is more applicable to stock trading. The empirical results show that it is difficult for Support vector machines and Neural networks to operate stock trading; however, Deep Q learning demonstrates better performance. In particular, the return rate of the Deep Q learning model is higher than the Buy-and-hold strategy and Taiwan Weighted Stock Index if considering exchange rate parameters.
參考文獻	王泓仁. 台幣匯率對我國經濟金融活動之影響. 中央銀行季刊》,(),–, Wang, Hung-Jen (),“e Impacts of NT Dollar Exchange Rates on Taiwan＇s Economy＂, Quarterly Reviews, Central Bank of the Republic of China (Taiwan),(), 2005. 徐千婷. 匯率與總體經濟變數之關係: 台灣實證分析, 2006. Wei Bao, Jun Yue, and Yulei Rao. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PloS one, 12(7):e0180944, 2017. Alessio Brini and Daniele Tantari. Deep reinforcement trading with predictable returns. arXiv preprint arXiv:2104.14683, 2021. Arthur Charpentier, Romuald Elie, and Carl Remlinger. Reinforcement learning in economics and finance. Computational Economics, pages 1–38, 2021. Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine learning, 20(3): 273–297, 1995. Li Deng and Dong Yu. Deep learning: methods and applications. Foundations and trends in signal processing, 7(3–4):197–387, 2014. Eugene F Fama. The behavior of stock-market prices. The journal of Business, 38(1): 34–105, 1965. Eugene F Fama. Random walks in stock market prices. Financial analysts journal, 51(1): 75–80, 1995. Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. Deep learning, volume 1. MIT press Cambridge, 2016. Chien Yi Huang. Financial trading as a game: A deep reinforcement learning approach. arXiv preprint arXiv:1807.02787, 2018. Leslie Pack Kaelbling, Michael L Littman, and Andrew W Moore. Reinforcement learning: A survey. Journal of artificial intelligence research, 4:237–285, 1996. Konstantina Kourou, Themis P Exarchos, Konstantinos P Exarchos, Michalis V Karamouzis, and Dimitrios I Fotiadis. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal, 13:8–17, 2015. Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. nature, 521(7553): 436–444, 2015. Jae Won Lee. Stock price prediction using reinforcement learning. In ISIE 2001. 2001 IEEE International Symposium on Industrial Electronics Proceedings (Cat. No. 01TH8570), volume 1, pages 690–695. IEEE, 2001. Jae Won Lee, Jonghun Park, O Jangmin, Jongwoo Lee, and Euyseok Hong. A multiagent approach to q-learning for daily stock trading. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 37(6):864–877, 2007. Pierre Menard, Omar Darwiche Domingues, Xuedong Shang, and Michal Valko. Ucb momentum q-learning: Correcting the bias without forgetting. arXiv preprint arXiv: 2103.01312, 2021. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013. Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, et al. Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv: 1507.04296, 2015. Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. Multimodal deep learning. In ICML, 2011. Kate Phylaktis and Fabiola Ravazzolo. Stock prices and exchange rate dynamics. Journal of international Money and Finance, 24(7):1031–1053, 2005. G William Schwert. Business cycles, financial crises, and stock volatility. In CarnegieRochester Conference series on public policy, volume 31, pages 83–125. E Richard S Sutton and Andrew G Barto. Reinforcement learning: An introduction. MIT press, 2018. Zhuoran Xiong, Xiao-Yang Liu, Shan Zhong, Hongyang Yang, and Anwar Walid. Practical deep reinforcement learning approach for stock trading. arXiv preprint arXiv:1811.07522, 2018. Paul D Yoo, Maria H Kim, and Tony Jan. Machine learning techniques and use of event information for stock market prediction: A survey and evaluation. In International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’06), volume 2, pages 835–841. IEEE, 2005.
描述	碩士國立政治大學經濟學系 108258021
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0108258021
資料類型	thesis

dc.contributor.advisor	蔡炎龍<br>蕭明福	zh_TW
dc.contributor.advisor	Tsai, Yen-Lung<br>Shaw, Ming-Fu	en_US
dc.contributor.author (Authors)	黃瑜萍	zh_TW
dc.contributor.author (Authors)	Huang, Yu-Ping	en_US
dc.creator (作者)	黃瑜萍	zh_TW
dc.creator (作者)	Huang, Yu-Ping	en_US
dc.date (日期)	2021	en_US
dc.date.accessioned	4-Aug-2021 15:58:49 (UTC+8)	-
dc.date.available	4-Aug-2021 15:58:49 (UTC+8)	-
dc.date.issued (上傳時間)	4-Aug-2021 15:58:49 (UTC+8)	-
dc.identifier (Other Identifiers)	G0108258021	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/136559	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	經濟學系	zh_TW
dc.description (描述)	108258021	zh_TW
dc.description.abstract (摘要)	本研究引入深度 Q 學習方法，建構一個自動化股票交易系統，研究範圍包含台灣股票市場 14 家科技業公司。研究期間為 2016 年 1 月 4 日至 2020年 12 月 31 日。本研究數據資料有兩種型態 (1) 股票資訊，(2) 股票資訊加上匯率參數。我們將深度 Q 學習的模型，與不同模型和其他策略相比較，以檢測深度 Q 學習是否更適用於股票交易。實證結果發現支持向量機與神經網路在實務面上難以進行股票交易操作，而深度 Q 學習的模型則具有相對好的成效。尤其，加入匯率參數的深度 Q 學習，獲得的報酬皆優於買入持有策略和台灣加權股價指數。	zh_TW
dc.description.abstract (摘要)	This research introduces the Deep Q learning model to construct an automated stock trading system. Our samples are 14 Taiwanese technology companies. Specifically, we include two types of data, (1) stock information and (2) stock information and exchange rate parameters, which are collected from the Taiwan stock market. The sampling period is from Jan 4, 2016 to Dec 31, 2020. We compare our main model, Deep Q learning, with different models and strategies to examine whether Deep Q learning is more applicable to stock trading. The empirical results show that it is difficult for Support vector machines and Neural networks to operate stock trading; however, Deep Q learning demonstrates better performance. In particular, the return rate of the Deep Q learning model is higher than the Buy-and-hold strategy and Taiwan Weighted Stock Index if considering exchange rate parameters.	en_US
dc.description.tableofcontents	致謝 i 中文摘要 ii Abstract iii 目錄 iv 表目錄 vi 圖目錄 vii 第一章緒論 1 第一節研究背景與動機 1 第二節強化學習簡述 1 第三節問題設定 2 第二章文獻回顧 3 第一節匯率與股市關聯性 3 第二節強化學習在金融領域之應用 3 第三章機器學習與深度學習模型探討 5 第一節支持向量機 5 第二節神經網路 7 一、神經網路架構 9 二、激發函數 10 三、損失函數 12 第三節強化學習 14 一、馬可夫決策過程 15 二、價值函數 16 三、蒙地卡羅法及時差學習法 17 四、深度強化學習 19 第四章資料來源及說明 21 第一節股票資訊 21 第二節匯率參數 22 第三節模型使用類型輸入資料 22 第五章實證結果 24 第一節模型結構介紹 24 第二節支持向量機、神經網路對於股票資訊預測 24 第三節深度強化學習模型之結構變化與過程 28 第四節深度強化學習模型實證結果 34 一、深度強化學習與買入持有策略 34 二、深度強化學習與台灣加權股價指數 36 三、深度 Q 學習綜合比較 38 四、預測模型 (支持向量機、神經網路) 與深度強化學習模型之比較 42 第六章結論與未來展望 43 參考文獻 44	zh_TW
dc.format.extent	2358372 bytes	-
dc.format.mimetype	application/pdf	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0108258021	en_US
dc.subject (關鍵詞)	深度學習	zh_TW
dc.subject (關鍵詞)	強化學習	zh_TW
dc.subject (關鍵詞)	深度 Q 學習	zh_TW
dc.subject (關鍵詞)	匯率	zh_TW
dc.subject (關鍵詞)	股票交易	zh_TW
dc.subject (關鍵詞)	Deep learning	en_US
dc.subject (關鍵詞)	Reinforcement learning	en_US
dc.subject (關鍵詞)	Deep Q learning	en_US
dc.subject (關鍵詞)	Exchange rate	en_US
dc.subject (關鍵詞)	Stock trading	en_US
dc.title (題名)	深度強化學習之模型比較: 以股票自動交易系統為例	zh_TW
dc.title (題名)	A Comparison of Deep Reinforcement Learning Models: The Case of Stock Automated Trading System	en_US
dc.type (資料類型)	thesis	en_US
dc.relation.reference (參考文獻)	王泓仁. 台幣匯率對我國經濟金融活動之影響. 中央銀行季刊》,(),–, Wang, Hung-Jen (),“e Impacts of NT Dollar Exchange Rates on Taiwan＇s Economy＂, Quarterly Reviews, Central Bank of the Republic of China (Taiwan),(), 2005. 徐千婷. 匯率與總體經濟變數之關係: 台灣實證分析, 2006. Wei Bao, Jun Yue, and Yulei Rao. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PloS one, 12(7):e0180944, 2017. Alessio Brini and Daniele Tantari. Deep reinforcement trading with predictable returns. arXiv preprint arXiv:2104.14683, 2021. Arthur Charpentier, Romuald Elie, and Carl Remlinger. Reinforcement learning in economics and finance. Computational Economics, pages 1–38, 2021. Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine learning, 20(3): 273–297, 1995. Li Deng and Dong Yu. Deep learning: methods and applications. Foundations and trends in signal processing, 7(3–4):197–387, 2014. Eugene F Fama. The behavior of stock-market prices. The journal of Business, 38(1): 34–105, 1965. Eugene F Fama. Random walks in stock market prices. Financial analysts journal, 51(1): 75–80, 1995. Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. Deep learning, volume 1. MIT press Cambridge, 2016. Chien Yi Huang. Financial trading as a game: A deep reinforcement learning approach. arXiv preprint arXiv:1807.02787, 2018. Leslie Pack Kaelbling, Michael L Littman, and Andrew W Moore. Reinforcement learning: A survey. Journal of artificial intelligence research, 4:237–285, 1996. Konstantina Kourou, Themis P Exarchos, Konstantinos P Exarchos, Michalis V Karamouzis, and Dimitrios I Fotiadis. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal, 13:8–17, 2015. Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. nature, 521(7553): 436–444, 2015. Jae Won Lee. Stock price prediction using reinforcement learning. In ISIE 2001. 2001 IEEE International Symposium on Industrial Electronics Proceedings (Cat. No. 01TH8570), volume 1, pages 690–695. IEEE, 2001. Jae Won Lee, Jonghun Park, O Jangmin, Jongwoo Lee, and Euyseok Hong. A multiagent approach to q-learning for daily stock trading. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 37(6):864–877, 2007. Pierre Menard, Omar Darwiche Domingues, Xuedong Shang, and Michal Valko. Ucb momentum q-learning: Correcting the bias without forgetting. arXiv preprint arXiv: 2103.01312, 2021. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013. Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, et al. Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv: 1507.04296, 2015. Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. Multimodal deep learning. In ICML, 2011. Kate Phylaktis and Fabiola Ravazzolo. Stock prices and exchange rate dynamics. Journal of international Money and Finance, 24(7):1031–1053, 2005. G William Schwert. Business cycles, financial crises, and stock volatility. In CarnegieRochester Conference series on public policy, volume 31, pages 83–125. E Richard S Sutton and Andrew G Barto. Reinforcement learning: An introduction. MIT press, 2018. Zhuoran Xiong, Xiao-Yang Liu, Shan Zhong, Hongyang Yang, and Anwar Walid. Practical deep reinforcement learning approach for stock trading. arXiv preprint arXiv:1811.07522, 2018. Paul D Yoo, Maria H Kim, and Tony Jan. Machine learning techniques and use of event information for stock market prediction: A survey and evaluation. In International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’06), volume 2, pages 835–841. IEEE, 2005.	zh_TW
dc.identifier.doi (DOI)	10.6814/NCCU202100671	en_US

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM