強化學習應用於美式選擇權評價 | Publication

Publications-Theses

Article View/Open

pdf(138)

Publication Export

Google Scholar^TM

NCCU Library

Discovery System

Citation Infomation

Related Publications in TAIR

Simple Record
Full Record

題名	強化學習應用於美式選擇權評價 Applying Reinforcement Learning to American Option Pricing
作者	許琳 Xu, Lin
貢獻者	江彌修 Chiang, Mi-Hsiu 許琳 Xu, Lin
關鍵詞	美式選擇權定價強化學習最小平方策略迭代最小平方蒙地卡羅法 American option Pricing Reinforcement learning LSPI FQI LSM
日期	2019
上傳時間	1-Jul-2019 10:48:51 (UTC+8)
摘要	本文研究了強化學習應用於美式選擇權定價問題，首先，使用 Li, Szepesvari and Schuurmans 提出之最小平方策略迭代(LSPI)演算法學習美式賣權履約策略並進行定價，將蘋果公司美式股票選擇權之真實市場數據處理後套用於 LSPI 方法，並將 LSPI 方法與 Tsitsiklis and Van Roy提出之FQI方法和傳統最小平方蒙地卡羅法比較定價準確性。其次，使用符合金融市場之分析方式，將賣權分價內外不同情況分析，並進行敏感度分析，觀察強化學習使用之參數對於定價結果之影響。模擬結果表示，LSPI 方法與 FQI 方法總體優於 LSM 方法，強化學習對於愈價內之賣權定價愈準確。本文發現強化學習在商品定價領域仍有很大研究潛力，特別是模擬路徑方式與執行動作多樣性方面值得進一步討論。 In this paper we apply the reinforcement learning method to American options pricing. We mainly consider the least squares policy iteration (LSPI) proposed by Li, Szepesvari and Schuurmans(2009) to learn the exercise policy and pricing method of American put options. We price AAPL American stock option with processed real market data, and compare the accuracy between LSPI, FQI proposed by Tsitsiklis and Van Roy(2001), and the standard least square Monte Carlo method (LSM). In order to investigate the influence of parameters used in LSPI on pricing results, the analysis method in financial market, sensitivity analysis is carried out under different situations which are divided according to whether the put option is in-the-money or out-of-the-money. The simulation result shows that LSPI and FQI are superior to LSM in general, and LSPI is more accurate in pricing deeper in-the-money put option. We also find that the reinforcement learning method still has great research potential in the field of derivatives pricing. In particular, there is a need for further investigation on simulation method of price path or selecting action variety.
參考文獻	[1] Barone-Adesi, G. and Whaley, R. (1987). Efficient Analytical Approximation of American Option Values. Journal of Finance, Vol. 42, 301-320. [2] Bellman, R. (1957). A Markovian Decision Process. Journal of Mathematics and Mechanics, Vol. 6, 679–684. [3] Black, F. and Scholes, M. (1973). The Pricing of Options and Corporate Liabilities. Journal of Political Economy, Vol. 81, 637-659. [4] Boyle, P. P. (1977). Options: A Monte Carlo Approach. Journal of Financial Economics, Vol. 4, 323–338. [5] Boyle, P. P. (1986). A lattice framework for option pricing with two state variables, Journal of Financial and Quantitative Analysis, Vol. 23(1), 1-12. [6] Brennan, M. and Schwartz, E. (1977). The Valuation of American Put Options. Journal of Finance, Vol. 32, 449-462. [7] Cox, J. C., Ross S. A. and Rubinstein, M. (1979). Option Pricing: A simplified Approach. Journal of Financial Economics, Vol. 7, 229-264. [8] Dubrov, B. (2015). Monte Carlo Simulation with Machine Learning for Pricing American Options and Convertible Bonds. SSRN. [9] Geske, R. (1979). The Valuation of Compound Options. Journal of Financial Economics, Vol. 7, 63–81. [10] Geske, R. (1979). A Note on an Analytic Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 7, 275–380. [11] Geske, R. (1981). Comments on Whaley’s Note. Journal of Financial Economics, Vol. 9, 213–215. [12] Geske, R. and Johnson, H. E. (1984). The American Put Valued Analytically. Journal of Finance, Vol. 39, 1511-1524. [13] Haug, E.G., Haug, J. and Lewis, A. (2003). Back to Basics: a New Approach to the Discrete Dividend Problem. Wilmott Magazine, 37–47. [14] Howard, R. A. (1960). Dynamic Programming and Markov Processes. Cambridge, Mass: MIT Press. [15] Hull, J. C. (2011). Options, Futures, and Other Derivatives, 8th edition. United States of America: Prentice Hall. [16] Johnson, H. (1983). An Analytical Approximation for the American Put Price. Journal of Financial and Quantitative Analysis, Vol. 18, 141-148. [17] Ju., N. and Zhong, R. (1998). An Approximate Formula for Pricing American Options. Review of Financial Studies, Vol. 11, 627-646. [18] Lagoudakis, M. G. and Parr, R. (2003). Least-Squares Policy Iteration. Journal of Machine Learning Research , Vol. 4, 1107 – 1149. [19] Li, Y., Szepesvari, C. and Schuurmans, D. (2009). Learning Exercise Policies for American Options. In Proc. of the Twelfth International Conference on Artificial Intelligence and Statistics, JMLR: W&CP, Vol. 5, 352-359. [20] Longstaff, F. A. and Schwartz, E. S. (2001). Valuing American options by simulation: a simple Least-Squares approach. Review Financial Studies, Vol. 14, 113-147. [21] Medvedev, A. and Scaillet, O. (2010). Pricing American options under stochastic volatility and stochastic interest rates. Journal of Financial Economics, Vol. 98, 145–159. [22] Merton, R. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, Vol. 2, 125–144. [23] Roll, R. (1977). An Analytical Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 5, 251-258. [24] Tilley, J. A. (1993). Valuing American options in a path simulation model. Transactions of the Society of Actuaries, Vol. 45, 83–104. [25] Tsitsiklis, J. N. and Van Roy, B. (2001). Regression Methods for Pricing Complex American-style Options. IEEE Transactions on Neural Networks(special issue on computational finance), Vol. 12(4), 694–703. [26] Whaley, R. E. (1981). On the Valuation of American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 10, 207–211. [27]陳戚光，（2001）。選擇權:理論.實務與應用。台灣：智勝文化。
描述	碩士國立政治大學金融學系 106352047
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0106352047
資料類型	thesis

dc.contributor.advisor	江彌修	zh_TW
dc.contributor.advisor	Chiang, Mi-Hsiu	en_US
dc.contributor.author (Authors)	許琳	zh_TW
dc.contributor.author (Authors)	Xu, Lin	en_US
dc.creator (作者)	許琳	zh_TW
dc.creator (作者)	Xu, Lin	en_US
dc.date (日期)	2019	en_US
dc.date.accessioned	1-Jul-2019 10:48:51 (UTC+8)	-
dc.date.available	1-Jul-2019 10:48:51 (UTC+8)	-
dc.date.issued (上傳時間)	1-Jul-2019 10:48:51 (UTC+8)	-
dc.identifier (Other Identifiers)	G0106352047	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/124147	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	金融學系	zh_TW
dc.description (描述)	106352047	zh_TW
dc.description.abstract (摘要)	本文研究了強化學習應用於美式選擇權定價問題，首先，使用 Li, Szepesvari and Schuurmans 提出之最小平方策略迭代(LSPI)演算法學習美式賣權履約策略並進行定價，將蘋果公司美式股票選擇權之真實市場數據處理後套用於 LSPI 方法，並將 LSPI 方法與 Tsitsiklis and Van Roy提出之FQI方法和傳統最小平方蒙地卡羅法比較定價準確性。其次，使用符合金融市場之分析方式，將賣權分價內外不同情況分析，並進行敏感度分析，觀察強化學習使用之參數對於定價結果之影響。模擬結果表示，LSPI 方法與 FQI 方法總體優於 LSM 方法，強化學習對於愈價內之賣權定價愈準確。本文發現強化學習在商品定價領域仍有很大研究潛力，特別是模擬路徑方式與執行動作多樣性方面值得進一步討論。	zh_TW
dc.description.abstract (摘要)	In this paper we apply the reinforcement learning method to American options pricing. We mainly consider the least squares policy iteration (LSPI) proposed by Li, Szepesvari and Schuurmans(2009) to learn the exercise policy and pricing method of American put options. We price AAPL American stock option with processed real market data, and compare the accuracy between LSPI, FQI proposed by Tsitsiklis and Van Roy(2001), and the standard least square Monte Carlo method (LSM). In order to investigate the influence of parameters used in LSPI on pricing results, the analysis method in financial market, sensitivity analysis is carried out under different situations which are divided according to whether the put option is in-the-money or out-of-the-money. The simulation result shows that LSPI and FQI are superior to LSM in general, and LSPI is more accurate in pricing deeper in-the-money put option. We also find that the reinforcement learning method still has great research potential in the field of derivatives pricing. In particular, there is a need for further investigation on simulation method of price path or selecting action variety.	en_US
dc.description.tableofcontents	第一章簡介 1 第二章文獻回顧 5 第三章研究方法 8 第一節馬可夫決策過程MDP 9 第二節近似價值函數 14 第三節最小平方策略迭代LSPI 18 第四節美式選擇權定價 20 （一） LSPI 20 （二） FQI 23 （三）最小平方蒙地卡羅方法LSM 24 （四）基函數設定 25 第四章實證分析 27 第一節實證方法 27 （一）模型訓練方法 29 （二）模型套用及結果 31 第二節敏感度分析 42 （一）股價 42 （二）股價變動率 43 （三）股價波動率 44 （四）無風險利率 45 第五章結論與建議 46 參考文獻 47	zh_TW
dc.format.extent	2457193 bytes	-
dc.format.mimetype	application/pdf	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0106352047	en_US
dc.subject (關鍵詞)	美式選擇權	zh_TW
dc.subject (關鍵詞)	定價	zh_TW
dc.subject (關鍵詞)	強化學習	zh_TW
dc.subject (關鍵詞)	最小平方策略迭代	zh_TW
dc.subject (關鍵詞)	最小平方蒙地卡羅法	zh_TW
dc.subject (關鍵詞)	American option	en_US
dc.subject (關鍵詞)	Pricing	en_US
dc.subject (關鍵詞)	Reinforcement learning	en_US
dc.subject (關鍵詞)	LSPI	en_US
dc.subject (關鍵詞)	FQI	en_US
dc.subject (關鍵詞)	LSM	en_US
dc.title (題名)	強化學習應用於美式選擇權評價	zh_TW
dc.title (題名)	Applying Reinforcement Learning to American Option Pricing	en_US
dc.type (資料類型)	thesis	en_US
dc.relation.reference (參考文獻)	[1] Barone-Adesi, G. and Whaley, R. (1987). Efficient Analytical Approximation of American Option Values. Journal of Finance, Vol. 42, 301-320. [2] Bellman, R. (1957). A Markovian Decision Process. Journal of Mathematics and Mechanics, Vol. 6, 679–684. [3] Black, F. and Scholes, M. (1973). The Pricing of Options and Corporate Liabilities. Journal of Political Economy, Vol. 81, 637-659. [4] Boyle, P. P. (1977). Options: A Monte Carlo Approach. Journal of Financial Economics, Vol. 4, 323–338. [5] Boyle, P. P. (1986). A lattice framework for option pricing with two state variables, Journal of Financial and Quantitative Analysis, Vol. 23(1), 1-12. [6] Brennan, M. and Schwartz, E. (1977). The Valuation of American Put Options. Journal of Finance, Vol. 32, 449-462. [7] Cox, J. C., Ross S. A. and Rubinstein, M. (1979). Option Pricing: A simplified Approach. Journal of Financial Economics, Vol. 7, 229-264. [8] Dubrov, B. (2015). Monte Carlo Simulation with Machine Learning for Pricing American Options and Convertible Bonds. SSRN. [9] Geske, R. (1979). The Valuation of Compound Options. Journal of Financial Economics, Vol. 7, 63–81. [10] Geske, R. (1979). A Note on an Analytic Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 7, 275–380. [11] Geske, R. (1981). Comments on Whaley’s Note. Journal of Financial Economics, Vol. 9, 213–215. [12] Geske, R. and Johnson, H. E. (1984). The American Put Valued Analytically. Journal of Finance, Vol. 39, 1511-1524. [13] Haug, E.G., Haug, J. and Lewis, A. (2003). Back to Basics: a New Approach to the Discrete Dividend Problem. Wilmott Magazine, 37–47. [14] Howard, R. A. (1960). Dynamic Programming and Markov Processes. Cambridge, Mass: MIT Press. [15] Hull, J. C. (2011). Options, Futures, and Other Derivatives, 8th edition. United States of America: Prentice Hall. [16] Johnson, H. (1983). An Analytical Approximation for the American Put Price. Journal of Financial and Quantitative Analysis, Vol. 18, 141-148. [17] Ju., N. and Zhong, R. (1998). An Approximate Formula for Pricing American Options. Review of Financial Studies, Vol. 11, 627-646. [18] Lagoudakis, M. G. and Parr, R. (2003). Least-Squares Policy Iteration. Journal of Machine Learning Research , Vol. 4, 1107 – 1149. [19] Li, Y., Szepesvari, C. and Schuurmans, D. (2009). Learning Exercise Policies for American Options. In Proc. of the Twelfth International Conference on Artificial Intelligence and Statistics, JMLR: W&CP, Vol. 5, 352-359. [20] Longstaff, F. A. and Schwartz, E. S. (2001). Valuing American options by simulation: a simple Least-Squares approach. Review Financial Studies, Vol. 14, 113-147. [21] Medvedev, A. and Scaillet, O. (2010). Pricing American options under stochastic volatility and stochastic interest rates. Journal of Financial Economics, Vol. 98, 145–159. [22] Merton, R. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, Vol. 2, 125–144. [23] Roll, R. (1977). An Analytical Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 5, 251-258. [24] Tilley, J. A. (1993). Valuing American options in a path simulation model. Transactions of the Society of Actuaries, Vol. 45, 83–104. [25] Tsitsiklis, J. N. and Van Roy, B. (2001). Regression Methods for Pricing Complex American-style Options. IEEE Transactions on Neural Networks(special issue on computational finance), Vol. 12(4), 694–703. [26] Whaley, R. E. (1981). On the Valuation of American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 10, 207–211. [27]陳戚光，（2001）。選擇權:理論.實務與應用。台灣：智勝文化。	zh_TW
dc.identifier.doi (DOI)	10.6814/NCCU201900058	en_US

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM