Title 非均質馬可夫決策系統的決策空間
Policies in Nonhomogeneous Markov Decision Processes
Author 劉任昌
Liou, Chen Chang
Contributors 陸行
Paul Lu
劉任昌
Liou, Chen Chang
Keywords Nonhomogeneous Markov decision systems
Nonhomogeneous Markov Processes
Date 1994
Upload time 29 April 2016 16:32:18 (UTC+8)
Abstract   When solving for the first-period optimal solution of an infinite-horizon nonhomogeneous Markov decision process, we usually have to express the problem as a finite-horizon dynamic program. The dynamic program can be written in composite-function form or in the more common linear programming form.
  Hopp, Bean and Duenyas (1992) formulate a mixed integer program (MIP) to determine whether a finite time horizon is a forecast horizon in a nonhomogeneous Markov decision process (NMDP). Their formulation is solved by a complex Benders decomposition. In this thesis, we examine in detail the contraction property and the affine mapping property of the NMDP. These properties free us from the complex MIP formulation and the Benders decomposition algorithm. The main contribution of the thesis is to show that the optimal policies can be determined without searching the whole feasible solution space of their MIP problem: we only need to check a finite number of vertices of a polyhedral set shaped by the solution of the NMDP. The analysis offers insight into the NMDP and facilitates the process of determining the forecast horizon. Furthermore, this NMDP formulation is presented in the form of a simple dynamic function, which differs from the linear program presented by Hopp, Bean and Duenyas.
Description Master's
National Chengchi University
Department of Applied Mathematics
81155006
Source http://thesis.lib.nccu.edu.tw/record/#B2002003903
Type thesis
dc.contributor.advisor 陸行
dc.contributor.advisor Paul Lu
dc.identifier (Other identifier) B2002003903
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/88735
dc.description.tableofcontents Acknowledgements
     Synopsis
     Abstract
     Contents
     List of Figures
     1 Introduction
     2 Model Formulation
     3 A Stopping Rule Using MIP
     4 The Property of Contraction Mapping in the NMDP
     5 The Property of Affine Mapping in the NMDP
     6 A Simple but Powerful Stopping Rule
     7 Conclusions and Further Work
     A Benders decomposition

     List of Figures
     1.1 Stopping rule algorithms
     2.1 Markov decision processes with 3 states
     4.1 S={1,2}, l2=M
     4.2 S={1,2}, l2=M, multistage profiles
     4.3 Contraction mapping profiles
     6.1 Simple but powerful stopping rule
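The abstract notes that the first-period problem is usually expressed as a finite-horizon dynamic program. As a rough illustration only, the sketch below runs standard backward induction on a tiny nonhomogeneous MDP. It is not the stopping rule or the formulation developed in the thesis. The function name `backward_induction`, the data layout of `P` and `R`, the discount factor `beta`, and the two-state example are all assumptions made up for this sketch.

```python
# Assumed (hypothetical) data layout, not taken from the thesis:
#   P[t][a][s][s2] = probability of moving from state s to s2 under action a in period t
#   R[t][a][s]     = one-period reward for taking action a in state s in period t

def backward_induction(P, R, beta, terminal=None):
    """Solve a finite-horizon nonhomogeneous MDP by backward induction.

    Returns (v, policy): v[s] is the optimal value at period 0 in state s,
    and policy[t][s] is an optimal action in period t and state s.
    Ties are broken in favour of the lowest action index.
    """
    horizon = len(P)
    n_states = len(R[0][0])
    v = list(terminal) if terminal is not None else [0.0] * n_states
    policy = []
    for t in reversed(range(horizon)):
        new_v, decisions = [], []
        for s in range(n_states):
            best_a, best_q = 0, float("-inf")
            for a in range(len(P[t])):
                expected = sum(P[t][a][s][s2] * v[s2] for s2 in range(n_states))
                q = R[t][a][s] + beta * expected
                if q > best_q:
                    best_a, best_q = a, q
            new_v.append(best_q)
            decisions.append(best_a)
        v = new_v
        policy.insert(0, decisions)
    return v, policy


# Toy example: action 0 keeps the current state, action 1 swaps the two states.
STAY = [[1.0, 0.0], [0.0, 1.0]]
SWAP = [[0.0, 1.0], [1.0, 0.0]]
P = [[STAY, SWAP], [STAY, SWAP]]          # same transitions in both periods
R = [
    [[1.0, 0.0], [0.0, 0.0]],             # period 0 rewards: R[0][action][state]
    [[0.0, 4.0], [0.0, 0.0]],             # period 1 rewards differ (nonhomogeneous)
]

values_2, policy_2 = backward_induction(P, R, beta=0.5)
values_1, policy_1 = backward_induction(P[:1], R[:1], beta=0.5)
print(policy_1[0], policy_2[0])           # first-period decisions for horizons 1 and 2
```

In this toy problem the first-period decision in state 0 changes when the horizon grows from one period to two, which is the kind of sensitivity that motivates asking whether a given finite horizon is a forecast horizon.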