Policies in Nonhomogeneous Markov Decision Processes (非均質馬可夫決策系統的決策空間) | Academic Output

Title	非均質馬可夫決策系統的決策空間 Policies in Nonhomogeneous Markov Decision Processes
Author	劉任昌 Liou, Chen Chang
Contributors	陸行 Paul Lu; 劉任昌 Liou, Chen Chang
Keywords	Nonhomogeneous Markov decision processes (非均質馬可夫決策系統); Nonhomogeneous Markov Processes
Date	1994
Uploaded	29-Apr-2016 16:32:18 (UTC+8)
Abstract	To find the optimal first-period decision of an infinite-horizon nonhomogeneous Markov decision process (NMDP), we usually have to recast the problem as a finite-horizon dynamic program. The dynamic program can be written in composite-function form or in the most common form, a linear program. Hopp, Bean and Duenyas (1992) formulate a mixed integer program (MIP) to determine whether a finite time horizon is a forecast horizon in an NMDP. Their formulation is solved by a complex Benders decomposition. In this thesis, we examine in detail the contraction property and the affine mapping property of the NMDP. These properties relieve us of the complex MIP formulation and the Benders decomposition algorithm. The main contribution of the thesis is to show that it is not necessary to search the whole feasible solution space of their MIP problem to determine the optimal policies. We only need to check a finite number of vertices of a polyhedral set shaped by the solution of the NMDP. The analysis gives insight into the NMDP and facilitates the process of determining the forecast horizon. Furthermore, the NMDP formulation is presented as a simple dynamic function, which differs from the linear program presented by Hopp, Bean and Duenyas.
Description	Master's thesis; National Chengchi University (國立政治大學); Department of Applied Mathematics; 81155006
Source	http://thesis.lib.nccu.edu.tw/record/#B2002003903
Type	thesis

dc.contributor.advisor	陸行	zh_TW
dc.contributor.advisor	Paul Lu	en_US
dc.contributor.author (Author)	劉任昌	zh_TW
dc.contributor.author (Author)	Liou, Chen Chang	en_US
dc.creator (Author)	劉任昌	zh_TW
dc.creator (Author)	Liou, Chen Chang	en_US
dc.date (Date)	1994	en_US
dc.date.accessioned	29-Apr-2016 16:32:18 (UTC+8)	-
dc.date.available	29-Apr-2016 16:32:18 (UTC+8)	-
dc.date.issued (Uploaded)	29-Apr-2016 16:32:18 (UTC+8)	-
dc.identifier (Other identifier)	B2002003903	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/88735	-
dc.description (Description)	Master's thesis	zh_TW
dc.description (Description)	National Chengchi University (國立政治大學)	zh_TW
dc.description (Description)	Department of Applied Mathematics	zh_TW
dc.description (Description)	81155006	zh_TW
dc.description.abstract (Abstract)	To find the optimal first-period decision of an infinite-horizon nonhomogeneous Markov decision process (NMDP), we usually have to recast the problem as a finite-horizon dynamic program. The dynamic program can be written in composite-function form or in the most common form, a linear program.	zh_TW
dc.description.abstract (Abstract)	Hopp, Bean and Duenyas (1992) formulate a mixed integer program (MIP) to determine whether a finite time horizon is a forecast horizon in a nonhomogeneous Markov decision process (NMDP). Their formulation is solved by a complex Benders decomposition. In this thesis, we examine in detail the contraction property and the affine mapping property of the NMDP. These properties relieve us of the complex MIP formulation and the Benders decomposition algorithm. The main contribution of the thesis is to show that it is not necessary to search the whole feasible solution space of their MIP problem to determine the optimal policies. We only need to check a finite number of vertices of a polyhedral set shaped by the solution of the NMDP. The analysis gives insight into the NMDP and facilitates the process of determining the forecast horizon. Furthermore, the NMDP formulation is presented as a simple dynamic function, which differs from the linear program presented by Hopp, Bean and Duenyas.	en_US
dc.description.tableofcontents	Acknowledgments; Summary; Abstract; Contents; List of Figures; 1 Introduction; 2 Model Formulation; 3 A Stopping Rule Using MIP; 4 The Property of Contraction Mapping in the NMDP; 5 The Property of Affine Mapping in the NMDP; 6 A Simple but Powerful Stopping Rule; 7 Conclusions and Further Work; A Bender's decompositions. List of Figures: 1.1 Stopping rule algorithms; 2.1 Markov decision processes with 3 states; 4.1 S={1,2}, l₂=M; 4.2 S={1,2}, l₂=M, multistage profiles; 4.3 Contraction mapping profiles; 6.1 Simple but powerful stopping rule	zh_TW
dc.source.uri (Source)	http://thesis.lib.nccu.edu.tw/record/#B2002003903	en_US
dc.subject (Keywords)	Nonhomogeneous Markov decision processes (非均質馬可夫決策系統)	zh_TW
dc.subject (Keywords)	Nonhomogeneous Markov Processes	en_US
dc.title (Title)	非均質馬可夫決策系統的決策空間	zh_TW
dc.title (Title)	Policies in Nonhomogeneous Markov Decision Processes	en_US
dc.type (Type)	thesis	en_US
