學術產出-學位論文

文章檢視/開啟

書目匯出

Google ScholarTM

政大圖書館

引文資訊

TAIR相關學術產出

題名 交叉驗證用於迴歸樣條的模型選擇之探討
作者 謝式斌
貢獻者 黃子銘
謝式斌
關鍵詞 交叉驗證
迴歸樣條
節點
模型選擇
Knots
Cross validation
Splines
日期 2018
上傳時間 2-二月-2018 10:45:49 (UTC+8)
摘要 在無母數的迴歸當中,因為原始的函數類型未知,所以常用已知特定類型的函數來近似未知的函數,而spline函數也可以用來近似未知的函數,但是要估計spline函數就需要設定節點(knots),越多的節點越能準確近似原始函數的內容,可是如果節點太多有較多的參數要估計, 就會變得比較不準確,所以選擇適合節點個數就變得很重要。
在本研究中,用交叉驗證的方式來尋找適合的節點個數, 考慮了幾種不同切割資料方式來決定訓練資料和測試資料, 並比較不同切割資料的方式下選擇節點的結果與函數估計的效果。
In this thesis, I consider the problem of estimating an unknown regression function using spline approximation.
Splines are piecewise polynomials jointed at knots. When using splines to approximate unknown functions, it is crucial to determine the number of knots and the knot locations. In this thesis, I determine the knot locations using least squares for given a given number of knots, and use cross-validation to find appropriate number of knots. I consider three methods to split the data into training data and testing data, and compare the estimation results.
參考文獻 David Ruppert, Selecting the number of knots for penalized splines. Journal of Computational and Graphical Statistics, 11(4):735–757, 2002.

E.F. Halpern. Bayesian spline regression when the number of knots is unknown. Journal of the Royal Statistical Society B, 35:347–60, 1973.

Issac Jacob Schoenberg. Contributions to the problem of approximation of equidistant data by analytic functions ,part b: On the problem of osculatory interpolation, a second class of analytic appoximation formulae. Quart. Appl. Math, 4:112–141, 1983.

Jeff Racine. Feasible cross-validatory model selection for general stationary processes. Journal of Applied Econometrics, 12(2):169–179, 1997.

J Shao. Linear model selection by cross-validation. Journal of the American Statistical Association, 88(422):486–95, 1993.

Meyer Mary C. Inference using shape-restricted regression splines. The Annals of Applied Statistics, 2(3):1013–1033, 2008.

M Stone. Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society, 36:111–147, 1974.

R Picard, R and D Cook, R. Cross-validation of regression models. Journal of the American Statistical Association, 79:575–583, 1984.

S Geisser. A predictive sample reuse method with application. Journal of the American Statistical Association, 70:320–8, 1975.
描述 碩士
國立政治大學
統計學系
104354028
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0104354028
資料類型 thesis
dc.contributor.advisor 黃子銘zh_TW
dc.contributor.author (作者) 謝式斌zh_TW
dc.creator (作者) 謝式斌zh_TW
dc.date (日期) 2018en_US
dc.date.accessioned 2-二月-2018 10:45:49 (UTC+8)-
dc.date.available 2-二月-2018 10:45:49 (UTC+8)-
dc.date.issued (上傳時間) 2-二月-2018 10:45:49 (UTC+8)-
dc.identifier (其他 識別碼) G0104354028en_US
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/115722-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 統計學系zh_TW
dc.description (描述) 104354028zh_TW
dc.description.abstract (摘要) 在無母數的迴歸當中,因為原始的函數類型未知,所以常用已知特定類型的函數來近似未知的函數,而spline函數也可以用來近似未知的函數,但是要估計spline函數就需要設定節點(knots),越多的節點越能準確近似原始函數的內容,可是如果節點太多有較多的參數要估計, 就會變得比較不準確,所以選擇適合節點個數就變得很重要。
在本研究中,用交叉驗證的方式來尋找適合的節點個數, 考慮了幾種不同切割資料方式來決定訓練資料和測試資料, 並比較不同切割資料的方式下選擇節點的結果與函數估計的效果。
zh_TW
dc.description.abstract (摘要) In this thesis, I consider the problem of estimating an unknown regression function using spline approximation.
Splines are piecewise polynomials jointed at knots. When using splines to approximate unknown functions, it is crucial to determine the number of knots and the knot locations. In this thesis, I determine the knot locations using least squares for given a given number of knots, and use cross-validation to find appropriate number of knots. I consider three methods to split the data into training data and testing data, and compare the estimation results.
en_US
dc.description.tableofcontents 第一章 緒論 8
第二章 文獻探討 9
第一節 Cross Validation 9
第二節 Spline節點的選擇 9
第三章 研究方法 11
第一節 迴歸函數估計 11
第二節 資料切割及交叉驗證 13
第四章 模擬資料分析 14
第一節 f非spline函數 14
第二節 f為spline函數 17
第五章 結果討論與建議 20
zh_TW
dc.format.extent 805857 bytes-
dc.format.mimetype application/pdf-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0104354028en_US
dc.subject (關鍵詞) 交叉驗證zh_TW
dc.subject (關鍵詞) 迴歸樣條zh_TW
dc.subject (關鍵詞) 節點zh_TW
dc.subject (關鍵詞) 模型選擇zh_TW
dc.subject (關鍵詞) Knotsen_US
dc.subject (關鍵詞) Cross validationen_US
dc.subject (關鍵詞) Splinesen_US
dc.title (題名) 交叉驗證用於迴歸樣條的模型選擇之探討zh_TW
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) David Ruppert, Selecting the number of knots for penalized splines. Journal of Computational and Graphical Statistics, 11(4):735–757, 2002.

E.F. Halpern. Bayesian spline regression when the number of knots is unknown. Journal of the Royal Statistical Society B, 35:347–60, 1973.

Issac Jacob Schoenberg. Contributions to the problem of approximation of equidistant data by analytic functions ,part b: On the problem of osculatory interpolation, a second class of analytic appoximation formulae. Quart. Appl. Math, 4:112–141, 1983.

Jeff Racine. Feasible cross-validatory model selection for general stationary processes. Journal of Applied Econometrics, 12(2):169–179, 1997.

J Shao. Linear model selection by cross-validation. Journal of the American Statistical Association, 88(422):486–95, 1993.

Meyer Mary C. Inference using shape-restricted regression splines. The Annals of Applied Statistics, 2(3):1013–1033, 2008.

M Stone. Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society, 36:111–147, 1974.

R Picard, R and D Cook, R. Cross-validation of regression models. Journal of the American Statistical Association, 79:575–583, 1984.

S Geisser. A predictive sample reuse method with application. Journal of the American Statistical Association, 70:320–8, 1975.
zh_TW