資料採礦技術在保險公司客戶保單貸款行為研究的應用

學術產出-Theses

Article View/Open

html(285)

Publication Export

Google Scholar^TM

政大圖書館

學術資源探索系統

Citation Infomation

No doi shows Citation Infomation

Simple Record
Full Record

題名	資料採礦技術在保險公司客戶保單貸款行為研究的應用
作者	邱蔚群 Lilian Chiu
貢獻者	鄭宇庭邱蔚群 Lilian Chiu
關鍵詞	資料採礦決策樹保單貸款類神經 C4.5
日期	2002
上傳時間	2009-09-14
摘要	摘要過去對於保險資料的研究多採用傳統統計方法，然而保險公司龐大資料庫中蘊含的寶貴資訊可能因此被遺漏。本研究目的是將資料採礦的技術應用到保險公司資料庫中的高雄縣市保戶保單貸款資料上，研究保戶利用保單貸款的行為，做為保險公司日後推行保單貸款的參考。從整理過後的資料中，用不同抽樣方法抽出不同樣本大小以及不同是否貸款比例的樣本，將連續變數做轉換後，建立決策樹和類神經模型，透過統計上的變異數分析，討論四個因子對預測結果好壞的影響。選出最好組合的樣本大小、是否貸款比例（已貸款：尚未貸款）、抽樣方法、以及建立的模型。最後將此最佳組合建立的C4.5決策樹轉換成規則，並探討其中正確率較高的幾項，作為給保險公司的參考。 Abstract In the past, the analysis of insurance data is usually conducted with traditional statistical methods, however a large amount of valuable information hidden might be left undiscovered. The purpose of this research is to apply data mining techniques to customer policy data taken from one of insurance company’s database in Kaoshuing city and county to study the behavior of customers taking loans against their policies as a reference for insurance company in promoting policy in the future. From the cleansed data, we sample policies of different sizes and percentage of policies with loans by different sampling methods, decision trees and neural network models, then through the significant interactions of ANOVA, discuss how the results being influenced by the four factors. We then choose the best model that manifests factors affecting customer’s behavior in taking out the loan thus providing insurance company a vital information in targeting its customers group.
參考文獻	參考文獻 l Berry, M. J. A., and Linoff, G. S. (1997), Data Mining Techniques: for Marketing, Sales, and Customer Support. John Wiley & Sons Inc, New York. l Berry, M. J. A., and Linoff, G. S. (2000), Mastering Data Mining Techniques, The Art & Science of Customer Relationship Management. John Wiley & Sons Inc., New York. l Breiman, L. Friedman, J.H., Olshen, R. A., and Stone, C. J. (1984).. Classification and Regression Trees. Wadsworth, Pacific Grove, California. l Dunham, M. H. (2003), Data Mining: Introductory and Advanced Topics. Pearson Education Inc., Upper Saddle River, New Jersey. l Freund, Y., and Schapire, R. E. (1996), “Experiments with a New Boosting Algorithm”. Machine Learning: Proceedings of the Thirteenth International Conference. l Friedman, J., Hastie, T., and Tibshirani R. (1998), Additive Logistic Regression: a Statistical View of Boosting, l Smith, M. (1993), Neural Networks for Statistical Modeling. Van Norstrand Reinhold, New York. l Terano, T., Liu, H., and Chen, A. L. P. (2000), Knowledge Discovery and Data Mining: Current Issues and New Applications. Springer-Verlag, Berling, Germany. l Witten, I. H., and Eibe, F. (2000), Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, San Francisco, California. l 中國科學技術大學生物醫學工程跨係委員會，神經網路及其應用，儒林圖書，1993。 l 呂奇傑，演化式類神經網路分類技術於資料探勘上之應用，輔大應統所碩士論文，2000。 l 張維哲，人工神經網路，全欣資訊圖書，1992。 l 陳智宏，應用類神經網路於電力系統負載之溫度敏感度分析，中山電機工程所碩士論文，2001。 l 黃國源，類神經網路與圖形辨識，維科，2000。 l 葉怡成，類神經網路模式應用與實作（第7版），儒林圖書公司，台北市，2000。 l 傅心家，神經網路導論，第三波，1991。 l 楊雅媛，迴歸分析與類神經網路預測能力之比較，政大統計所碩士論文，2002。 l 鄭忠樑，運用分類樹於股價報酬率預測之研究，元智大學資訊管理研究所碩士論文，民國九十一年。
描述	碩士國立政治大學統計研究所 90354004 91
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0090354004
資料類型	thesis

dc.contributor.advisor	鄭宇庭	zh_TW
dc.contributor.author (Authors)	邱蔚群	zh_TW
dc.contributor.author (Authors)	Lilian Chiu	en_US
dc.creator (作者)	邱蔚群	zh_TW
dc.creator (作者)	Lilian Chiu	en_US
dc.date (日期)	2002	en_US
dc.date.accessioned	2009-09-14	-
dc.date.available	2009-09-14	-
dc.date.issued (上傳時間)	2009-09-14	-
dc.identifier (Other Identifiers)	G0090354004	en_US
dc.identifier.uri (URI)	https://nccur.lib.nccu.edu.tw/handle/140.119/30872	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	統計研究所	zh_TW
dc.description (描述)	90354004	zh_TW
dc.description (描述)	91	zh_TW
dc.description.abstract (摘要)	摘要過去對於保險資料的研究多採用傳統統計方法，然而保險公司龐大資料庫中蘊含的寶貴資訊可能因此被遺漏。本研究目的是將資料採礦的技術應用到保險公司資料庫中的高雄縣市保戶保單貸款資料上，研究保戶利用保單貸款的行為，做為保險公司日後推行保單貸款的參考。從整理過後的資料中，用不同抽樣方法抽出不同樣本大小以及不同是否貸款比例的樣本，將連續變數做轉換後，建立決策樹和類神經模型，透過統計上的變異數分析，討論四個因子對預測結果好壞的影響。選出最好組合的樣本大小、是否貸款比例（已貸款：尚未貸款）、抽樣方法、以及建立的模型。最後將此最佳組合建立的C4.5決策樹轉換成規則，並探討其中正確率較高的幾項，作為給保險公司的參考。	zh_TW
dc.description.abstract (摘要)	Abstract In the past, the analysis of insurance data is usually conducted with traditional statistical methods, however a large amount of valuable information hidden might be left undiscovered. The purpose of this research is to apply data mining techniques to customer policy data taken from one of insurance company’s database in Kaoshuing city and county to study the behavior of customers taking loans against their policies as a reference for insurance company in promoting policy in the future. From the cleansed data, we sample policies of different sizes and percentage of policies with loans by different sampling methods, decision trees and neural network models, then through the significant interactions of ANOVA, discuss how the results being influenced by the four factors. We then choose the best model that manifests factors affecting customer’s behavior in taking out the loan thus providing insurance company a vital information in targeting its customers group.	en_US
dc.description.tableofcontents	目次第一章序論…………………………………………………………………1 第一節研究動機與目的……………………………………………………1 第二節研究流程……………………………………………………………1 第二章文獻探討……………………………………………………………3 第一節資料採礦……………………………………………………………3 第二節資料採礦的技術……………………………………………………4 第三節 C4.5 & C5.0……………………………………………………… 6 第四節 cART………………………………………………………………..9 第五節類神經網路………………………………………………………..12 第六節使用軟體…………………………………………………………..16 第三章研究方法…………………………………………………………..31 第一節研究對象…………………………………………………………..31 第二節資料結構…………………………………………………………..31 第三節資料整理…………………………………………………………..36 第四節抽樣………………………………………………………………..38 第五節模型建立與選取…………………………………………………..39 第四章結果分析…………………………………………………………..42 第一節資料敘述統計……………………………………………………..42 第二節抽樣及變異數分析………………………………………………..52 第三節模型規則建立……………………………………………………..77 第五章結論與未來方向…………………………………………………..88 第一節結論……………………………………………………….……….88 第二節區域性比較………………………………………………………..90 第三節未來改進方向……………………………………………………..92 參考文獻……………………………………………………………………..95	zh_TW
dc.language.iso	en_US	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0090354004	en_US
dc.subject (關鍵詞)	資料採礦	zh_TW
dc.subject (關鍵詞)	決策樹	zh_TW
dc.subject (關鍵詞)	保單貸款	zh_TW
dc.subject (關鍵詞)	類神經	zh_TW
dc.subject (關鍵詞)	C4.5	en_US
dc.title (題名)	資料採礦技術在保險公司客戶保單貸款行為研究的應用	zh_TW
dc.type (資料類型)	thesis	en
dc.relation.reference (參考文獻)	參考文獻	zh_TW
dc.relation.reference (參考文獻)	l Berry, M. J. A., and Linoff, G. S. (1997), Data Mining Techniques: for Marketing, Sales, and Customer Support. John Wiley & Sons Inc, New York.	zh_TW
dc.relation.reference (參考文獻)	l Berry, M. J. A., and Linoff, G. S. (2000), Mastering Data Mining Techniques, The Art & Science of Customer Relationship Management. John Wiley & Sons Inc., New York.	zh_TW
dc.relation.reference (參考文獻)	l Breiman, L. Friedman, J.H., Olshen, R. A., and Stone, C. J. (1984).. Classification and Regression Trees. Wadsworth, Pacific Grove, California.	zh_TW
dc.relation.reference (參考文獻)	l Dunham, M. H. (2003), Data Mining: Introductory and Advanced Topics. Pearson Education Inc., Upper Saddle River, New Jersey.	zh_TW
dc.relation.reference (參考文獻)	l Freund, Y., and Schapire, R. E. (1996), “Experiments with a New Boosting Algorithm”. Machine Learning: Proceedings of the Thirteenth International Conference.	zh_TW
dc.relation.reference (參考文獻)	l Friedman, J., Hastie, T., and Tibshirani R. (1998), Additive Logistic Regression: a Statistical View of Boosting,	zh_TW
dc.relation.reference (參考文獻)	l Smith, M. (1993), Neural Networks for Statistical Modeling. Van Norstrand Reinhold, New York.	zh_TW
dc.relation.reference (參考文獻)	l Terano, T., Liu, H., and Chen, A. L. P. (2000), Knowledge Discovery and Data Mining: Current Issues and New Applications. Springer-Verlag, Berling, Germany.	zh_TW
dc.relation.reference (參考文獻)	l Witten, I. H., and Eibe, F. (2000), Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, San Francisco, California.	zh_TW
dc.relation.reference (參考文獻)	l 中國科學技術大學生物醫學工程跨係委員會，神經網路及其應用，儒林圖書，1993。	zh_TW
dc.relation.reference (參考文獻)	l 呂奇傑，演化式類神經網路分類技術於資料探勘上之應用，輔大應統所碩士論文，2000。	zh_TW
dc.relation.reference (參考文獻)	l 張維哲，人工神經網路，全欣資訊圖書，1992。	zh_TW
dc.relation.reference (參考文獻)	l 陳智宏，應用類神經網路於電力系統負載之溫度敏感度分析，中山電機工程所碩士論文，2001。	zh_TW
dc.relation.reference (參考文獻)	l 黃國源，類神經網路與圖形辨識，維科，2000。	zh_TW
dc.relation.reference (參考文獻)	l 葉怡成，類神經網路模式應用與實作（第7版），儒林圖書公司，台北市，2000。	zh_TW
dc.relation.reference (參考文獻)	l 傅心家，神經網路導論，第三波，1991。	zh_TW
dc.relation.reference (參考文獻)	l 楊雅媛，迴歸分析與類神經網路預測能力之比較，政大統計所碩士論文，2002。	zh_TW
dc.relation.reference (參考文獻)	l 鄭忠樑，運用分類樹於股價報酬率預測之研究，元智大學資訊管理研究所碩士論文，民國九十一年。	zh_TW

學術產出-Theses

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

Google Scholar^TM