學術產出-Theses

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 基於混合廣義伽瑪分配之顧客購買間隔時間模型
Customer interpurchase-­time models based on mixture generalized gamma distributions
作者 柯瀚鈞
Ke, Han-Jun
貢獻者 黃子銘
Huang, Tzee-Ming
柯瀚鈞
Ke, Han-Jun
關鍵詞 購買間隔時間
混合模型
乘法模型
馬可夫鏈
interpurchase times
mixture model
multiplicative model
Markov chain
日期 2021
上傳時間 4-Aug-2021 14:42:59 (UTC+8)
摘要   本論文以廣義伽瑪分配作為顧客購買間隔時間之母體分配,建立混合模型推估顧客處於非常活躍、活躍及非活躍狀態的比例,並導入乘法模型探討商品類型在各狀態下對購買間隔時間的影響。除此之外,運用馬可夫鏈的特性建立轉移矩陣收集顧客狀態轉移的變化,並考慮多組馬可夫鏈模型以更精準的捕捉顧客消費行為。資料驗證方面,生成模擬資料以最大概似估計法及是否考慮最大期望演算法估計上述模型之參數來檢視估計優劣,並以 kaggle 中的網路商城交易資料來展現本文方法運用在實際資料的成果。根據模擬實驗顯示,考慮最大期望演算法估計結果較優異但所耗費的時間較長,不使用最大期望演算法估計結果相對較差,然而計算時間則大幅減少。
In this thesis, a model for customer interpurchase times is proposed, where the generalized gamma distribution is used. In the proposed model, each customer has three states: very active, active and inactive state, and interpurchase times of a customer at different state may obey different distributions. The impact of product types on the interpurchase times is also considered. In addition, the customer states are allowed to change overtime, according to a Markov chain model. Model parameters are estimated using maximum likelihood estimation and consider whether to adopt the expectationmaximization algorithm. According to simulation experiments, using the expectation­maximization algorithm gives a better result but takes a longer
time, the result without using the expectation­maximization algorithm is relatively poor, but the calculation time is greatly reduced.
參考文獻 Akaike, H. (1974). A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716–723.
Allenby, G. M., Leone, R. P., & Jen, L. (1999). A dynamic model of purchase timing with application to direct marketing. Journal of the American Statistical Association, 94(446), 365–374.
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1–22.
Giraud, C. (2015). Introduction to high­dimensional statistics. Monographs on Statistics and Applied Probability, 139, 139.
Hughes, A. M. (1994). Strategic database marketing. Chicago: Probus Publishing Company.
Markov, A. A. (1971). Extension of the limit theorems of probability theory to a sum of variables connected in a chain. Dynamic probabilistic systems, 1, 552–577.
Schwarz, G., et al. (1978). Estimating the dimension of a model. Annals of statistics, 6(2), 461–464.
Stacy, E. W., et al. (1962). A generalization of the gamma distribution. The Annals of mathematical statistics, 33(3), 1187–1192.
Wilks, S. S. (1938). The large­sample distribution of the likelihood ratio for testing composite hypotheses. The annals of mathematical statistics, 9(1), 60–62.
郭瑞祥, 蔣明晃, 陳薏棻, & 楊凱全. (2009). 應用層級貝氏理論於跨商品類別之顧客購買期間預測模型. 管理學報, 26(3), 291–308.
林倉億. (2014). 「轉移矩陣」二三事 (2):歷年高中課本中的穩定狀態. HPM 通訊, 17(6).
描述 碩士
國立政治大學
統計學系
108354022
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0108354022
資料類型 thesis
dc.contributor.advisor 黃子銘zh_TW
dc.contributor.advisor Huang, Tzee-Mingen_US
dc.contributor.author (Authors) 柯瀚鈞zh_TW
dc.contributor.author (Authors) Ke, Han-Junen_US
dc.creator (作者) 柯瀚鈞zh_TW
dc.creator (作者) Ke, Han-Junen_US
dc.date (日期) 2021en_US
dc.date.accessioned 4-Aug-2021 14:42:59 (UTC+8)-
dc.date.available 4-Aug-2021 14:42:59 (UTC+8)-
dc.date.issued (上傳時間) 4-Aug-2021 14:42:59 (UTC+8)-
dc.identifier (Other Identifiers) G0108354022en_US
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/136323-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 統計學系zh_TW
dc.description (描述) 108354022zh_TW
dc.description.abstract (摘要)   本論文以廣義伽瑪分配作為顧客購買間隔時間之母體分配,建立混合模型推估顧客處於非常活躍、活躍及非活躍狀態的比例,並導入乘法模型探討商品類型在各狀態下對購買間隔時間的影響。除此之外,運用馬可夫鏈的特性建立轉移矩陣收集顧客狀態轉移的變化,並考慮多組馬可夫鏈模型以更精準的捕捉顧客消費行為。資料驗證方面,生成模擬資料以最大概似估計法及是否考慮最大期望演算法估計上述模型之參數來檢視估計優劣,並以 kaggle 中的網路商城交易資料來展現本文方法運用在實際資料的成果。根據模擬實驗顯示,考慮最大期望演算法估計結果較優異但所耗費的時間較長,不使用最大期望演算法估計結果相對較差,然而計算時間則大幅減少。zh_TW
dc.description.abstract (摘要) In this thesis, a model for customer interpurchase times is proposed, where the generalized gamma distribution is used. In the proposed model, each customer has three states: very active, active and inactive state, and interpurchase times of a customer at different state may obey different distributions. The impact of product types on the interpurchase times is also considered. In addition, the customer states are allowed to change overtime, according to a Markov chain model. Model parameters are estimated using maximum likelihood estimation and consider whether to adopt the expectationmaximization algorithm. According to simulation experiments, using the expectation­maximization algorithm gives a better result but takes a longer
time, the result without using the expectation­maximization algorithm is relatively poor, but the calculation time is greatly reduced.
en_US
dc.description.tableofcontents 摘要 i
Abstract ii
目錄 iii
圖目錄 v
表目錄 vi
第一章 緒論 1
第二章 文獻探討 3
2.1 廣義伽瑪分配 3
2.2 廣義伽瑪混合模型 5
2.3 乘法模型 6
2.4 馬可夫鏈 7
2.5 最大期望演算法 8
2.6 模型選取方法 9
第三章 研究方法 11
3.1 混合模型 11
3.1.1 最大概似估計法與 EM 演算法 12
3.2 混合模型導入乘法模型 14
3.3 馬可夫鏈模型 15
第四章 資料驗證 18
4.1 模擬資料之驗證 18
4.1.1 混合模型 18
4.1.2 混合模型導入乘法模型 25
4.1.3 馬可夫鏈模型 29
4.2 實際資料之應用 30
4.2.1 混合模型 31
4.2.2 混合模型導入乘法模型 32
4.2.3 馬可夫鏈模型 36
第五章 結論及未來展望 39
5.1 結論 39
5.2 未來展望 40
參考文獻 41
zh_TW
dc.format.extent 1229712 bytes-
dc.format.mimetype application/pdf-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0108354022en_US
dc.subject (關鍵詞) 購買間隔時間zh_TW
dc.subject (關鍵詞) 混合模型zh_TW
dc.subject (關鍵詞) 乘法模型zh_TW
dc.subject (關鍵詞) 馬可夫鏈zh_TW
dc.subject (關鍵詞) interpurchase timesen_US
dc.subject (關鍵詞) mixture modelen_US
dc.subject (關鍵詞) multiplicative modelen_US
dc.subject (關鍵詞) Markov chainen_US
dc.title (題名) 基於混合廣義伽瑪分配之顧客購買間隔時間模型zh_TW
dc.title (題名) Customer interpurchase-­time models based on mixture generalized gamma distributionsen_US
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) Akaike, H. (1974). A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716–723.
Allenby, G. M., Leone, R. P., & Jen, L. (1999). A dynamic model of purchase timing with application to direct marketing. Journal of the American Statistical Association, 94(446), 365–374.
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1–22.
Giraud, C. (2015). Introduction to high­dimensional statistics. Monographs on Statistics and Applied Probability, 139, 139.
Hughes, A. M. (1994). Strategic database marketing. Chicago: Probus Publishing Company.
Markov, A. A. (1971). Extension of the limit theorems of probability theory to a sum of variables connected in a chain. Dynamic probabilistic systems, 1, 552–577.
Schwarz, G., et al. (1978). Estimating the dimension of a model. Annals of statistics, 6(2), 461–464.
Stacy, E. W., et al. (1962). A generalization of the gamma distribution. The Annals of mathematical statistics, 33(3), 1187–1192.
Wilks, S. S. (1938). The large­sample distribution of the likelihood ratio for testing composite hypotheses. The annals of mathematical statistics, 9(1), 60–62.
郭瑞祥, 蔣明晃, 陳薏棻, & 楊凱全. (2009). 應用層級貝氏理論於跨商品類別之顧客購買期間預測模型. 管理學報, 26(3), 291–308.
林倉億. (2014). 「轉移矩陣」二三事 (2):歷年高中課本中的穩定狀態. HPM 通訊, 17(6).
zh_TW
dc.identifier.doi (DOI) 10.6814/NCCU202100840en_US