使用隨機漫步的監督式學習

Publications-Theses

Article View/Open

pdf(0)

Publication Export

Google Scholar^TM

題名	使用隨機漫步的監督式學習 Random Walk-based Supervised Learning
作者	羅嘉承 Lo, Chia-Cheng
貢獻者	周珮婷<br>陳怡如羅嘉承 Lo, Chia-Cheng
關鍵詞	監督式學習分類相似度隨機漫步馬可夫鏈 OutRank Supervised Learning Classification Similarity Random walk Markov Chain OutRank
日期	2023
上傳時間	6-Jul-2023 17:05:14 (UTC+8)
摘要	OutRank 原是一種基於對像相似性所進行的異常偵測方法。不同於以距離或是密度來偵測的形式，OutRank 以計算資料點間的相似性，來找出在資料中的異常小族群。本論文延伸此概念，擴展應用到分類、監督式學習的問題上。根據 OutRank 的性質，我們可以得到各筆資料間的相似度，因此我們假設同一族群間的相似度會較接近。在本論文中，我們會針對不同的資料去做驗證，並且與經典的分類方法 : Random Forest 去做比較。 OutRank was originally developed as an anomaly detection method based on object similarity. Unlike distance or density-based detection approaches, OutRank calculates the similarity between data points to identify small anomaly groups within the data. This study extends the concept of OutRank and applies it to classification and supervised learning problems. Based on the nature of OutRank, we assume that the similarity between data points within the same group will be higher. In this study,we verify this assumption using different datasets and compare the results with the classic classification method, Random Forest.
參考文獻	Ahmed, M., Kashem, M. A., Rahman, M., and Khatun, S. (2020). Review and analysis of risk factor of maternal health in remote area using the internet of things (iot). In InECCE2019: Proceedings of the 5th International Conference on Electrical, Control & Computer Engineering, Kuantan, Pahang, Malaysia, 29th July 2019, pages 357–365. Springer. Ait Mohamed, L., Cherfa, A., Cherfa, Y., Belkhamsa, N., and Alim-Ferhat, F. (2021). Hybrid method combining superpixel, supervised learning, and random walk for glioma segmentation. International journal of imaging systems and technology, 31(1):288– 301. Bachelier, L. (1900). Théorie de la spéculation. In Annales scientifiques de l’École normale supérieure, volume 17, pages 21–86. Backstrom, L. and Leskovec, J. (2011). Supervised random walks: predicting and recommending links in social networks. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 635–644. Breiman, L. (2001). Random forests. Machine learning, 45:5–32. Charytanowicz, M., Niewczas, J., Kulczycki, P., Kowalski, P. A., Łukasik, S., and Żak, S. (2010). Complete gradient clustering algorithm for features analysis of x-ray images. In Information Technologies in Biomedicine: Volume 2, pages 15–24. Springer. Chotard, A. and Auger, A. (2019). Verifiable conditions for the irreducibility and aperiodicity of markov chains by analyzing underlying deterministic models. Chua, L. O. and Roska, T. (1993). The cnn paradigm. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, 40(3):147–156. ÇINAR, İ., Koklu, M., and Taşdemir, Ş. (2020). Classification of raisin grains using machine vision and artificial intelligence methods. Gazi Mühendislik Bilimleri Dergisi, 6(3):200–209. Codling, E. A., Plank, M. J., and Benhamou, S. (2008). Random walk models in biology. Journal of the Royal society interface, 5(25):813–834. Cunningham, P., Cord, M., and Delany, S. J. (2008). Supervised learning. Machine learning techniques for multimedia: case studies on organization and retrieval, pages 21–49. Er, M. B. and Aydilek, I. B. (2019). Music emotion recognition by using chroma spectrogram and deep visual features. International Journal of Computational Intelligence Systems, 12(2):1622–1634. Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of eugenics, 7(2):179–188. King, R., Orlowska, M., and Studer, R. (2003). On the move to meaningful internet systems 2003. Kleinbaum, D. G., Dietz, K., Gail, M., Klein, M., and Klein, M. (2002). Logistic regression. Springer. Li, J. (2019). Regression and classification in supervised learning. In Proceedings of the 2nd International Conference on Computing and Big Data, pages 99–104. Liu, K., Xu, H. L., Liu, Y., and Zhao, J. (2013). Opinion target extraction using partiallysupervised word alignment model. In IJCAI, volume 13, pages 2134–2140. Liu, X., Yi, W., Xi, B., Dai, Q., et al. (2022). Identification of drug-disease associations using a random walk with restart method and supervised learning. Computational and Mathematical Methods in Medicine, 2022. Lu, W., Zhuang, Y., and Wu, J. (2009). Discovering calligraphy style relationships by supervised learning weighted random walk model. Multimedia systems, 15:221–242. Moghaddam, F. B., Bigham, B. S., et al. (2018). Extra: Expertise-boosted model for trustbased recommendation system based on supervised random walk. Comput. Informatics, 37(5):1209–1230. Moonesinghe, H. and Tan, P.-N. (2006). Outlier detection using random walks. In 2006 18th IEEE international conference on tools with artificial intelligence (ICTAI’06), pages 532–539. IEEE. Page, L., Brin, S., Motwani, R., and Winograd, T. (1999). The pagerank citation ranking: Bringing order to the web. Technical report, Stanford infolab. Pillai, S. U., Suel, T., and Cha, S. (2005). The perron-frobenius theorem: some of its applications. IEEE Signal Processing Magazine, 22(2):62–75. Rish, I. et al. (2001). An empirical study of the naive bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence, volume 3, pages 41–46. Scalas, E. (2006). The application of continuous-time random walks in finance and economics. Physica A: Statistical Mechanics and its Applications, 362(2):225–239.
描述	碩士國立政治大學統計學系 110354010
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0110354010
資料類型	thesis

dc.contributor.advisor	周珮婷<br>陳怡如	zh_TW
dc.contributor.author (Authors)	羅嘉承	zh_TW
dc.contributor.author (Authors)	Lo, Chia-Cheng	en_US
dc.creator (作者)	羅嘉承	zh_TW
dc.creator (作者)	Lo, Chia-Cheng	en_US
dc.date (日期)	2023	en_US
dc.date.accessioned	6-Jul-2023 17:05:14 (UTC+8)	-
dc.date.available	6-Jul-2023 17:05:14 (UTC+8)	-
dc.date.issued (上傳時間)	6-Jul-2023 17:05:14 (UTC+8)	-
dc.identifier (Other Identifiers)	G0110354010	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/145944	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	統計學系	zh_TW
dc.description (描述)	110354010	zh_TW
dc.description.abstract (摘要)	OutRank 原是一種基於對像相似性所進行的異常偵測方法。不同於以距離或是密度來偵測的形式，OutRank 以計算資料點間的相似性，來找出在資料中的異常小族群。本論文延伸此概念，擴展應用到分類、監督式學習的問題上。根據 OutRank 的性質，我們可以得到各筆資料間的相似度，因此我們假設同一族群間的相似度會較接近。在本論文中，我們會針對不同的資料去做驗證，並且與經典的分類方法 : Random Forest 去做比較。	zh_TW
dc.description.abstract (摘要)	OutRank was originally developed as an anomaly detection method based on object similarity. Unlike distance or density-based detection approaches, OutRank calculates the similarity between data points to identify small anomaly groups within the data. This study extends the concept of OutRank and applies it to classification and supervised learning problems. Based on the nature of OutRank, we assume that the similarity between data points within the same group will be higher. In this study,we verify this assumption using different datasets and compare the results with the classic classification method, Random Forest.	en_US
dc.description.tableofcontents	摘要 i Abstract ii 目次 iii 圖目錄 iv 表目錄 v 第一章緒論 1 第二章文獻回顧 2 2.1 隨機漫步 2 2.2 馬可夫鏈性質 3 2.3 隨機漫步與監督式學習的結合 4 2.4 總結 5 第三章研究方法 6 3.1 OutRank 方法介紹 6 3.1.1 馬可夫矩陣 6 3.1.2 隨機漫步 7 3.1.3 相聯性 (Connectivity) 7 3.1.4 監督式學習方法 8 3.2 隨機森林 (Random Forest) 8 第四章實證結果 9 4.1 評估準則 9 4.2 資料集介紹 10 4.3 預測結果 18 4.4 結論 20 第五章結論與建議 21 參考文獻 22	zh_TW
dc.format.extent	850300 bytes	-
dc.format.mimetype	application/pdf	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0110354010	en_US
dc.subject (關鍵詞)	監督式學習	zh_TW
dc.subject (關鍵詞)	分類	zh_TW
dc.subject (關鍵詞)	相似度	zh_TW
dc.subject (關鍵詞)	隨機漫步	zh_TW
dc.subject (關鍵詞)	馬可夫鏈	zh_TW
dc.subject (關鍵詞)	OutRank	zh_TW
dc.subject (關鍵詞)	Supervised Learning	en_US
dc.subject (關鍵詞)	Classification	en_US
dc.subject (關鍵詞)	Similarity	en_US
dc.subject (關鍵詞)	Random walk	en_US
dc.subject (關鍵詞)	Markov Chain	en_US
dc.subject (關鍵詞)	OutRank	en_US
dc.title (題名)	使用隨機漫步的監督式學習	zh_TW
dc.title (題名)	Random Walk-based Supervised Learning	en_US
dc.type (資料類型)	thesis	en_US
dc.relation.reference (參考文獻)	Ahmed, M., Kashem, M. A., Rahman, M., and Khatun, S. (2020). Review and analysis of risk factor of maternal health in remote area using the internet of things (iot). In InECCE2019: Proceedings of the 5th International Conference on Electrical, Control & Computer Engineering, Kuantan, Pahang, Malaysia, 29th July 2019, pages 357–365. Springer. Ait Mohamed, L., Cherfa, A., Cherfa, Y., Belkhamsa, N., and Alim-Ferhat, F. (2021). Hybrid method combining superpixel, supervised learning, and random walk for glioma segmentation. International journal of imaging systems and technology, 31(1):288– 301. Bachelier, L. (1900). Théorie de la spéculation. In Annales scientifiques de l’École normale supérieure, volume 17, pages 21–86. Backstrom, L. and Leskovec, J. (2011). Supervised random walks: predicting and recommending links in social networks. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 635–644. Breiman, L. (2001). Random forests. Machine learning, 45:5–32. Charytanowicz, M., Niewczas, J., Kulczycki, P., Kowalski, P. A., Łukasik, S., and Żak, S. (2010). Complete gradient clustering algorithm for features analysis of x-ray images. In Information Technologies in Biomedicine: Volume 2, pages 15–24. Springer. Chotard, A. and Auger, A. (2019). Verifiable conditions for the irreducibility and aperiodicity of markov chains by analyzing underlying deterministic models. Chua, L. O. and Roska, T. (1993). The cnn paradigm. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, 40(3):147–156. ÇINAR, İ., Koklu, M., and Taşdemir, Ş. (2020). Classification of raisin grains using machine vision and artificial intelligence methods. Gazi Mühendislik Bilimleri Dergisi, 6(3):200–209. Codling, E. A., Plank, M. J., and Benhamou, S. (2008). Random walk models in biology. Journal of the Royal society interface, 5(25):813–834. Cunningham, P., Cord, M., and Delany, S. J. (2008). Supervised learning. Machine learning techniques for multimedia: case studies on organization and retrieval, pages 21–49. Er, M. B. and Aydilek, I. B. (2019). Music emotion recognition by using chroma spectrogram and deep visual features. International Journal of Computational Intelligence Systems, 12(2):1622–1634. Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of eugenics, 7(2):179–188. King, R., Orlowska, M., and Studer, R. (2003). On the move to meaningful internet systems 2003. Kleinbaum, D. G., Dietz, K., Gail, M., Klein, M., and Klein, M. (2002). Logistic regression. Springer. Li, J. (2019). Regression and classification in supervised learning. In Proceedings of the 2nd International Conference on Computing and Big Data, pages 99–104. Liu, K., Xu, H. L., Liu, Y., and Zhao, J. (2013). Opinion target extraction using partiallysupervised word alignment model. In IJCAI, volume 13, pages 2134–2140. Liu, X., Yi, W., Xi, B., Dai, Q., et al. (2022). Identification of drug-disease associations using a random walk with restart method and supervised learning. Computational and Mathematical Methods in Medicine, 2022. Lu, W., Zhuang, Y., and Wu, J. (2009). Discovering calligraphy style relationships by supervised learning weighted random walk model. Multimedia systems, 15:221–242. Moghaddam, F. B., Bigham, B. S., et al. (2018). Extra: Expertise-boosted model for trustbased recommendation system based on supervised random walk. Comput. Informatics, 37(5):1209–1230. Moonesinghe, H. and Tan, P.-N. (2006). Outlier detection using random walks. In 2006 18th IEEE international conference on tools with artificial intelligence (ICTAI’06), pages 532–539. IEEE. Page, L., Brin, S., Motwani, R., and Winograd, T. (1999). The pagerank citation ranking: Bringing order to the web. Technical report, Stanford infolab. Pillai, S. U., Suel, T., and Cha, S. (2005). The perron-frobenius theorem: some of its applications. IEEE Signal Processing Magazine, 22(2):62–75. Rish, I. et al. (2001). An empirical study of the naive bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence, volume 3, pages 41–46. Scalas, E. (2006). The application of continuous-time random walks in finance and economics. Physica A: Statistical Mechanics and its Applications, 362(2):225–239.	zh_TW

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM