Title The application of symbolic data analysis to movie recommendation systems
Author 張順益
Contributor Wu, Han-Ming (吳漢銘)
Keywords Recommendation System
Symbolic Data Analysis
Clustering Algorithm
Missing Value Imputation
Date 2023
Upload time 1-Sep-2023 14:57:45 (UTC+8)
Abstract Recommendation systems are now widely used in business marketing, covering products such as movies, music, news, books, restaurants, 3C (computer, communication, and consumer electronics) products, and financial services. By giving users accurate personalized recommendations, they can increase merchants' profits. Collaborative filtering (Resnick), the most common type of recommendation algorithm, uses users' ratings of products to find suitable items to recommend. Its theoretical basis is that users with similar consumption behavior are likely to prefer similar items. However, collaborative filtering faces problems such as the cold-start problem for new users (and, analogously, for new items) and sparse rating matrices. In this study, we focus on movie recommendation systems and, based on the characteristics of user groups, convert their movie ratings by movie genre into multi-valued modal symbolic data. This transformation accounts for the fact that a movie may belong to several genres, and it aims to overcome the new-user cold-start problem and reduce the matrix sparsity caused by missing values. We conducted simulation experiments and analyzed real movie rating data to validate the proposed method. The results show that symbolic data analysis not only improves recommendation performance but also opens a new line of thinking for the development of recommendation systems.
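The abstract does not spell out how ratings are turned into multi-valued modal symbolic data, so the following is only an illustrative sketch, not the thesis's method. It assumes that each rating is split equally among the genres of the rated movie and that the genre totals are then normalized into a distribution; the function name `genre_profile` and the toy data are hypothetical.

```python
from collections import defaultdict


def genre_profile(ratings, movie_genres):
    """Summarize ratings as a distribution over genres.

    ratings:      dict mapping movie id -> numeric rating
    movie_genres: dict mapping movie id -> list of genre names

    Assumption (not taken from the thesis): a rating is divided
    equally among the movie's genres, and the per-genre totals are
    normalized so that the weights sum to 1.
    """
    totals = defaultdict(float)
    for movie, rating in ratings.items():
        genres = movie_genres.get(movie, [])
        if not genres:
            continue  # skip movies with no genre information
        share = rating / len(genres)
        for genre in genres:
            totals[genre] += share

    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}  # no usable ratings: empty profile
    return {g: w / grand_total for g, w in sorted(totals.items())}


# Hypothetical toy data: one movie with two genres, one with a single genre.
movie_genres = {"m1": ["Action", "Comedy"], "m2": ["Comedy"]}
ratings = {"m1": 4, "m2": 2}

profile = genre_profile(ratings, movie_genres)
print(profile)
```

In this toy case the profile puts one third of the weight on Action and two thirds on Comedy. Because the profile is indexed by a small, fixed set of genres rather than by individual movies, it has far fewer empty cells than the raw user-by-movie rating matrix, which is the intuition behind the sparsity reduction the abstract describes.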
Description Master's thesis
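dc.description.tableofcontents 1 Introduction
1.1 Research Motivation
1.2 Research Objectives
1.3 Literature Review
1.4 Thesis Structure
2 Memory-Based Collaborative Filtering Algorithm
3 Multi-Valued Modal Memory-Based Collaborative Filtering Algorithm
4 Simulation Experiments
4.1 Effect of the Proportion of Missing Values
4.2 Effect of the Strength of the Pearson Correlation
5 Real-Data Analysis
6 Conclusion and Discussion
References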
dc.type (Type) thesis
