學術產出-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 Dimension Reduction of High-Dimensional Datasets Based on Stepwise SVM
作者 周珮婷
Chou, Elizabeth P.
Ko, Tzu-Wei
貢獻者 統計系
日期 2017-11
上傳時間 3-Jul-2018 15:05:09 (UTC+8)
摘要 The current study proposes a dimension reduction method, stepwise support vector machine (SVM), to reduce the dimensions of large p small n datasets. The proposed method is compared with other dimension reduction methods, namely, the Pearson product difference correlation coefficient (PCCs), recursive feature elimination based on random forest (RF-RFE), and principal component analysis (PCA), by using five gene expression datasets. Additionally, the prediction performance of the variables selected by our method is evaluated. The study found that stepwise SVM can effectively select the important variables and achieve good prediction performance. Moreover, the predictions of stepwise SVM for reduced datasets was better than those for the unreduced datasets. The performance of stepwise SVM was more stable than that of PCA and RF-RFE, but the performance difference with respect to PCCs was minimal. It is necessary to reduce the dimensions of large p small n datasets. We believe that stepwise SVM can effectively eliminate noise in data and improve the prediction accuracy in any large p small n dataset.
關聯 arXiv:1711.03346v1
資料類型 article
dc.contributor 統計系
dc.creator (作者) 周珮婷zh_TW
dc.creator (作者) Chou, Elizabeth P.en_US
dc.creator (作者) Ko, Tzu-Weien_US
dc.date (日期) 2017-11
dc.date.accessioned 3-Jul-2018 15:05:09 (UTC+8)-
dc.date.available 3-Jul-2018 15:05:09 (UTC+8)-
dc.date.issued (上傳時間) 3-Jul-2018 15:05:09 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/118191-
dc.description.abstract (摘要) The current study proposes a dimension reduction method, stepwise support vector machine (SVM), to reduce the dimensions of large p small n datasets. The proposed method is compared with other dimension reduction methods, namely, the Pearson product difference correlation coefficient (PCCs), recursive feature elimination based on random forest (RF-RFE), and principal component analysis (PCA), by using five gene expression datasets. Additionally, the prediction performance of the variables selected by our method is evaluated. The study found that stepwise SVM can effectively select the important variables and achieve good prediction performance. Moreover, the predictions of stepwise SVM for reduced datasets was better than those for the unreduced datasets. The performance of stepwise SVM was more stable than that of PCA and RF-RFE, but the performance difference with respect to PCCs was minimal. It is necessary to reduce the dimensions of large p small n datasets. We believe that stepwise SVM can effectively eliminate noise in data and improve the prediction accuracy in any large p small n dataset.en_US
dc.format.extent 98 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) arXiv:1711.03346v1
dc.title (題名) Dimension Reduction of High-Dimensional Datasets Based on Stepwise SVMen_US
dc.type (資料類型) article