Data driven geometry for learning | Publication | NCCU Academic Hub

Publications-Proceedings

Article View/Open

pdf(761)

Publication Export

Google Scholar^TM

NCCU Library

Discovery System

Citation Infomation

Related Publications in TAIR

Simple Record
Full Record

題名	Data driven geometry for learning
作者	周珮婷 Chou, Elizabeth P.
貢獻者	統計系
關鍵詞	Artificial intelligence; Biomarkers; Classification (of information); Data mining; Geometry; Learning systems; Microarrays; Pattern recognition; Supervised learning; Trees (mathematics); BiDCG; Complexity penalties; Data clouds; Dimension-reduction; High dimensionality; Microarray data sets; Pattern information; Semi- supervised learning; Learning algorithms
日期	2015-07
上傳時間	8-Aug-2017 16:59:37 (UTC+8)
摘要	High dimensional covariate information provides a detailed description of any individuals involved in a machine learning and classification problem. The inter-dependence patterns among these covariate vectors may be unknown to researchers. This fact is not well recognized in classic and modern machine learning literature; most model-based popular algorithms are implemented using some version of the dimensionreduction approach or even impose a built-in complexity penalty. This is a defensive attitude toward the high dimensionality. In contrast, an accommodating attitude can exploit such potential inter-dependence patterns embedded within the high dimensionality. In this research, we implement this latter attitude throughout by first computing the similarity between data nodes and then discovering pattern information in the form of Ultrametric tree geometry among almost all the covariate dimensions involved. We illustrate with real Microarray datasets, where we demonstrate that such dual-relationships are indeed class specific, each precisely representing the discovery of a biomarker. The whole collection of computed biomarkers constitutes a global feature-matrix, which is then shown to give rise to a very effective learning algorithm. © Springer International Publishing Switzerland 2015.
關聯	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9166, 395-402 11th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2015; Hamburg; Germany; 20 July 2015 到 21 July 2015; 代碼 156629
資料類型	conference
DOI	http://dx.doi.org/10.1007/978-3-319-21024-7_27

dc.contributor	統計系	zh_Tw
dc.creator (作者)	周珮婷	zh_TW
dc.creator (作者)	Chou, Elizabeth P.	en_US
dc.date (日期)	2015-07	en_US
dc.date.accessioned	8-Aug-2017 16:59:37 (UTC+8)	-
dc.date.available	8-Aug-2017 16:59:37 (UTC+8)	-
dc.date.issued (上傳時間)	8-Aug-2017 16:59:37 (UTC+8)	-
dc.identifier.uri (URI)	https://ah.lib.nccu.edu.tw/item?item_id=128531	-
dc.description.abstract (摘要)	High dimensional covariate information provides a detailed description of any individuals involved in a machine learning and classification problem. The inter-dependence patterns among these covariate vectors may be unknown to researchers. This fact is not well recognized in classic and modern machine learning literature; most model-based popular algorithms are implemented using some version of the dimensionreduction approach or even impose a built-in complexity penalty. This is a defensive attitude toward the high dimensionality. In contrast, an accommodating attitude can exploit such potential inter-dependence patterns embedded within the high dimensionality. In this research, we implement this latter attitude throughout by first computing the similarity between data nodes and then discovering pattern information in the form of Ultrametric tree geometry among almost all the covariate dimensions involved. We illustrate with real Microarray datasets, where we demonstrate that such dual-relationships are indeed class specific, each precisely representing the discovery of a biomarker. The whole collection of computed biomarkers constitutes a global feature-matrix, which is then shown to give rise to a very effective learning algorithm. © Springer International Publishing Switzerland 2015.	en_US
dc.format.extent	129522 bytes	-
dc.format.mimetype	application/pdf	-
dc.relation (關聯)	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9166, 395-402	en_US
dc.relation (關聯)	11th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2015; Hamburg; Germany; 20 July 2015 到 21 July 2015; 代碼 156629	en_US
dc.subject (關鍵詞)	Artificial intelligence; Biomarkers; Classification (of information); Data mining; Geometry; Learning systems; Microarrays; Pattern recognition; Supervised learning; Trees (mathematics); BiDCG; Complexity penalties; Data clouds; Dimension-reduction; High dimensionality; Microarray data sets; Pattern information; Semi- supervised learning; Learning algorithms	en_US
dc.title (題名)	Data driven geometry for learning	en_US
dc.type (資料類型)	conference
dc.identifier.doi (DOI)	10.1007/978-3-319-21024-7_27
dc.doi.uri (DOI)	http://dx.doi.org/10.1007/978-3-319-21024-7_27