Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/72230
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor | Department of Statistics | en_US |
| dc.creator | Wang, Zhanfeng; Chang, Yuan-chin; Ying, Zhiliang; Liang, Zhu; Yang, Yaning | en_US |
| dc.date | 2007-09 | en_US |
| dc.date.accessioned | 2014-12-23T07:20:05Z | - |
| dc.date.available | 2014-12-23T07:20:05Z | - |
| dc.date.issued | 2014-12-23T07:20:05Z | - |
| dc.identifier.uri | http://nccur.lib.nccu.edu.tw/handle/140.119/72230 | - |
| dc.description.abstract | Motivation: Protein expression profiling for differences indicative of early cancer holds promise for improving diagnostics. Due to their high dimensionality, statistical analysis of proteomic data from mass spectrometers is challenging in many aspects such as dimension reduction, feature subset selection as well as construction of classification rules. Search of an optimal feature subset, commonly known as the feature subset selection (FSS) problem, is an important step towards disease classification/diagnostics with biomarkers. Methods: We develop a parsimonious threshold-independent feature selection (PTIFS) method based on the concept of area under the curve (AUC) of the receiver operating characteristic (ROC). To reduce computational complexity to a manageable level, we use a sigmoid approximation to the empirical AUC as the criterion function. Starting from an anchor feature, the PTIFS method selects a feature subset through an iterative updating algorithm. Highly correlated features that have similar discriminating power are precluded from being selected simultaneously. The classification rule is then determined from the resulting feature subset. Results: The performance of the proposed approach is investigated by extensive simulation studies, and by applying the method to two mass spectrometry data sets of prostate cancer and of liver cancer. We compare the new approach with the threshold gradient descent regularization (TGDR) method. The results show that our method can achieve comparable performance to that of the TGDR method in terms of disease classification, but with fewer features selected. | en_US |
| dc.format.extent | 128 bytes | - |
| dc.format.mimetype | text/html | - |
| dc.language.iso | en_US | - |
| dc.relation | Bioinformatics, 23(20), 2788-2794 | en_US |
| dc.title | A parsimonious threshold-independent protein feature selection method through the area under receiver operating characteristic curve | en_US |
| dc.type | article | en |
| item.languageiso639-1 | en_US | - |
| item.fulltext | With Fulltext | - |
| item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
| item.grantfulltext | restricted | - |
| item.openairetype | article | - |
| item.cerifentitytype | Publications | - |
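The abstract mentions replacing the empirical AUC with a sigmoid approximation to make the criterion cheaper to optimize. The record does not give the exact form used in the paper, so the following is only a minimal sketch of the general idea, assuming a logistic function and a hypothetical `bandwidth` parameter; the function names and sample scores are illustrative and are not taken from the PTIFS method itself.

```python
import math


def empirical_auc(cases, controls):
    """Fraction of (case, control) pairs ranked correctly; ties count one half."""
    total = 0.0
    for x in cases:
        for y in controls:
            if x > y:
                total += 1.0
            elif x == y:
                total += 0.5
    return total / (len(cases) * len(controls))


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def smoothed_auc(cases, controls, bandwidth=0.1):
    """Smooth surrogate: the indicator 1[x > y] is replaced by sigmoid((x - y) / bandwidth).

    `bandwidth` is an assumed tuning parameter; smaller values track the
    empirical AUC more closely but make the surrogate less smooth.
    """
    total = sum(sigmoid((x - y) / bandwidth) for x in cases for y in controls)
    return total / (len(cases) * len(controls))


# Illustrative scores for one feature (made-up numbers, not data from the paper).
cases = [2.1, 3.4, 1.8, 2.9]
controls = [1.0, 2.0, 1.5, 2.5]

auc_exact = empirical_auc(cases, controls)
auc_smooth = smoothed_auc(cases, controls, bandwidth=0.01)
```

Because the surrogate is differentiable in the scores, it can be used with gradient-based or iterative updates, which a step-function AUC does not allow directly.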
Appears in Collections: Journal Articles
Files in This Item:
| File | Description | Size | Format |
| --- | --- | --- | --- |
| index.html | - | 128 B | HTML2 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.