學術產出-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 Incorporating the number of true null hypotheses to improve power in multiple testing: application to gene microarray data
作者 Hsueh, Huey-Miin ;Tsai, Chen-An ;Chen,James J.
薛慧敏;蔡政安
貢獻者 統計系
日期 2006-06
上傳時間 16-Dec-2014 10:39:05 (UTC+8)
摘要 Testing for significance with gene expression data from DNA microarray experiments involves simultaneous comparisons of hundreds or thousands of genes. In common exploratory microarray experiments, most genes are not expected to be differentially expressed. The family-wise error (FWE) rate and false discovery rate (FDR) are two common approaches used to account for multiple hypothesis tests to identify differentially expressed genes. When the number of hypotheses is very large and some null hypotheses are expected to be true, the power of an FWE or FDR procedure can be improved if the number of null hypotheses is known. The mean of differences (MD) of ranked p-values has been proposed to estimate the number of true null hypotheses under the independence model. This article proposes to incorporate the MD estimate into an FWE or FDR approach for gene identification. Simulation results show that the procedure appears to control the FWE and FDR well at the FWE=0.05 and FDR=0.05 significant levels; it exceeds the nominal level for FDR=0.01 when the null hypotheses are highly correlated, a correlation of 0.941. The proposed approach is applied to a public colon tumor data set for illustration.
關聯 Journal of Statistical Computation and Simulation,77(9),757-767.
資料類型 article
DOI http://dx.doi.org/10.1080/10629360600648651
dc.contributor 統計系en_US
dc.creator (作者) Hsueh, Huey-Miin ;Tsai, Chen-An ;Chen,James J.en_US
dc.creator (作者) 薛慧敏;蔡政安zh_TW
dc.date (日期) 2006-06en_US
dc.date.accessioned 16-Dec-2014 10:39:05 (UTC+8)-
dc.date.available 16-Dec-2014 10:39:05 (UTC+8)-
dc.date.issued (上傳時間) 16-Dec-2014 10:39:05 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/72089-
dc.description.abstract (摘要) Testing for significance with gene expression data from DNA microarray experiments involves simultaneous comparisons of hundreds or thousands of genes. In common exploratory microarray experiments, most genes are not expected to be differentially expressed. The family-wise error (FWE) rate and false discovery rate (FDR) are two common approaches used to account for multiple hypothesis tests to identify differentially expressed genes. When the number of hypotheses is very large and some null hypotheses are expected to be true, the power of an FWE or FDR procedure can be improved if the number of null hypotheses is known. The mean of differences (MD) of ranked p-values has been proposed to estimate the number of true null hypotheses under the independence model. This article proposes to incorporate the MD estimate into an FWE or FDR approach for gene identification. Simulation results show that the procedure appears to control the FWE and FDR well at the FWE=0.05 and FDR=0.05 significant levels; it exceeds the nominal level for FDR=0.01 when the null hypotheses are highly correlated, a correlation of 0.941. The proposed approach is applied to a public colon tumor data set for illustration.en_US
dc.format.extent 235 bytes-
dc.format.mimetype text/html-
dc.language.iso en_US-
dc.relation (關聯) Journal of Statistical Computation and Simulation,77(9),757-767.en_US
dc.title (題名) Incorporating the number of true null hypotheses to improve power in multiple testing: application to gene microarray dataen_US
dc.type (資料類型) articleen
dc.identifier.doi (DOI) 10.1080/10629360600648651en_US
dc.doi.uri (DOI) http://dx.doi.org/10.1080/10629360600648651en_US