學術產出-NSC Projects

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 運用核糖核酸測序實驗資料以偵測有差異表現量之基因
作者 薛慧敏
貢獻者 統計學系
關鍵詞 錯誤發現率;負二項分配;過度離散;擬概似函數;核糖核酸測序實驗;基因差異性檢定
Differential expression analysis; false discovery rate; Negative-Binomial; over-dispersion; pseudo likelihood estimation; RNA-Seq
日期 2014
上傳時間 25-Dec-2017 15:18:00 (UTC+8)
摘要 已知核糖核酸測序實驗(RNA-Seq)較微陣列實驗(microarray)具備多項優勢,如可避免雜交過程所帶來的雜訊,故準確度較高,以及不受序列為已知的需求限制等。預期隨著實驗成本逐漸降低,核糖核酸測序實驗將逐漸受到歡迎並且被廣泛運用。本研究將針對運用該類實驗資料以偵測顯著差異基因的問題提出統計分析方法。已知所獲得的基因資料為計數型態(count data),在考慮其過度離散(over-dispersion)性質下,我們將以負二項(Negative Binomial)分配來配適資料。為了降低計算難度,將採用最大擬概似函數(maximum pseudo likelihood)估計法。在基因的差異顯著性檢定上,我們則採用外顯類組(phenotypic group)均數差之Wald檢定統計量。本研究透過電腦模擬以驗證所提出的方法,並且運用此方法在數組實際資料上以評估其實用性。
Recently, the RNA-Seq experiment is developed for a high-throughput DNA sequencing method for mapping and quantifying the transcriptomes. The gene expression level obtained from a RNA-Seq experiment is of the count data type and is often fitted by a Negative-Binomial distribution to account for the over-dispersion. To identify the differentially expressed genes with a binary phenotypic response, we aim to develop a statistical test for comparing the means of two Negative-Binomial distributions. To ease the computational complexity, a maximum pseudo likelihood estimation (MPLE) is considered and the corresponding Wald’s test statistic is subsequently employed. Simulation studies are performed to justify the adequacy of the proposed test in comparison with existing methods. The applicability of the proposed method is demonstrated via the data analysis of some real example data sets.
關聯 執行起迄:2014/08/01~2015/08/31
103-2118-M-004-004
資料類型 report
dc.contributor 統計學系zh_Tw
dc.creator (作者) 薛慧敏zh_TW
dc.date (日期) 2014en_US
dc.date.accessioned 25-Dec-2017 15:18:00 (UTC+8)-
dc.date.available 25-Dec-2017 15:18:00 (UTC+8)-
dc.date.issued (上傳時間) 25-Dec-2017 15:18:00 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/115381-
dc.description.abstract (摘要) 已知核糖核酸測序實驗(RNA-Seq)較微陣列實驗(microarray)具備多項優勢,如可避免雜交過程所帶來的雜訊,故準確度較高,以及不受序列為已知的需求限制等。預期隨著實驗成本逐漸降低,核糖核酸測序實驗將逐漸受到歡迎並且被廣泛運用。本研究將針對運用該類實驗資料以偵測顯著差異基因的問題提出統計分析方法。已知所獲得的基因資料為計數型態(count data),在考慮其過度離散(over-dispersion)性質下,我們將以負二項(Negative Binomial)分配來配適資料。為了降低計算難度,將採用最大擬概似函數(maximum pseudo likelihood)估計法。在基因的差異顯著性檢定上,我們則採用外顯類組(phenotypic group)均數差之Wald檢定統計量。本研究透過電腦模擬以驗證所提出的方法,並且運用此方法在數組實際資料上以評估其實用性。zh_TW
dc.description.abstract (摘要) Recently, the RNA-Seq experiment is developed for a high-throughput DNA sequencing method for mapping and quantifying the transcriptomes. The gene expression level obtained from a RNA-Seq experiment is of the count data type and is often fitted by a Negative-Binomial distribution to account for the over-dispersion. To identify the differentially expressed genes with a binary phenotypic response, we aim to develop a statistical test for comparing the means of two Negative-Binomial distributions. To ease the computational complexity, a maximum pseudo likelihood estimation (MPLE) is considered and the corresponding Wald’s test statistic is subsequently employed. Simulation studies are performed to justify the adequacy of the proposed test in comparison with existing methods. The applicability of the proposed method is demonstrated via the data analysis of some real example data sets.en_US
dc.format.extent 4541701 bytes-
dc.format.mimetype application/pdf-
dc.relation (關聯) 執行起迄:2014/08/01~2015/08/31zh_TW
dc.relation (關聯) 103-2118-M-004-004zh_TW
dc.subject (關鍵詞) 錯誤發現率;負二項分配;過度離散;擬概似函數;核糖核酸測序實驗;基因差異性檢定zh_TW
dc.subject (關鍵詞) Differential expression analysis; false discovery rate; Negative-Binomial; over-dispersion; pseudo likelihood estimation; RNA-Seqen_US
dc.title (題名) 運用核糖核酸測序實驗資料以偵測有差異表現量之基因_TW
dc.type (資料類型) report