Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 集成式模型應用於預測院內心臟驟停:整合多維度生命徵象數據
Bagged Ensemble Model for In-Hospital Cardiac Arrest Prediction: Integrating Multi-Dimensional Vital Signs Data
作者 翁祐祥
Weng, Yu-Hsiang
貢獻者 邱淑怡
Chiu, Shu-I
翁祐祥
Weng, Yu-Hsiang
關鍵詞 心臟驟停
數據不平衡
集成式學習
相互資訊
cardiac arrest
imbalanced data
ensemble learning
mutual information
日期 2024
上傳時間 4-Sep-2024 15:02:34 (UTC+8)
摘要 在本研究中,我們探討了如何利用生命徵象 (vital signs) 的時序資料來預測住院病人的心臟驟停 (In-Hospital Cardiac Arrest)。我們將生命徵象的時序數據轉換為三維的特徵資料,以便更有效地捕捉時間序列中的變化模式。為了提升特徵選擇的準確性,我們採用了相互資訊 (mutual information) 作為選擇指標,以確保選出的特徵對於預測模型具有最重要的貢獻。 面對數據不平衡 (imbalanced data) 的挑戰,我們運用了各種資料擴增方法來平衡樣本分佈,從而提高模型的泛化能力和預測準確性。接著,我們應用了多種機器學習算法來建立預測模型,並利用集成式學習 (ensemble learning) 方法進行模型的整合。這些方法的結合旨在充分發揮不同模型的優勢,提高預測的穩定性和準確性。 實驗結果顯示,經過特徵選擇、資料擴增以及集成式學習等一系列處理後,我們所建立的模型在預測住院病人心臟驟停的準確性上均取得了顯著的提升。本研究的結果不僅展示了時序數據在預測心臟驟停方面的潛力,也為未來在此領域的研究提供了新的思路和方法。
In this study, we investigated how to utilize time-series data of vital signs to predict in-hospital cardiac arrest. We transformed the time-series data of vital signs into three-dimensional feature data to more effectively capture the variation patterns within the time series. To enhance the accuracy of feature selection, we employed mutual information as a selection criterion to ensure that the chosen features made the most significant contribution to the predictive model. In addressing the challenge of imbalanced data, we used various data augmentation methods to balance the sample distribution, thereby improving the model's generalization capability and predictive accuracy. Subsequently, we applied multiple machine learning algorithms to build the prediction model and employed ensemble learning methods for model integration. The combination of these methods aimed to fully leverage the strengths of different models, enhancing prediction stability and accuracy. Experimental results demonstrated that after feature selection, data augmentation, and ensemble learning, the model we developed achieved significant improvements in the accuracy of predicting in-hospital cardiac arrest. The findings of this study not only highlight the potential of time-series data in predicting cardiac arrest but also provide new insights and methods for future research in this area.
參考文獻 1. Smith, G.B., et al., A Comparison of the Ability of the Physiologic Components of Medical Emergency Team Criteria and the U.K. National Early Warning Score to Discriminate Patients at Risk of a Range of Adverse Clinical Outcomes*. Critical Care Medicine, 2016. 44(12). 2. Subbe, C.P., et al., Validation of a modified Early Warning Score in medical admissions. QJM: An International Journal of Medicine, 2001. 94(10): p. 521-526. 3. Chang, H.K., et al. Early Detecting In-Hospital Cardiac Arrest Based on Machine Learning on Imbalanced Data. in 2019 IEEE International Conference on Healthcare Informatics (ICHI). 2019. 4. Tanii, R., et al., Impact of dynamic parameter of trends in vital signs on the prediction of serious events in hospitalized patients -a retrospective observational study. Resuscitation Plus, 2024. 18: p. 100628. 5. Kwon, J.m., et al., An Algorithm Based on Deep Learning for Predicting In‐Hospital Cardiac Arrest. Journal of the American Heart Association, 2018. 7(13): p. e008678. 6. Su, C.-F., et al., Improved inpatient deterioration detection in general wards by using time-series vital signs. Scientific Reports, 2022. 12(1): p. 11901. 7. Mahajan, P., et al. Ensemble Learning for Disease Prediction: A Review. Healthcare, 2023. 11, DOI: 10.3390/healthcare11121808. 8. Pei, C., et al., Ensemble Learning for Early-Response Prediction of Antidepressant Treatment in Major Depressive Disorder. Journal of Magnetic Resonance Imaging, 2020. 52(1): p. 161-171. 9. Awad, A., et al., Early hospital mortality prediction of intensive care unit patients using an ensemble learning approach. International Journal of Medical Informatics, 2017. 108: p. 185-195. 10. Su, C.F., et al., Improved inpatient deterioration detection in general wards by using time-series vital signs. Sci Rep, 2022. 12(1): p. 11901. 11. Vergara, J.R. and P.A. Estévez, A review of feature selection methods based on mutual information. Neural Computing and Applications, 2014. 24(1): p. 175-186. 12. Ross, B.C., Mutual information between discrete and continuous data sets. PloS one, 2014. 9(2): p. e87357. 13. Mohammed, R., J. Rawashdeh, and M. Abdullah. Machine Learning with Oversampling and Undersampling Techniques: Overview Study and Experimental Results. in 2020 11th International Conference on Information and Communication Systems (ICICS). 2020. 14. Chawla, N.V., et al., SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 2002. 16: p. 321-357. 15. He, H., et al. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. in 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence). 2008. Ieee. 16. Kumari, S., D. Kumar, and M. Mittal, An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier. International Journal of Cognitive Computing in Engineering, 2021. 2: p. 40-46. 17. Salur, M.U. and İ. Aydın, A soft voting ensemble learning-based approach for multimodal sentiment analysis. Neural Computing and Applications, 2022. 34(21): p. 18391-18406.
描述 碩士
國立政治大學
資訊科學系
111753226
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0111753226
資料類型 thesis
dc.contributor.advisor 邱淑怡zh_TW
dc.contributor.advisor Chiu, Shu-Ien_US
dc.contributor.author (Authors) 翁祐祥zh_TW
dc.contributor.author (Authors) Weng, Yu-Hsiangen_US
dc.creator (作者) 翁祐祥zh_TW
dc.creator (作者) Weng, Yu-Hsiangen_US
dc.date (日期) 2024en_US
dc.date.accessioned 4-Sep-2024 15:02:34 (UTC+8)-
dc.date.available 4-Sep-2024 15:02:34 (UTC+8)-
dc.date.issued (上傳時間) 4-Sep-2024 15:02:34 (UTC+8)-
dc.identifier (Other Identifiers) G0111753226en_US
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/153393-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊科學系zh_TW
dc.description (描述) 111753226zh_TW
dc.description.abstract (摘要) 在本研究中,我們探討了如何利用生命徵象 (vital signs) 的時序資料來預測住院病人的心臟驟停 (In-Hospital Cardiac Arrest)。我們將生命徵象的時序數據轉換為三維的特徵資料,以便更有效地捕捉時間序列中的變化模式。為了提升特徵選擇的準確性,我們採用了相互資訊 (mutual information) 作為選擇指標,以確保選出的特徵對於預測模型具有最重要的貢獻。 面對數據不平衡 (imbalanced data) 的挑戰,我們運用了各種資料擴增方法來平衡樣本分佈,從而提高模型的泛化能力和預測準確性。接著,我們應用了多種機器學習算法來建立預測模型,並利用集成式學習 (ensemble learning) 方法進行模型的整合。這些方法的結合旨在充分發揮不同模型的優勢,提高預測的穩定性和準確性。 實驗結果顯示,經過特徵選擇、資料擴增以及集成式學習等一系列處理後,我們所建立的模型在預測住院病人心臟驟停的準確性上均取得了顯著的提升。本研究的結果不僅展示了時序數據在預測心臟驟停方面的潛力,也為未來在此領域的研究提供了新的思路和方法。zh_TW
dc.description.abstract (摘要) In this study, we investigated how to utilize time-series data of vital signs to predict in-hospital cardiac arrest. We transformed the time-series data of vital signs into three-dimensional feature data to more effectively capture the variation patterns within the time series. To enhance the accuracy of feature selection, we employed mutual information as a selection criterion to ensure that the chosen features made the most significant contribution to the predictive model. In addressing the challenge of imbalanced data, we used various data augmentation methods to balance the sample distribution, thereby improving the model's generalization capability and predictive accuracy. Subsequently, we applied multiple machine learning algorithms to build the prediction model and employed ensemble learning methods for model integration. The combination of these methods aimed to fully leverage the strengths of different models, enhancing prediction stability and accuracy. Experimental results demonstrated that after feature selection, data augmentation, and ensemble learning, the model we developed achieved significant improvements in the accuracy of predicting in-hospital cardiac arrest. The findings of this study not only highlight the potential of time-series data in predicting cardiac arrest but also provide new insights and methods for future research in this area.en_US
dc.description.tableofcontents 第一章 緒論 1 第二章 文獻探討 4 第一節 預測院內心臟驟停 4 第二節 集成式模型醫療數據的應用 6 第三章 研究方法 8 第一節 實驗數據 8 第二節 數據前處理 8 第三節 模型使用 10 第四節 模型結果整合 12 第五節 評估方法 13 第四章 實驗分析 16 第一節 資料集特徵擴增 16 第二節 訓練集與測試集 18 第三節 特徵選擇標準 19 第四節 特徵群選擇策略一:單一特徵 23 第五節 特徵群選擇策略二:變量方向篩選 25 第六節 比較整合模型與擴增方法 27 第七節 相關文獻方法的比較 31 第五章 結論 36 參考文獻 (References) 38zh_TW
dc.format.extent 1103558 bytes-
dc.format.mimetype application/pdf-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0111753226en_US
dc.subject (關鍵詞) 心臟驟停zh_TW
dc.subject (關鍵詞) 數據不平衡zh_TW
dc.subject (關鍵詞) 集成式學習zh_TW
dc.subject (關鍵詞) 相互資訊zh_TW
dc.subject (關鍵詞) cardiac arresten_US
dc.subject (關鍵詞) imbalanced dataen_US
dc.subject (關鍵詞) ensemble learningen_US
dc.subject (關鍵詞) mutual informationen_US
dc.title (題名) 集成式模型應用於預測院內心臟驟停:整合多維度生命徵象數據zh_TW
dc.title (題名) Bagged Ensemble Model for In-Hospital Cardiac Arrest Prediction: Integrating Multi-Dimensional Vital Signs Dataen_US
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) 1. Smith, G.B., et al., A Comparison of the Ability of the Physiologic Components of Medical Emergency Team Criteria and the U.K. National Early Warning Score to Discriminate Patients at Risk of a Range of Adverse Clinical Outcomes*. Critical Care Medicine, 2016. 44(12). 2. Subbe, C.P., et al., Validation of a modified Early Warning Score in medical admissions. QJM: An International Journal of Medicine, 2001. 94(10): p. 521-526. 3. Chang, H.K., et al. Early Detecting In-Hospital Cardiac Arrest Based on Machine Learning on Imbalanced Data. in 2019 IEEE International Conference on Healthcare Informatics (ICHI). 2019. 4. Tanii, R., et al., Impact of dynamic parameter of trends in vital signs on the prediction of serious events in hospitalized patients -a retrospective observational study. Resuscitation Plus, 2024. 18: p. 100628. 5. Kwon, J.m., et al., An Algorithm Based on Deep Learning for Predicting In‐Hospital Cardiac Arrest. Journal of the American Heart Association, 2018. 7(13): p. e008678. 6. Su, C.-F., et al., Improved inpatient deterioration detection in general wards by using time-series vital signs. Scientific Reports, 2022. 12(1): p. 11901. 7. Mahajan, P., et al. Ensemble Learning for Disease Prediction: A Review. Healthcare, 2023. 11, DOI: 10.3390/healthcare11121808. 8. Pei, C., et al., Ensemble Learning for Early-Response Prediction of Antidepressant Treatment in Major Depressive Disorder. Journal of Magnetic Resonance Imaging, 2020. 52(1): p. 161-171. 9. Awad, A., et al., Early hospital mortality prediction of intensive care unit patients using an ensemble learning approach. International Journal of Medical Informatics, 2017. 108: p. 185-195. 10. Su, C.F., et al., Improved inpatient deterioration detection in general wards by using time-series vital signs. Sci Rep, 2022. 12(1): p. 11901. 11. Vergara, J.R. and P.A. Estévez, A review of feature selection methods based on mutual information. Neural Computing and Applications, 2014. 24(1): p. 175-186. 12. Ross, B.C., Mutual information between discrete and continuous data sets. PloS one, 2014. 9(2): p. e87357. 13. Mohammed, R., J. Rawashdeh, and M. Abdullah. Machine Learning with Oversampling and Undersampling Techniques: Overview Study and Experimental Results. in 2020 11th International Conference on Information and Communication Systems (ICICS). 2020. 14. Chawla, N.V., et al., SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 2002. 16: p. 321-357. 15. He, H., et al. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. in 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence). 2008. Ieee. 16. Kumari, S., D. Kumar, and M. Mittal, An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier. International Journal of Cognitive Computing in Engineering, 2021. 2: p. 40-46. 17. Salur, M.U. and İ. Aydın, A soft voting ensemble learning-based approach for multimodal sentiment analysis. Neural Computing and Applications, 2022. 34(21): p. 18391-18406.zh_TW