Publications-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 Coreference resolution of medical concepts in discharge summaries by exploiting contextual information
作者 Lai, Po-Ting
賴柏廷
Chen, C.-Y.
Dai, H.-J.
貢獻者 資科系
關鍵詞 accuracy; article; hospital discharge; hospital patient; information dissemination; medical information; model; natural language processing; patient discharge summary; artificial intelligence; automated pattern recognition; computer simulation; data mining; electronic medical record; evaluation; hospital discharge; human; methodology; multicenter study; natural language processing; semantics; United States; Artificial Intelligence; Computer Simulation; Data Mining; Electronic Health Records; Humans; Natural Language Processing; Patient Discharge; Pattern Recognition, Automated; Semantics; United States
日期 2012-09
上傳時間 12-May-2015 16:05:59 (UTC+8)
摘要 Objective: Patient discharge summaries provide detailed medical information about hospitalized patients and are a rich resource of data for clinical record text mining. The textual expressions of this information are highly variable. In order to acquire a precise understanding of the patient, it is important to uncover the relationship between all instances in the text. In natural language processing (NLP), this task falls under the category of coreference resolution. Design: A key contribution of this paper is the application of contextual-dependent rules that describe relationships between coreference pairs. To resolve phrases that refer to the same entity, the authors use these rules in three representative NLP systems: one rule-based, another based on the maximum entropy model, and the last a system built on the Markov logic network (MLN) model. Results: The experimental results show that the proposed MLN-based system outperforms the baseline system (exact match) by average F-scores of 4.3% and 5.7% on the Beth and Partners datasets, respectively. Finally, the three systems were integrated into an ensemble system, further improving performance to 87.21%, which is 4.5% more than the official i2b2 Track 1C average (82.7%). Conclusion: In this paper, the main challenges in the resolution of coreference relations in patient discharge summaries are described. Several rules are proposed to exploit contextual information, and three approaches presented. While single systems provided promising results, an ensemble approach combining the three systems produced a better performance than even the best single system.
關聯 Journal of the American Medical Informatics Association, Volume 19, Issue 5, 2012, Pages 888-896
資料類型 article
DOI http://dx.doi.org/10.1136/amiajnl-2012-000808
dc.contributor 資科系-
dc.creator (作者) Lai, Po-Ting-
dc.creator (作者) 賴柏廷zh_TW
dc.creator (作者) Chen, C.-Y.en_US
dc.creator (作者) Dai, H.-J.en_US
dc.date (日期) 2012-09-
dc.date.accessioned 12-May-2015 16:05:59 (UTC+8)-
dc.date.available 12-May-2015 16:05:59 (UTC+8)-
dc.date.issued (上傳時間) 12-May-2015 16:05:59 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/75088-
dc.description.abstract (摘要) Objective: Patient discharge summaries provide detailed medical information about hospitalized patients and are a rich resource of data for clinical record text mining. The textual expressions of this information are highly variable. In order to acquire a precise understanding of the patient, it is important to uncover the relationship between all instances in the text. In natural language processing (NLP), this task falls under the category of coreference resolution. Design: A key contribution of this paper is the application of contextual-dependent rules that describe relationships between coreference pairs. To resolve phrases that refer to the same entity, the authors use these rules in three representative NLP systems: one rule-based, another based on the maximum entropy model, and the last a system built on the Markov logic network (MLN) model. Results: The experimental results show that the proposed MLN-based system outperforms the baseline system (exact match) by average F-scores of 4.3% and 5.7% on the Beth and Partners datasets, respectively. Finally, the three systems were integrated into an ensemble system, further improving performance to 87.21%, which is 4.5% more than the official i2b2 Track 1C average (82.7%). Conclusion: In this paper, the main challenges in the resolution of coreference relations in patient discharge summaries are described. Several rules are proposed to exploit contextual information, and three approaches presented. While single systems provided promising results, an ensemble approach combining the three systems produced a better performance than even the best single system.-
dc.format.extent 176 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) Journal of the American Medical Informatics Association, Volume 19, Issue 5, 2012, Pages 888-896-
dc.subject (關鍵詞) accuracy; article; hospital discharge; hospital patient; information dissemination; medical information; model; natural language processing; patient discharge summary; artificial intelligence; automated pattern recognition; computer simulation; data mining; electronic medical record; evaluation; hospital discharge; human; methodology; multicenter study; natural language processing; semantics; United States; Artificial Intelligence; Computer Simulation; Data Mining; Electronic Health Records; Humans; Natural Language Processing; Patient Discharge; Pattern Recognition, Automated; Semantics; United States-
dc.title (題名) Coreference resolution of medical concepts in discharge summaries by exploiting contextual information-
dc.type (資料類型) articleen
dc.identifier.doi (DOI) 10.1136/amiajnl-2012-000808-
dc.doi.uri (DOI) http://dx.doi.org/10.1136/amiajnl-2012-000808-