學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 Supervised word sense disambiguation on polysemy with bidirectional LSTM
作者 劉吉軒
Jyi-Shane Liu
Lai, Huei-Ling
Hsu, Hsiao-Ling
Lin, Chia-Hung
Chen, Yanhong
貢獻者 資科系
關鍵詞 Word sense disambiguation ; POSTaiwan Hakka ; low-resource language ; neural network models
日期 2020-05
上傳時間 4-Jun-2021 14:50:30 (UTC+8)
摘要 While word sense disambiguation (WSD) has been extensively studied in natural language processing, such a task in low-resource languages still receives little attention. Findings based on a few dominant languages may lead to narrow applications. A language-specific WSD system is in need to implement in low-resource languages, for instance, in Taiwan Hakka. This study examines the performance of DNN and Bi-LSTM in WSD tasks on polysemous BUNin Taiwan Hakka. Both models are trained and tested on a small amount of hand-crafted labeled data. Two experiments are designed with four kinds of input features and two window spans to explore what information is needed for the models to achieve their best performance. The results show that to achieve the best performance, DNN and Bi-LSTM models prefer different kinds of input features and window spans.
關聯 The 21st Chinese Lexical Semantics Workshop (CLSW2020), City University of Hong Kong
資料類型 conference
DOI https://doi.org/10.1142/S2717554520500113
dc.contributor 資科系
dc.creator (作者) 劉吉軒
dc.creator (作者) Jyi-Shane Liu
dc.creator (作者) Lai, Huei-Ling
dc.creator (作者) Hsu, Hsiao-Ling
dc.creator (作者) Lin, Chia-Hung
dc.creator (作者) Chen, Yanhong
dc.date (日期) 2020-05
dc.date.accessioned 4-Jun-2021 14:50:30 (UTC+8)-
dc.date.available 4-Jun-2021 14:50:30 (UTC+8)-
dc.date.issued (上傳時間) 4-Jun-2021 14:50:30 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/135539-
dc.description.abstract (摘要) While word sense disambiguation (WSD) has been extensively studied in natural language processing, such a task in low-resource languages still receives little attention. Findings based on a few dominant languages may lead to narrow applications. A language-specific WSD system is in need to implement in low-resource languages, for instance, in Taiwan Hakka. This study examines the performance of DNN and Bi-LSTM in WSD tasks on polysemous BUNin Taiwan Hakka. Both models are trained and tested on a small amount of hand-crafted labeled data. Two experiments are designed with four kinds of input features and two window spans to explore what information is needed for the models to achieve their best performance. The results show that to achieve the best performance, DNN and Bi-LSTM models prefer different kinds of input features and window spans.
dc.format.extent 191780 bytes-
dc.format.mimetype application/pdf-
dc.relation (關聯) The 21st Chinese Lexical Semantics Workshop (CLSW2020), City University of Hong Kong
dc.subject (關鍵詞) Word sense disambiguation ; POSTaiwan Hakka ; low-resource language ; neural network models
dc.title (題名) Supervised word sense disambiguation on polysemy with bidirectional LSTM
dc.type (資料類型) conference
dc.identifier.doi (DOI) 10.1142/S2717554520500113
dc.doi.uri (DOI) https://doi.org/10.1142/S2717554520500113