Publications-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 From hate to harmony: Leveraging large language models for safer speech in times of COVID-19 crisis
作者 李博逸
Li, Bo-Yi;Chao, August F.Y.;Wang, Chen-Shu;Chen, Hong-Yan
貢獻者 資管博七
日期 2024-08
上傳時間 28-Oct-2024 11:42:59 (UTC+8)
摘要 This study investigates the rampant spread of offensive and derogatory language during the COVID-19 pandemic and aims to mitigate it through machine learning. Employing advanced Large Language Models (LLMs), the research develops a sophisticated framework adept at detecting and transforming abusive and hateful speech. The project begins by meticulously compiling a dataset, focusing specifically on Chinese language abuse and hate speech. It incorporates an extensive list of 30 pandemic-related terms, significantly enriching the resources available for this type of research. A two-tier detection model is then introduced, achieving a remarkable accuracy of 94.42 % in its first phase and an impressive 81.48 % in the second. Furthermore, the study enhances paraphrasing efficiency by integrating generative AI techniques, primarily Large Language Models, with a Latent Dirichlet Allocation (LDA) topic model. This combination allows for a thorough analysis of language before and after modification. The results highlight the transformative power of these methods. They show that the rephrased statements not only reduce the initial hostility but also preserve the essential themes and meanings. This breakthrough offers users effective rephrasing suggestions to prevent the spread of hate speech, contributing to more positive and constructive public discourse.
關聯 Heliyon, Vol.10, No.16, pp.1-32
資料類型 article
DOI https://doi.org/10.1016/j.heliyon.2024.e35468
dc.contributor 資管博七
dc.creator (作者) 李博逸
dc.creator (作者) Li, Bo-Yi;Chao, August F.Y.;Wang, Chen-Shu;Chen, Hong-Yan
dc.date (日期) 2024-08
dc.date.accessioned 28-Oct-2024 11:42:59 (UTC+8)-
dc.date.available 28-Oct-2024 11:42:59 (UTC+8)-
dc.date.issued (上傳時間) 28-Oct-2024 11:42:59 (UTC+8)-
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/154120-
dc.description.abstract (摘要) This study investigates the rampant spread of offensive and derogatory language during the COVID-19 pandemic and aims to mitigate it through machine learning. Employing advanced Large Language Models (LLMs), the research develops a sophisticated framework adept at detecting and transforming abusive and hateful speech. The project begins by meticulously compiling a dataset, focusing specifically on Chinese language abuse and hate speech. It incorporates an extensive list of 30 pandemic-related terms, significantly enriching the resources available for this type of research. A two-tier detection model is then introduced, achieving a remarkable accuracy of 94.42 % in its first phase and an impressive 81.48 % in the second. Furthermore, the study enhances paraphrasing efficiency by integrating generative AI techniques, primarily Large Language Models, with a Latent Dirichlet Allocation (LDA) topic model. This combination allows for a thorough analysis of language before and after modification. The results highlight the transformative power of these methods. They show that the rephrased statements not only reduce the initial hostility but also preserve the essential themes and meanings. This breakthrough offers users effective rephrasing suggestions to prevent the spread of hate speech, contributing to more positive and constructive public discourse.
dc.format.extent 109 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) Heliyon, Vol.10, No.16, pp.1-32
dc.title (題名) From hate to harmony: Leveraging large language models for safer speech in times of COVID-19 crisis
dc.type (資料類型) article
dc.identifier.doi (DOI) 10.1016/j.heliyon.2024.e35468
dc.doi.uri (DOI) https://doi.org/10.1016/j.heliyon.2024.e35468