From hate to harmony: Leveraging large language models for safer speech in times of COVID-19 crisis | Publication | NCCU Academic Hub

Title	From hate to harmony: Leveraging large language models for safer speech in times of COVID-19 crisis
Authors	Li, Bo-Yi (李博逸); Chao, August F.Y.; Wang, Chen-Shu; Chen, Hong-Yan
Contributor	Department of Management Information Systems, PhD program, year 7 (資管博七)
Date	2024-08
Upload time	28-Oct-2024 11:42:59 (UTC+8)
Abstract	This study investigates the spread of offensive and derogatory language during the COVID-19 pandemic and aims to mitigate it through machine learning. Using Large Language Models (LLMs), the research develops a framework for detecting and transforming abusive and hateful speech. The project begins by compiling a dataset focused on Chinese-language abuse and hate speech, incorporating a list of 30 pandemic-related terms that adds to the resources available for this type of research. A two-tier detection model is then introduced, achieving an accuracy of 94.42% in its first phase and 81.48% in the second. The study also improves paraphrasing by combining generative AI techniques, primarily Large Language Models, with a Latent Dirichlet Allocation (LDA) topic model, which allows the language to be analyzed before and after modification. The results show that the rephrased statements reduce the initial hostility while preserving the essential themes and meanings. The approach offers users rephrasing suggestions intended to prevent the spread of hate speech, contributing to more positive and constructive public discourse.
Relation	Heliyon, Vol.10, No.16, pp.1-32
Type	article
DOI	https://doi.org/10.1016/j.heliyon.2024.e35468

dc.contributor	Department of Management Information Systems, PhD program, year 7 (資管博七)
dc.creator (Author)	李博逸
dc.creator (Author)	Li, Bo-Yi;Chao, August F.Y.;Wang, Chen-Shu;Chen, Hong-Yan
dc.date (Date)	2024-08
dc.date.accessioned	28-Oct-2024 11:42:59 (UTC+8)
dc.date.available	28-Oct-2024 11:42:59 (UTC+8)
dc.date.issued (Upload time)	28-Oct-2024 11:42:59 (UTC+8)
dc.identifier.uri (URI)	https://ah.lib.nccu.edu.tw/item?item_id=173644
dc.format.extent	109 bytes
dc.format.mimetype	text/html
dc.relation (Relation)	Heliyon, Vol.10, No.16, pp.1-32
dc.title (Title)	From hate to harmony: Leveraging large language models for safer speech in times of COVID-19 crisis
dc.type (Type)	article
dc.identifier.doi (DOI)	10.1016/j.heliyon.2024.e35468
dc.doi.uri (DOI)	https://doi.org/10.1016/j.heliyon.2024.e35468