TY - JOUR
T1 - Exploring NLP-Based Solutions to Social Media Moderation Challenges
AU - Saleous, Heba
AU - Gergely, Marton
AU - Shuaib, Khaled
N1 - Publisher Copyright:
Copyright © 2025 Heba Saleous et al. Human Behavior and Emerging Technologies published by John Wiley & Sons Ltd.
PY - 2025
Y1 - 2025
N2 - The rise of social media has revolutionized global communication, enabling users and businesses to connect, advertise, and monitor competitors. However, this expansion has also fueled toxic behaviors like hate speech and harassment, exposing innocent users to harmful content while overwhelming human moderators and impacting their well-being. To address these challenges, artificial intelligence (AI) and natural language processing (NLP) have been explored as potential solutions. The aim of this paper is to study existing AI-based moderation approaches to understand which models have been used, their effectiveness, and the challenges they face. This work conducts a targeted systematic literature review of research efforts that present a technical approach to the topic while sharing model results and highlighting the challenges encountered. The findings reveal that AI-driven moderation shows promise by achieving high accuracy but has some issues that need to be addressed, such as dataset imbalance, obstacles and inconsistencies, bias, and misinterpretation of message meanings. By summarizing existing research efforts and identifying key gaps, this study provides insights into the strengths and weaknesses of current AI-based solutions for content moderation.
AB - The rise of social media has revolutionized global communication, enabling users and businesses to connect, advertise, and monitor competitors. However, this expansion has also fueled toxic behaviors like hate speech and harassment, exposing innocent users to harmful content while overwhelming human moderators and impacting their well-being. To address these challenges, artificial intelligence (AI) and natural language processing (NLP) have been explored as potential solutions. The aim of this paper is to study existing AI-based moderation approaches to understand which models have been used, their effectiveness, and the challenges they face. This work conducts a targeted systematic literature review of research efforts that present a technical approach to the topic while sharing model results and highlighting the challenges encountered. The findings reveal that AI-driven moderation shows promise by achieving high accuracy but has some issues that need to be addressed, such as dataset imbalance, obstacles and inconsistencies, bias, and misinterpretation of message meanings. By summarizing existing research efforts and identifying key gaps, this study provides insights into the strengths and weaknesses of current AI-based solutions for content moderation.
KW - artificial intelligence
KW - content moderation
KW - hate speech
KW - natural language processing
KW - social media
KW - toxic behavior
UR - http://www.scopus.com/inward/record.url?scp=105008230849&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105008230849&partnerID=8YFLogxK
U2 - 10.1155/hbe2/9436490
DO - 10.1155/hbe2/9436490
M3 - Review article
AN - SCOPUS:105008230849
SN - 2578-1863
VL - 2025
JO - Human Behavior and Emerging Technologies
JF - Human Behavior and Emerging Technologies
IS - 1
M1 - 9436490
ER -