AUTOMATIC DETECTION OF HOMOPHOBIC SPEECH USING MACHINE LEARNING

Authors

  • Samuel Henrique Santos Silva Author
  • Erika Carlos Medeiros Author
  • Patrícia Cristina Moser Author
  • Bianca Gabriely Ferreira Silva Author
  • Fernando Ferreira de Carvalho Author
  • Jorge Cavalcanti Barbosa Fonsêca Author
  • Rômulo César Dias de Andrade Author
  • Marco Antônio de Oliveira Domingues Author

DOI:

https://doi.org/10.56238/arev7n5-029

Keywords:

Machine Learning. Hate Speech. Social Networks. Data mining. Natural Language Processing.  

Abstract

his research explores machine learning models to detect hate speech with homophobic contexts on social networks, a relevant problem in the digital age due to the negative impact on the LGBTQIA+ community. The overall objective is to train predictive models capable of identifying homophobic speech efficiently, contributing to the fight against hate speech and promoting a safer virtual environment. The CRISP-DM methodology was used, applying five phases: understanding the business, understanding and preparing data, modeling and evaluation. Six models were trained: Decision Tree, Random Forest, Extra Trees, Passive Aggressive, eXtreme Gradient Boosting and Support Vector Machine. The evaluation of the models used metrics such as accuracy, precision, recall and F1-Score, as well as analysis of the confusion matrix and the Receiver Operating Characteristic curve to measure the performance of each model. The SVM model had the best overall performance, with an accuracy of 87.10%, a precision of 79.15%, and an area under the curve of 0.9227, highlighting its effectiveness in minimizing false positives. The results highlight the potential of learning models in identifying hate speech and contribute to the construction of safer and more inclusive digital environments.

Downloads

Download data is not yet available.

Published

2025-05-02

Issue

Section

Articles

How to Cite

SILVA, Samuel Henrique Santos; MEDEIROS, Erika Carlos; MOSER, Patrícia Cristina; SILVA, Bianca Gabriely Ferreira; DE CARVALHO, Fernando Ferreira; FONSÊCA, Jorge Cavalcanti Barbosa; DE ANDRADE, Rômulo César Dias; DOMINGUES, Marco Antônio de Oliveira. AUTOMATIC DETECTION OF HOMOPHOBIC SPEECH USING MACHINE LEARNING. ARACÊ , [S. l.], v. 7, n. 5, p. 21496–21518, 2025. DOI: 10.56238/arev7n5-029. Disponível em: https://periodicos.newsciencepubl.com/arace/article/view/4820. Acesso em: 22 may. 2025.