HanSEL: Italian Hate Speech detection through Ensemble Learning and Deep Neural Networks

Marco Polignano et al., « HanSEL: Italian Hate Speech detection through Ensemble Learning and Deep Neural Networks », Accademia University Press, ID : 10.4000/books.aaccademia.4766

Partage / Export

Résumé En It

The detection of hate speeches, over social media and online forums, is a relevant task for the research area of natural language processing. This interest is motivated by the complexity of the task and the social impact of its use in real scenarios. The task solution proposed in this work is based on an ensemble of three classification strategies, mediated by a majority vote algorithm: Support Vector Machine (Hearst et al., 1998) (SVM with RBF kernel), Random Forest (Breiman,2001), Deep Multilayer Perceptron (Kolmogorov, 1992) (MLP). Each classifier has been tuned using a greedy strategy of hyper-parameters optimization over the “F1” score calculated on a 5-fold random subdivision of the training set. Each sentence has been pre-processed to transform it into word embeddings and TF-IDF bag of words. The results obtained on the cross-validation over the training sets have shown an F1 value of 0.8034 for Facebook sentences and 0.7102 for Twitter. The code of the system proposed can be downloaded from GitHub: https://github.com/marcopoli/haspeede_hate_detect

HanSEL: Italian Hate Speech detection through Ensemble Learning and Deep Neural Networks

Fiche du document

Mots-clés It Fr En Und

Sujets proches En

Citer ce document

Métriques

Partage / Export

Résumé En It

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en