A comparison of classification algorithms for

Abstract

Freedom of opinion through social media is frequently affect a negative impact that spreads hatred. This study aims to automatically detect Indonesian tweets that contain hate speech on Twitter social media. The data used amounted to 4,002 tweets related to politics, religion, ethnicity and race in Indonesia. The application model uses classification methods with machine learning algorithms such as Naïve Bayes, Multi Level Perceptron, AdaBoost Classifier, Decision Tree and Support Vector Machine. The study also compared the performance of the model using SMOTE to overcome imbalanced data. The results show that the Multinomial Naive Bayes algorithm produces the best model with the highest recall value of 93.2% which has an accuracy value of 71.2% for the classification of hate speech. Therefore, the Multinomial Naïve Bayes algorithm without SMOTE is recommended as the model to detect hate speech on social media.

Details

Title

A comparison of classification algorithms for hate speech detection

Author

Putri, T T A¹; Sriadhi, S¹; Sari, R D¹; Rahmadani, R¹; Hutahaean, H D¹

¹ PTIK-FT, Universitas Negeri Medan, Indonesia

Publication year

2020

Publication date

Apr 2020

Publisher

IOP Publishing

ISSN

17578981

e-ISSN

1757899X

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1088/1757-899X/830/3/032006

ProQuest document ID

2562402524

© 2020. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

A comparison of classification algorithms for hate speech detection

Jump to:

Abstract

Details

Suggested sources