000 | 02608nam a22002657a 4500 | ||
---|---|---|---|
003 | KOHA | ||
005 | 20250110111010.0 | ||
008 | 240927d2024 cy ed||| |||| 00| 0 eng d | ||
040 |
_aCY-NiCIU _beng _cCY-NiCIU _erda |
||
041 | _aeng | ||
090 |
_aYL 3578 _bN46 2024 |
||
100 | 1 | _aNgoy, Isaac Mumba | |
245 | 1 | 0 |
_aUTILIZATION OF ADVANCED MACHINE LEARNING TECHNIQUES FOR DETECTING HATE SPEECH ON SOCIAL MEDIA PLATFORMS / _cISAAC MUMBA NGOY ; SUPERVISOR, ASST. PROF. DR. KIAN JAZAYERI |
264 | _c2024 | ||
300 |
_a64 sheets ; _c30 cm _e+1 CD ROM |
||
336 |
_2rdacontent _atext _btxt |
||
337 |
_2rdamedia _aunmediated _bn |
||
338 |
_2rdacarrier _avolume _bnc |
||
502 | _aThesis (MSc) - Cyprus International University. Institute of Graduate Studies and Research Management Information System | ||
520 | _aThis study examines the efficacy of diverse machine learning models in detecting hate speech within English-language tweets, with a focus on advanced ensemble methods. The study evaluates a range of models, including Random Forest, Stacking Classifier, Support Vector Machine (SVM), Logistic Regression, Naive Bayes, K-Nearest Neighbors (KNN), AdaBoost, and Gradient Boosting. Random Forest emerged as the top performer, achieving an accuracy of 99.90%, precision of 99.94%, recall of 99.87%, F1-score of 99.90%, and an AUC-ROC of 0.999, closely followed by the Stacking Classifier and SVM. A key contribution of this research lies in its emphasis on preprocessing techniques, particularly the use of lemmatization and contraction expansion, which have been less commonly applied in the field compared to stemming. These techniques, along with text cleaning, normalization, and tokenization, were crucial in improving the models' accuracy and ability to capture the nuances of hate speech. Feature extraction was conducted using the Term Frequency-Inverse Document Frequency (TF-IDF), further augmenting the models' ability to differentiate between hate speech and non-hate speech content. The study highlights the significance of sophisticated preprocessing in increasing the robustness of machine learning models for hate speech detection. This research delivers critical insights that can enhance the effectiveness of hate speech detection systems on social media platforms and establishes a foundation for future studies focused on advanced deep learning approaches and the ethical aspects of deploying these models. | ||
650 | 0 |
_aManagement Information System _vDissertations, Academic |
|
700 | 1 |
_aJazayeri, Kian _esupervisor |
|
942 |
_2ddc _cTS |
||
999 |
_c293069 _d293069 |