000 03086nam a22002657a 4500
003 KOHA
005 20250110130645.0
008 240927d2024 cy e|||| |||| 00| 0 eng d
040 _aCY-NiCIU
_beng
_cCY-NiCIU
_erda
041 _aeng
090 _aYL 3559
_bK35 2024
100 1 _aAl-Kaladi, Bandar Khaled Mohammed
245 1 0 _aIDENTIFYING HARMFUL TWEETS: COMPARATIVE ANALYSIS OF MACHINE LEARNING ALGORITHMS /
_cBANDAR KHALED MOHAMMED AL-KALADI ; SUPERVISOR, ASST. PROF. DR. KIAN JAZAYERI
264 _c2024
300 _a46 sheets ;
_c30 cm
_e+1 CD ROM
336 _2rdacontent
_atext
_btxt
337 _2rdamedia
_aunmediated
_bn
338 _2rdacarrier
_avolume
_bnc
502 _aThesis (MSc) - Cyprus International University. Institute of Graduate Studies and Research Information Technologies
520 _aSentiment analysis on platforms along the lines of Twitter provides actual perception into public opinions and trends. This study evaluated the performance of one deep learning model, the Gated Recurrent Unit, and three machine learning algorithms, Support Vector Machine, Random Forest, and Naive Bayes, in differentiating between normal and harmful tweets. The dataset, sourced from Twitter, goes through preprocessing in order to remove noise, including user mentions, hashtags, links, and stop words. The cleaned dataset was split into training and testing sets in order to train each classifier. Performance metrics such as accuracy, precision, recall, F1-score, and confusion matrices were used in order to evaluate each model. An accuracy of 92.02% was accomplished by Naive Bayes, with normal tweets possessing a precision of 0.96 and recall of 0.88, and harmful tweets possessing a precision of 0.89 and recall of 0.96. Random Forest accomplished the highest accuracy at 97.28%, with normal tweets possessing a precision of 0.98 and recall of 0.97, and harmful tweets possessing a precision of 0.97 and recall of 0.98. Support Vector Machine (SVM) accomplished an accuracy of 96.92%, with normal tweets having a precision of 0.96 and recall of 0.98, and harmful tweets possessing a precision of 0.98 and recall of 0.96. The Gated Recurrent Unit (GRU) achieved an accuracy of 95.02%, with normal tweets having a precision of 0.98 and recall of 0.92, and harmful tweets possessing a precision of 0.92 and recall of 0.98. Each algorithm had advantages and disadvantages: Naive Bayes expressed high precision but lower recall for harmful tweets. Random Forest displayed balanced precision and recall. SVM achieved high accuracy with strong performance in both precision and recall. And GRU successfully handled sequential patterns. The findings highlight the relative advantages and limitations of deep learning and machine learning perspectives in Twitter sentiment analysis, with Random Forest and Support Vector Machine appearing as the majority of its effective among the evaluated methods.
650 0 _aInformation Technologies
_vDissertations, Academic
700 1 _aJazayeri, Kian
_esupervısor
942 _2ddc
_cTS
999 _c293078
_d293078