Detecting spam tweets using machine learning and effective preprocessing

Kardaş, Berk; Bayar, İsmail Erdem; Özyer, Tansel; Alhajj, Reda

Detecting spam tweets using machine learning and effective preprocessing

dc.contributor.author	Kardaş, Berk
dc.contributor.author	Bayar, İsmail Erdem
dc.contributor.author	Özyer, Tansel
dc.contributor.author	Alhajj, Reda
dc.date.accessioned	2022-02-28T11:24:19Z
dc.date.available	2022-02-28T11:24:19Z
dc.date.issued	2021
dc.department	İstanbul Medipol Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstract	Nowadays, with the rapid increase in popularity of online social networks (OSNs), these platforms are realized as ideal places for spammers. Unfortunately, these spammers can easily publish malicious content, advertise phishing scams by taking advantage of OSNs. Therefore, effective identification and filtering of spam tweets will be beneficial to both OSNs and users. However, it is becoming increasingly difficult to check and eliminate spam tweets due to this great flow of posts. Motivated by these observations, in this paper we propose an approach for the detection of spam tweets using machine learning and effective preprocessing techniques. The approach proposes the advantages of the preprocessing and which of these preprocessing techniques are the most effective. To compare these techniques UtkML Twitter spam dataset is used in testing. After the most effective methods determined, the detection accuracy of the spam tweets will be better optimized by combining them. We have evaluated our solution with four different machine learning algorithms namely - Naïve Bayes Classifier, Neural Network, Logistic Regression and Support Vector Machine. With SVM Classifier, we are able to achieve an accuracy of 93.02%. Experimental results show that our approach can improve the performance of spam tweet classification effectively.
dc.description.sponsorship	ACM Special Interest Group on Knowledge Discovery in Data (SIGKDD) ; Elsevier ; IEEE Computer Society ; IEEE TCDE ; Springer	en_US
dc.identifier.citation	Kardaş, B., Bayar, İ. E., Özyer, T. ve Alhajj, R. (2021). Detecting spam tweets using machine learning and effective preprocessing. 13th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM içinde (393-398. ss.). Virtual, Online, 8 November 2021. https://doi.org/10.1145/3487351.3490968
dc.identifier.doi	10.1145/3487351.3490968
dc.identifier.endpage	398
dc.identifier.isbn	9781450391283
dc.identifier.scopus	2-s2.0-85124395764
dc.identifier.scopusquality	N/A
dc.identifier.startpage	393
dc.identifier.uri	https://doi.org/10.1145/3487351.3490968
dc.identifier.uri	https://hdl.handle.net/20.500.12511/9025
dc.indekslendigikaynak	Scopus
dc.institutionauthor	Özyer, Tansel
dc.language.iso	en
dc.publisher	Association for Computing Machinery, Inc
dc.relation.ispartof	13th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Machine Learning
dc.subject	Preprocessing
dc.subject	Social Media
dc.subject	Spam Detection
dc.subject	Twitter
dc.title	Detecting spam tweets using machine learning and effective preprocessing
dc.type	Conference Object

Dosyalar

Lisans paketi

Listeleniyor 1 - 1 / 1

İsim:: license.txt
Boyut:: 1.44 KB
Biçim:: Item-specific license agreed upon to submission
Açıklama:

İndir

Koleksiyon

Bildiri Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu