The role of machine learning in identifying students at-risk and minimizing failure

dc.authorid: 0000-0003-1712-1581
dc.authorid: 0000-0001-6657-9738
dc.contributor.author: Pek, Reyhan Zeynep
dc.contributor.author: Tarıyan Özyer, Sibel
dc.contributor.author: Elhage, Tarek
dc.contributor.author: Özyer, Tansel
dc.contributor.author: Alhajj, Reda
dc.date.accessioned: 2023-02-02T07:12:38Z
dc.date.available: 2023-02-02T07:12:38Z
dc.date.issued: 2023
dc.department: Istanbul Medipol University, Faculty of Engineering and Natural Sciences, Department of Computer Engineering
dc.description.abstract: Education is central to students' future success. Instructors can support low-performing students with extra assignments and projects, but a major problem is that at-risk students cannot be identified early. Various researchers are investigating this problem using machine learning techniques, which are already applied in many areas and have begun to be used to identify at-risk students early so that instructors can provide support. This paper discusses the performance results obtained with machine learning algorithms for identifying at-risk students and minimizing student failure. The main purpose of the project is to create a hybrid model using the ensemble stacking method and to use it to predict at-risk students. We used the following machine learning algorithms: Naive Bayes, Random Forest, Decision Tree, K-Nearest Neighbors, Support Vector Machine, AdaBoost Classifier, and Logistic Regression. The performance of each algorithm was measured with various metrics, and the hybrid model presented in this study combines the algorithms that gave the best prediction results. A data set containing students' demographic and academic information was used to train and test the model. In addition, the paper presents a web application developed to make the hybrid model easy to use and to obtain prediction results. In the proposed method, stratified k-fold cross-validation and hyperparameter optimization were found to increase the performance of the models. The hybrid ensemble model was tested with two different combinations of the data to understand the importance of the data features. In the first combination, using both demographic and academic data, the hybrid model achieved an accuracy of 94.8%. In the second combination, using only academic data, its accuracy increased to 98.4%. This study focuses on predicting the performance of at-risk students early so that teachers can provide extra assistance to students with low performance.
dc.identifier.citation: Pek, R. Z., Tarıyan Özyer, S., Elhage, T., Özyer, T. and Alhajj, R. (2023). The role of machine learning in identifying students at-risk and minimizing failure. IEEE Access, 11, 1224-1243. https://dx.doi.org/10.1109/ACCESS.2022.3232984
dc.identifier.doi: 10.1109/ACCESS.2022.3232984
dc.identifier.endpage: 1243
dc.identifier.issn: 2169-3536
dc.identifier.scopus: 2-s2.0-85146240510
dc.identifier.scopusquality: Q1
dc.identifier.startpage: 1224
dc.identifier.uri: https://dx.doi.org/10.1109/ACCESS.2022.3232984
dc.identifier.uri: https://hdl.handle.net/20.500.12511/10389
dc.identifier.volume: 11
dc.identifier.wos: 000910204900001 [en_US]
dc.identifier.wosquality: Q2
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: Scopus
dc.institutionauthor: Pek, Reyhan Zeynep
dc.institutionauthor: Alhajj, Reda
dc.language.iso: en
dc.publisher: IEEE-Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof: IEEE Access [en_US]
dc.relation.publicationcategory: Article - International Peer-Reviewed Journal - Institutional Faculty Member
dc.relation.tubitak: info:eu-repo/grantAgreement/TUBITAK/SOBAG/2209-A
dc.rights: Attribution 4.0 International [*]
dc.rights: info:eu-repo/semantics/openAccess
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [*]
dc.subject: At-Risk Students
dc.subject: Classification
dc.subject: Dropout Prediction
dc.subject: Hybrid Model
dc.subject: Machine Learning Techniques
dc.subject: Stacking Ensemble Model
dc.subject: Student Performance Prediction
dc.title: The role of machine learning in identifying students at-risk and minimizing failure
dc.type: Article
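
The abstract credits stratified k-fold cross-validation with improving model performance. The sketch below illustrates what "stratified" means: each test fold keeps roughly the same share of at-risk students as the full data set. It is a minimal standard-library illustration, not the authors' implementation; the function name stratified_k_fold, the 0/1 label encoding, and the toy label list are all assumptions made for this example.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Yield (train_idx, test_idx) pairs whose test folds keep class proportions.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Shuffle within each class, then deal indices round-robin into k folds
    # so every fold receives a near-equal share of each class.
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)

    # Each fold serves once as the test set; the rest form the training set.
    for i in range(k):
        test_idx = sorted(folds[i])
        train_idx = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train_idx, test_idx


# Assumed toy labels: 1 = at-risk, 0 = not at-risk (20% at-risk overall).
labels = [1] * 10 + [0] * 40
splits = list(stratified_k_fold(labels, k=5))

for train_idx, test_idx in splits:
    at_risk = sum(labels[i] for i in test_idx)
    print(f"test size={len(test_idx)}, at-risk in test={at_risk}")
```

With these toy labels every test fold holds 10 students, 2 of them at-risk, matching the 20% overall rate. A plain random split could by chance leave a fold with no at-risk students, which would make per-fold metrics for the minority class unreliable.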

Files

Original bundle
Name: Pek-Reyhan-2023.pdf
Size: 3.73 MB
Format: Adobe Portable Document Format
Description: Full Text