A Hybrid Machine Learning Approach for Predicting Student Performance Using Multi-class Educational Datasets

Research output: Contribution to JournalArticlepeer-review

Abstract

Prediction of students’ academic performance has garnered considerable interest, with many institutions seek to enhance students’ performance and their quality of education. The integration of both unsupervised and supervised machine learning techniques has demonstrated significant efficacy in predicting student performance. This paper explores the application of different machine learn- ing methods in predicting student academic performance. Initially, Principal Component Analysis (PCA) was utilised to reduce the dataset’s dimensionality, thereby improving its visualisation. Subsequently, K-Means clustering was employed to segregate students into distinct groups, reflective of their learning behaviors. Afterwards, the observed clusters were utilised for training classification models to address each student cluster individually. This approach was implemented in a case study involving an un- dergraduate science course at a North American University (NAU) and the Open University Learning Analytics Dataset (OULAD). Empirical findings indicate that the combined use of Feedforward Dense Network (FDN), Random Forest (RF), and Decision Tree (DT), specifically in their clustered forms, outperforms other classifiers in predicting student academic performance effectively.
Original languageEnglish
Pages (from-to)888-895
Number of pages8
JournalProcedia Computer Science
Volume238
DOIs
Publication statusPublished - 8 Jul 2024

Keywords

  • Hybrid Machine Learning
  • Principal Component Analysis (PCA)
  • K-Means Clustering
  • Student Performance

Fingerprint

Dive into the research topics of 'A Hybrid Machine Learning Approach for Predicting Student Performance Using Multi-class Educational Datasets'. Together they form a unique fingerprint.

Cite this