Academic Performance Prediction from Student–VLE Bipartite Interaction Graphs Using Centrality Features A Comparative Study with Classical Classifiers

Authors

  • Ai Irma Sumiati Universitas Amikom Purwokerto
  • Taqwa Hariguna Universitas Amikom Purwokerto
  • Azhari Shouni Barkah Universitas Amikom Purwokerto

DOI:

10.33395/sinkron.v10i1.15798

Keywords:

Educational Data Mining, Student Performance Prediction, Bipartite Graph, Centrality Features, XGBoost, Random Forest, SVM

Abstract

The rapid growth of digital learning platforms has increased the availability of student academic records and fine-grained interaction logs, creating opportunities for Educational Data Mining (EDM) to support early academic monitoring. However, many predictive models still rely mainly on individual tabular attributes and underutilize relational signals embedded in learning interactions. This study proposes a graph-mining feature approach for predicting student academic performance using a bipartite Student–VLE interaction graph. Centrality measures—degree, weighted degree, HITS hub, PageRank, and eigenvector centrality—are extracted to form a centrality feature set and combined with standard student information features. Using the public OULAD dataset, we compare three supervised classifiers: Random Forest, Support Vector Machine, and XGBoost. Experiments show that adding the centrality feature set consistently and substantially improves performance across all models compared to baseline tabular features. On the test set, XGBoost achieves the strongest results with accuracy 0.842, ROC-AUC 0.922, PR-AUC 0.902, and MCC 0.684, while Random Forest is close behind (accuracy 0.834, ROC-AUC 0.916, PR-AUC 0.894, MCC 0.672). The SVM model also benefits (accuracy 0.800, ROC-AUC 0.869, PR-AUC 0.811, MCC 0.599), confirming the robustness of the graph-derived signal. Scientifically, this study provides empirical evidence that a multi-centrality representation offers more systematic and transferable predictive value than relying on a single graph metric, across multiple classical model families under the same evaluation protocol. These findings indicate that graph-mining centrality features capture complementary structural information about learning engagement that is not represented by tabular attributes alone, and they offer a practical, interpretable enhancement to classic EDM pipelines for academic performance prediction.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Alamgir, Z., Akram, H., Karim, S., & Wali, A. (2024). Enhancing Student Performance Prediction via Educational Data Mining on Academic Data. Informatics in Education, 23(1), 1–24. https://doi.org/10.15388/infedu.2024.04

Beer, C., Jones, D., & Lawson, C. (2021). The Challenge of Learning Analytics Implementation: Lessons Learned. Ascilite Publications. https://doi.org/10.14742/apubs.2019.5

Gašević, D., Tsai, Y., Dawson, S., & Pardo, A. (2019). How Do We Start? An Approach to Learning Analytics Adoption in Higher Education. International Journal of Information and Learning Technology. https://doi.org/10.1108/ijilt-02-2019-0024

Groß, R. (2024). Stochastic Contingency Machines Feeding on Meaning: On the Computational Determination of Social Reality in Machine Learning. Ai & Society. https://doi.org/10.1007/s00146-024-02079-8

Han, J., Kamber, M., & Pei, J. (2011). Data Mining: Concepts and Techniques (3rd ed.). Morgan Kaufmann.

Ifenthaler, D., Gibson, D., & Dobozy, E. (2024). The Synergistic and Dynamic Relationship Between Learning Design and Learning Analytics. Ascilite Publications. https://doi.org/10.14742/apubs.2017.752

Ismanto, E., Ghani, H. A., Saleh, N. I. M., Al Amien, J., & Gunawan, R. (2022). Recent systematic review on student performance prediction using backpropagation algorithms. Telkomnika (Telecommunication Computing Electronics and Control), 20(3), 597–606. https://doi.org/10.12928/℡KOMNIKA.v20i3.21963

Khalil, M., Prinsloo, P., & Slade, S. (2022). The Use and Application of Learning Theory in Learning Analytics: A Scoping Review. Journal of Computing in Higher Education. https://doi.org/10.1007/s12528-022-09340-3

Knight, S., Gibson, A., & Shibani, A. (2020). Implementing Learning Analytics for Learning Impact: Taking Tools to Task. The Internet and Higher Education. https://doi.org/10.1016/j.iheduc.2020.100729

Knobbout, J., & der Stappen, E. van. (2020). A Capability Model for Learning Analytics Adoption: Identifying Organizational Capabilities From Literature on Learning Analytics, Big Data Analytics, and Business Analytics. International Journal of Learning Analytics and Artificial Intelligence for Education (Ijai). https://doi.org/10.3991/ijai.v2i1.12793

Nazir, M., Noraziah, A., Rahmah, M., Fakherldin, M., & Khawaji, A. (2025). Transforming Education with Deep Learning: A Systematic Review on Predicting Student Performance and Critical Challenges. Fusion: Practice and Applications, 18(2), 79–99. https://doi.org/10.54216/FPA.180207

Nugroho, M. W. (2025). Analisis Performa Algoritma Random Forest dalam Mengatasi Overfitting pada Model Prediksi.

Pan, J., Zhao, Z., & Han, D. (2025). Academic Performance Prediction Using Machine Learning Approaches: A Survey. IEEE Transactions on Learning Technologies, 18, 351–368. https://doi.org/10.1109/TLT.2025.3554174

Peach, R. L., Greenbury, S. F., Johnston, I. G., Yaliraki, S. N., Lefèvre, D., & Barahona, M. (2021). Understanding Learner Behaviour in Online Courses With Bayesian Modelling and Time Series Characterisation. Scientific Reports. https://doi.org/10.1038/s41598-021-81709-3

Poquet, O. (2024). A Shared Lens Around Sensemaking in Learning Analytics: What Activity Theory, Definition of a Situation and Affordances Can Offer. British Journal of Educational Technology. https://doi.org/10.1111/bjet.13435

Qiao, S. (2024). The Analysis and Prediction Mode of Students’ Academic Performance and Social Behavior in the Background of Big Data. Proceedings of 2024 3rd International Conference on Artificial Intelligence and Education (ICAIE), 648–652. https://doi.org/10.1145/3722237.3722350

Rachmatika, R., Bisri, A., Puspiptek, J. R., Pamulang, K., & Tangerang Selatan, K. (2020). JEPIN (Jurnal Edukasi dan Penelitian Informatika) Perbandingan Model Klasifikasi untuk Evaluasi Kinerja Akademik Mahasiswa. JEPIN (Jurnal Edukasi Dan Penelitian Informatika).

Salim, M., Al-Din, N., & Al Abdulqader, H. A. (2024). Students’ Academic Performance Prediction Using Educational Data Mining and Machine Learning: A Systematic Review. International Journal of Research and Innovation in Social Science. https://doi.org/10.47772/IJRISS

Sathe, M. T., & Adamuthe, A. C. (2021). Comparative study of supervised algorithms for prediction of students’ performance. International Journal of Modern Education and Computer Science, 13(1), 1–21. https://doi.org/10.5815/ijmecs.2021.01.01

Thaher, T., & Jayousi, R. (2020). Prediction of Student’s Academic Performance using Feedforward Neural Network Augmented with Stochastic Trainers. 14th IEEE International Conference on Application of Information and Communication Technologies (AICT). https://doi.org/10.1109/AICT50176.2020.9368820

Torras Virgili, M. E. (2019). Learning Analytics: Online Higher Education in Management. Sociology and Anthropology. https://doi.org/10.13189/sa.2019.070202

Yu, D., & Yan, Z. (2024). Knowledge diffusion trajectories of PageRank: A main path analysis. Journal of Information Science, 50(1), 273–287. https://doi.org/10.1177/01655515231167388

Downloads


Crossmark Updates

How to Cite

Sumiati, A. I., Hariguna, T., & Barkah, A. S. (2026). Academic Performance Prediction from Student–VLE Bipartite Interaction Graphs Using Centrality Features A Comparative Study with Classical Classifiers. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 10(1), 676-678. https://doi.org/10.33395/sinkron.v10i1.15798