Methods of Balancing Model Explainability and Performance in Identifying At-Risk Students

Authors

  • Tiffany T.Y. HSU International College of Innovation, National Chengchi University Author
  • Brendan FLANAGAN Center for Innovative Research and Education in Data Science, Kyoto University Author
  • Owen H.T. LU International College of Innovation, National Chengchi University Author

DOI:

https://doi.org/10.58459/icce.2024.4972

Abstract

This study will explore and experiment with various combinations of methods to handle data imbalance in order to address the common issue of insufficient minority samples in at-risk student prediction. Additionally, we will examine the purpose of applying computer tools to educational issues and emphasize the necessity of adhering to models with high transparency and explainability, ensuring that the decision-making process can be transparent and comprehensive in the context of learning analytics. After comparing model performance, we selected the logistic regression model combined with correlation analysis and threshold adjustment, which showed outstanding performance in UAR, G-means, and other evaluation metrics. We will analyze the reasons behind students' academic performance based on the feature importance ranking from the model, thereby establishing a high-performance and high- transparency benchmark model for the LBLS593 dataset.

Downloads

Download data is not yet available.

Downloads

Published

2024-11-25

How to Cite

Methods of Balancing Model Explainability and Performance in Identifying At-Risk Students. (2024). International Conference on Computers in Education. https://doi.org/10.58459/icce.2024.4972