Shapley values to explain machine learning models of school student’s academic performance during COVID-19

Shapley values to explain machine learning models of school student’s academic performance during COVID-19

Descargar PDF Descargar PDF

Publicado en 3C TIC – Volume 11 Issue 2 (Ed. 41)

Autores

Resumen

Abstract

In this work we perform an analysis of distance learning format influence, caused by COVID-19 pandemic on school students’ academic performance. This study is based on a large dataset consisting of school students grades for 2020 academic year taken from “Electronic education in Tatarstan Republic” system. The analysis is based on the use of machine learning methods and feature importance technique realized by using Python programming language. One of the priorities of this work is to identify the academic factors causing the most sensitive impact on school students’ performance. In this work we used the Shapley values method for solving this task. This method is widely used for the feature importance estimation task and can evaluate impact of every studied feature on the output of machine learning models. The study-related conditional factors include characteristics of teachers, types and kinds of educational organization, area of their location and subjects for which marks were obtained.

Artículo

Palabras clave

Keywords

Data Science, Python, education, Machine Learning, Feature Importance.

Articulos relacionados