Back to Search Start Over

Participation-based student final performance prediction model through interpretable Genetic Programming: Integrating learning analytics, educational data mining and theory

Authors :
Rui Guo
Sean Goggins
Eva Petakovic
Wanli Xing
Source :
Computers in Human Behavior. 47:168-181
Publication Year :
2015
Publisher :
Elsevier BV, 2015.

Abstract

A theory-based method for a computational student performance prediction.A Genetic Programming model for grade prediction is described and tested.Model evaluation suggests high success rates for predicting student grades. Building a student performance prediction model that is both practical and understandable for users is a challenging task fraught with confounding factors to collect and measure. Most current prediction models are difficult for teachers to interpret. This poses significant problems for model use (e.g. personalizing education and intervention) as well as model evaluation. In this paper, we synthesize learning analytics approaches, educational data mining (EDM) and HCI theory to explore the development of more usable prediction models and prediction model representations using data from a collaborative geometry problem solving environment: Virtual Math Teams with Geogebra (VMTwG). First, based on theory proposed by Hrastinski (2009) establishing online learning as online participation, we operationalized activity theory to holistically quantify students' participation in the CSCL (Computer-supported Collaborative Learning) course. As a result, 6 variables, Subject, Rules, Tools, Division of Labor, Community, and Object, are constructed. This analysis of variables prior to the application of a model distinguishes our approach from prior approaches (feature selection, Ad-hoc guesswork etc.). The approach described diminishes data dimensionality and systematically contextualizes data in a semantic background. Secondly, an advanced modeling technique, Genetic Programming (GP), underlies the developed prediction model. We demonstrate how connecting the structure of VMTwG trace data to a theoretical framework and processing that data using the GP algorithmic approach outperforms traditional models in prediction rate and interpretability. Theoretical and practical implications are then discussed.

Details

ISSN :
07475632
Volume :
47
Database :
OpenAIRE
Journal :
Computers in Human Behavior
Accession number :
edsair.doi...........de85995dcc3a933bd40969e2247c1fc4
Full Text :
https://doi.org/10.1016/j.chb.2014.09.034