Back to Search Start Over

Evaluating Sources of Course Information and Models of Representation on a Variety of Institutional Prediction Tasks

Authors :
Jiang, Weijie
Pardos, Zachary A.
Source :
International Educational Data Mining Society. 2020.
Publication Year :
2020

Abstract

Data mining of course enrollment and course description records has soared as institutions of higher education begin tapping into the value of these data for academic and internal research purposes. This has led to a more than doubling of papers on course prediction tasks every year. The papers often center around a single prediction task and introduce a single novel modeling approach utilizing one or two data sources. In this paper, we provide the most comprehensive evaluation to date of data sources, models, and their performance on downstream prediction tasks. We separately incorporate syllabus, catalog description, and enrollment history data to represent courses using graph embedding, course2vec (i.e., skip-gram), and classic bag-of-words models. We evaluate these representations on the tasks of predicting course prerequisites, credit equivalencies, student next semester enrollments, and student course grades. Most notably, our results show that syllabi bag-of-words representations performed better than course descriptions in predicting prerequisite relationships, though enrollment-based graph embeddings performed substantially better still. Course descriptions provided the highest single representation accuracy in predicting course similarity, with descriptions, syllabi, and course2vec combined representations providing the highest ensembled accuracy on this task. [For the full proceedings, see ED607784.]

Details

Language :
English
Database :
ERIC
Journal :
International Educational Data Mining Society
Publication Type :
Conference
Accession number :
ED607904
Document Type :
Speeches/Meeting Papers<br />Reports - Research