Back to Search Start Over

UNIFORM: Automatic Alignment of Open Learning Datasets

Authors :
Lorenzo Canale
Luca Cagliero
Laura Farinetti
Source :
COMPSAC
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Learning Analytics aims at supporting the understanding of learning mechanisms and their effects by means of data-driven strategies. LA approaches commonly face two big challenges: first, due to privacy reasons, most of the analyzed data are not in the public domain. Secondly, the open data collections, which come from diverse learning contexts, are quite heterogeneous. Therefore, the research findings are not easily reproducible and the publicly available datasets are often too small to enable further data analytics. To overcome these issues, there is an increasing need for integrating open learning data into unified models. This paper proposes UNIFORM, an open relational database integrating various learning data sources. It presents also a machine learning supported approach to automatically extending the integrated dataset as soon as new data sources become available. The proposed approach exploits a classifier to predict attribute alignments based on the correlations among the corresponding textual attribute descriptions. The integration phase has reached a promising quality level on most of the analyzed bechmark datasets. Furthermore, the usability of the UNIFORM data model has been demonstrated in a real case study, where the integrated data have been exploited to support learners' outcome prediction. The F1-score achieved on the integrated data is approximately 30% higher that those obtained on the original data.

Details

Language :
English
Database :
OpenAIRE
Journal :
COMPSAC
Accession number :
edsair.doi.dedup.....2e11ab45f4a23fc2bd6d1dbc1dda9a7d