Back to Search Start Over

The Promise and Limitations of Synthetic Data as a Strategy to Expand Access to State-Level Multi-Agency Longitudinal Data

Authors :
Bonnéry, Daniel
Feng, Yi
Henneberger, Angela K.
Johnson, Tessa L.
Lachowicz, Mark
Rose, Bess A.
Shaw, Terry
Stapleton, Laura M.
Woolley, Michael E.
Zheng, Yating
Source :
Journal of Research on Educational Effectiveness. 2019 12(4):616-647.
Publication Year :
2019

Abstract

There is demand among policy-makers for the use of state education longitudinal data systems, yet laws and policies regulating data disclosure limit access to such data, and security concerns and risks remain high. Well-developed synthetic datasets that statistically mimic the relations among the variables in the data from which they were derived, but which contain no records that represent actual persons, present a viable solution to these laws, policies, concerns, and risks. We present a case study in the development of a synthetic data system and highlight potential applications of synthetic data. We begin with an overview of synthetic data, what it is, how it has been utilized thus far, and the potential benefits and concerns in its application to education data systems. We then describe our federally-funded project, proposing the steps required to synthesize a statewide longitudinal data system covering high school, postsecondary, and workforce data. Last, for use as a template for other agencies considering synthetic data, we review the challenges we have confronted in the development of our synthetic data system for research and policy evaluation purposes.

Details

Language :
English
ISSN :
1934-5747
Volume :
12
Issue :
4
Database :
ERIC
Journal :
Journal of Research on Educational Effectiveness
Publication Type :
Academic Journal
Accession number :
EJ1236531
Document Type :
Journal Articles<br />Reports - Descriptive
Full Text :
https://doi.org/10.1080/19345747.2019.1631421