Back to Search
Start Over
Integrating Big Data Into Evaluation: R Code for Topic Identification and Modeling
- Source :
- American Journal of Evaluation. 43:412-436
- Publication Year :
- 2021
- Publisher :
- SAGE Publications, 2021.
-
Abstract
- Despite the rising popularity of big data, there is speculation that evaluators have been slow adopters of these new statistical approaches. Several possible reasons have been offered for why this is the case: ethical concerns, institutional capacity, and evaluator capacity and values. In this method note, we address one of these barriers and aim to build evaluator capacity to integrate big data analytics into their studies. We focus our efforts on a specific topic modeling technique referred to as latent Dirichlet allocation (LDA) because of the ubiquitousness of qualitative textual data in evaluation. Given current equity debates, both within evaluation and the communities in which we practice, we analyze 1,796 tweets that use the hashtag #equity with the R packages topicmodels and ldatuning to illustrate the use of LDA. Furthermore, a freely available workbook for implementing LDA topic modeling is provided as Supplemental Material Online.
- Subjects :
- Topic model
Health (social science)
Sociology and Political Science
Social Psychology
Computer science
business.industry
Strategy and Management
Big data
Popularity
Data science
Latent Dirichlet allocation
Education
Identification (information)
symbols.namesake
Evaluation theory
symbols
Business and International Management
Speculation
business
Subjects
Details
- ISSN :
- 15570878 and 10982140
- Volume :
- 43
- Database :
- OpenAIRE
- Journal :
- American Journal of Evaluation
- Accession number :
- edsair.doi...........dc255e7b93cfeab271d886c45bda5863