Back to Search
Start Over
Analysis validation has been neglected in the Age of Reproducibility
- Source :
- PLoS Biology, PLoS Biology, Vol 16, Iss 12, p e3000070 (2018)
- Publication Year :
- 2018
- Publisher :
- Public Library of Science, 2018.
-
Abstract
- Increasingly complex statistical models are being used for the analysis of biological data. Recent commentary has focused on the ability to compute the same outcome for a given dataset (reproducibility). We argue that a reproducible statistical analysis is not necessarily valid because of unique patterns of nonindependence in every biological dataset. We advocate that analyses should be evaluated with known-truth simulations that capture biological reality, a process we call “analysis validation.” We review the process of validation and suggest criteria that a validation project should meet. We find that different fields of science have historically failed to meet all criteria, and we suggest ways to implement meaningful validation in training and practice.<br />Just as we do controls for experiments we should all do controls for data analysis – this is easy to say but requires dedication to implement. This Essay explains the need for analysis validation and provides specific suggestions for how to get started.
- Subjects :
- 0301 basic medicine
Computer and Information Sciences
Process (engineering)
QH301-705.5
Essay
Biology
Research and Analysis Methods
General Biochemistry, Genetics and Molecular Biology
Machine Learning
03 medical and health sciences
Data visualization
Artificial Intelligence
Genetics
Humans
Statistical analysis
Biology (General)
Statistical Data
Reproducibility
Biological data
Models, Statistical
General Immunology and Microbiology
business.industry
Statistical Models
General Neuroscience
Simulation and Modeling
Data Visualization
Statistics
Data interpretation
Biology and Life Sciences
Computational Biology
Reproducibility of Results
Statistical model
Genomics
Research Assessment
Genome Analysis
Data science
030104 developmental biology
Data Interpretation, Statistical
Physical Sciences
General Agricultural and Biological Sciences
business
Mathematics
Subjects
Details
- Language :
- English
- ISSN :
- 15457885 and 15449173
- Volume :
- 16
- Issue :
- 12
- Database :
- OpenAIRE
- Journal :
- PLoS Biology
- Accession number :
- edsair.doi.dedup.....69d9ded68725053eb3ff311c30700e92