1. A large-scale dataset of single and mixed-source short tandem repeat profiles to inform human identification strategies: PROVEDIt
- Author
-
Lauren E. Alfonse, Ken R. Duffy, Desmond S. Lun, Catherine M. Grgicak, and Amanda D. Garrett
- Subjects
Forensic Genetics ,0301 basic medicine ,Genotype ,STR multiplex system ,Datasets as Topic ,Computational biology ,Biology ,Polymerase Chain Reaction ,Pathology and Forensic Medicine ,03 medical and health sciences ,Forensic dna ,0302 clinical medicine ,Genetics ,Humans ,Multiplex ,030216 legal & forensic medicine ,Alleles ,social sciences ,Human identity ,DNA Fingerprinting ,Identification (information) ,030104 developmental biology ,Microsatellite ,Scale (map) ,DNA Damage ,Microsatellite Repeats - Abstract
DNA-based human identity testing is conducted by comparison of PCR-amplified polymorphic Short Tandem Repeat (STR) motifs from a known source with the STR profiles obtained from uncertain sources. Samples such as those found at crime scenes often result in signal that is a composite of incomplete STR profiles from an unknown number of unknown contributors, making interpretation an arduous task. To facilitate advancement in STR interpretation challenges we provide over 25,000 multiplex STR profiles produced from one to five known individuals at target levels ranging from one to 160 copies of DNA. The data, generated under 144 laboratory conditions, are classified by total copy number and contributor proportions. For the 70% of samples that were synthetically compromised, we report the level of DNA damage using quantitative and end-point PCR. In addition, we characterize the complexity of the signal by exploring the number of detected alleles in each profile.
- Published
- 2018
- Full Text
- View/download PDF