1. Prediction of breed composition in an admixed cattle population
- Author
U. Schnyder, A. Frkonja, Birgit Gredler, Ino Curik, and Johann Sölkner
- Subjects
Genetic Markers; Genotype; Population; Single-nucleotide polymorphism; Feature selection; admixture; breed composition; pedigree; single nucleotide polymorphisms; Swiss Fleckvieh; Biology; Polymorphism, Single Nucleotide; Correlation; Lasso (statistics); Partial least squares regression; Genetics; Animals; Selection, Genetic; Allele frequency; Genome; Models, Genetic; General Medicine; Markov Chains; Breed; Pedigree; Hybridization, Genetic; Cattle; Animal Science and Zoology; Algorithms
- Abstract
Swiss Fleckvieh was established in 1970 as a composite of Simmental (SI) and Red Holstein Friesian (RHF) cattle. Breed composition is currently reported based on pedigree information, but information on a large number of molecular markers could potentially provide more accurate estimates. For the analysis, we used Illumina BovineSNP50 Genotyping BeadChip data for 90 pure SI, 100 pure RHF and 305 admixed bulls. The aim of the study was to compare the performance of hidden Markov models, as implemented in the STRUCTURE software, with methods conventionally used in genomic selection [BayesB, partial least squares regression (PLSR) and least absolute shrinkage and selection operator (LASSO) variable selection] for predicting breed composition. We checked the performance of the algorithms for a set of 40 492 single nucleotide polymorphisms (SNPs), for subsets of evenly distributed SNPs and for subsets of SNPs with different allele frequencies in the pure populations, using F_ST as an indicator. The key results are the correlations between admixture levels estimated with the various algorithms and admixture based on pedigree information. For the full set, PLSR, BayesB and STRUCTURE performed very similarly (correlations of 0.97), whereas the correlation between LASSO and pedigree admixture was lower (0.93). As the number of SNPs decreased, correlations dropped substantially only when 5% or 1% of all SNPs were used. With SNPs chosen according to F_ST, results were similar to those obtained with the full set. Only when using the 96 and 48 SNPs with the highest F_ST did correlations drop, to 0.92 and 0.90 respectively. Reducing the number of pure animals in the training sets to 50, 20 and 10 per breed did not cause a drop in the correlation with pedigree admixture.
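The F_ST-based SNP selection and the correlation check described in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: the F_ST formula (Nei's form for two equally weighted populations), the least-squares admixture estimator (swapped in for STRUCTURE, BayesB, PLSR and LASSO), the simulated allele frequencies, and all function names are assumptions made for illustration. In a real analysis the allele frequencies would be estimated from the pure SI and RHF animals rather than taken as known.

```python
import math
import random


def fst(p1, p2):
    """Nei's F_ST for one biallelic SNP in two equally weighted populations."""
    p_bar = (p1 + p2) / 2
    h_total = 2 * p_bar * (1 - p_bar)  # expected heterozygosity, pooled
    if h_total == 0:
        return 0.0  # SNP is fixed for the same allele in both populations
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    return (h_total - h_within) / h_total


def top_fst_snps(freq_a, freq_b, k):
    """Indices of the k SNPs with the highest F_ST between the two populations."""
    scores = [fst(a, b) for a, b in zip(freq_a, freq_b)]
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return order[:k]


def estimate_admixture(genotype, freq_a, freq_b, snps):
    """Least-squares share of population B ancestry, clipped to [0, 1].

    Assumes E[genotype / 2] = (1 - alpha) * freq_a + alpha * freq_b per SNP.
    """
    num = 0.0
    den = 0.0
    for j in snps:
        diff = freq_b[j] - freq_a[j]
        num += (genotype[j] / 2 - freq_a[j]) * diff
        den += diff * diff
    if den == 0:
        return 0.0  # selected SNPs carry no information on ancestry
    return min(1.0, max(0.0, num / den))


def simulate_genotype(rng, alpha, freq_a, freq_b):
    """Allele counts (0, 1 or 2) for an animal with ancestry share alpha from B."""
    genotype = []
    for a, b in zip(freq_a, freq_b):
        count = 0
        for _ in range(2):  # two allele copies per SNP
            p = b if rng.random() < alpha else a
            count += int(rng.random() < p)
        genotype.append(count)
    return genotype


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: 2000 SNPs and 50 admixed animals (not the study's data).
rng = random.Random(1)
n_snps = 2000
freq_si = [rng.uniform(0.05, 0.95) for _ in range(n_snps)]
freq_rhf = [rng.uniform(0.05, 0.95) for _ in range(n_snps)]
true_alpha = [rng.random() for _ in range(50)]

selected = top_fst_snps(freq_si, freq_rhf, 96)
estimates = [
    estimate_admixture(
        simulate_genotype(rng, alpha, freq_si, freq_rhf), freq_si, freq_rhf, selected
    )
    for alpha in true_alpha
]
r = pearson(true_alpha, estimates)
print(f"correlation with simulated admixture using 96 SNPs: {r:.3f}")
```

In this sketch the simulated ancestry share plays the role that pedigree admixture plays in the study; changing the `96` passed to `top_fst_snps` shows how the correlation responds to panel size under these simplified assumptions.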
- Published
- 2012