Back to Search
Start Over
High density LD-based structural variations analysis in cattle genome
- Source :
- PLoS ONE, Vol 9, Iss 7, p e103046 (2014), PLoS ONE
- Publication Year :
- 2014
- Publisher :
- Public Library of Science (PLoS), 2014.
-
Abstract
- Genomic structural variations represent an important source of genetic variation in mammal genomes, thus, they are commonly related to phenotypic expressions. In this work, ∼770,000 single nucleotide polymorphism genotypes from 506 animals from 19 cattle breeds were analyzed. A simple LD-based structural variation was defined, and a genome-wide analysis was performed. After applying some quality control filters, for each breed and each chromosome we calculated the linkage disequilibrium (r 2) of short range (≤100 Kb). We sorted SNP pairs by distance and obtained a set of LD means (called the expected means) using bins of 5 Kb. We identified 15,246 segments of at least 1 Kb, among the 19 breeds, consisting of sets of at least 3 adjacent SNPs so that, for each SNP, r 2 within its neighbors in a 100 Kb range, to the right side of that SNP, were all bigger than, or all smaller than, the corresponding expected mean, and their P-value were significant after a Benjamini-Hochberg multiple testing correction. In addition, to account just for homogeneously distributed regions we considered only SNPs having at least 15 SNP neighbors within 100 Kb. We defined such segments as structural variations. By grouping all variations across all animals in the sample we defined 9,146 regions, involving a total of 53,137 SNPs; representing the 6.40% (160.98 Mb) from the bovine genome. The identified structural variations covered 3,109 genes. Clustering analysis showed the relatedness of breeds given the geographic region in which they are evolving. In summary, we present an analysis of structural variations based on the deviation of the expected short range LD between SNPs in the bovine genome. With an intuitive and simple definition based only on SNPs data it was possible to discern closeness of breeds due to grouping by geographic region in which they are evolving.
- Subjects :
- Male
Linkage disequilibrium
Heredity
Genetic Linkage
Text Mining
Microarrays
lcsh:Medicine
Breeding
Biochemistry
Genome
Linkage Disequilibrium
Gene Frequency
Natural Selection
Copy-number variation
lcsh:Science
Genetics
Numerical Analysis
Multidisciplinary
Tag SNP
SNP genotyping
Bovine genome
Bioassays and Physiological Analysis
Physical Sciences
Female
Information Technology
Sequence Analysis
Statistics (Mathematics)
Research Article
Computer and Information Sciences
Evolutionary Processes
Genotype
Single-nucleotide polymorphism
Biostatistics
Biology
Research and Analysis Methods
Polymorphism, Single Nucleotide
Structural variation
Confidence Intervals
Animals
Statistical Methods
Molecular Biology Techniques
Sequencing Techniques
Molecular Biology
Evolutionary Biology
Biology and life sciences
lcsh:R
DNA structure
Computational Biology
DNA
Haplotypes
Mutation
Cattle
Veterinary Science
lcsh:Q
Animal Genetics
Population Genetics
Mathematics
Genome-Wide Association Study
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 9
- Issue :
- 7
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....548b4ca10f1ef1ba43bf5cfc2bdbad72