parallelnewhybrid: an R package for the parallelization of hybrid detection using newhybrids
- Source :
- Molecular Ecology Resources. 17:91-95
- Publication Year :
- 2016
- Publisher :
- Wiley, 2016.
- Abstract :
- Hybridization among populations and species is a central theme in many areas of biology, and the study of hybridization has direct applicability to testing hypotheses about evolution, speciation and genetic recombination, as well as having conservation, legal and regulatory implications. Yet, despite being a topic of considerable interest, the identification of hybrid individuals, and quantification of the (un)certainty surrounding the identifications, remains difficult. Unlike other programs that exist to identify hybrids based on genotypic information, newhybrids is able to assign individuals to specific hybrid classes (e.g. F1, F2) because it makes use of patterns of gene inheritance within each locus, rather than just the proportions of gene inheritance within each individual. For each comparison and set of markers, multiple independent runs of each data set should be used to develop an estimate of the hybrid class assignment accuracy. The necessity of analysing multiple simulated data sets, constructed from large genomewide data sets, presents significant computational challenges. To address these challenges, we present parallelnewhybrid, an R package designed to decrease user burden when undertaking multiple newhybrids analyses. parallelnewhybrid does so by taking advantage of the parallel computational capabilities inherent in modern computers to efficiently and automatically execute separate newhybrids runs in parallel. We show that parallelization of analyses using this package affords users several-fold reductions in time over a traditional serial analysis. parallelnewhybrid consists of an example data set, a README and three operating system-specific functions to execute parallel newhybrids analyses on each of a computer's c cores. parallelnewhybrid is freely available on the long-term software hosting site GitHub (www.github.com/bwringe/parallelnewhybrid).
- Subjects :
- 0106 biological sciences
0301 basic medicine
Chimera
business.industry
Population structure
Computational Biology
Parallel computing
Biostatistics
Biology
010603 evolutionary biology
01 natural sciences
Data set
Population genomics
03 medical and health sciences
R package
Genetics, Population
030104 developmental biology
Software
Simulated data
README
Genetics
business
Ecology, Evolution, Behavior and Systematics
Biotechnology
Details
- ISSN :
- 1755-098X
- Volume :
- 17
- Database :
- OpenAIRE
- Journal :
- Molecular Ecology Resources
- Accession number :
- edsair.doi.dedup.....7e2e62a7bc36708785db3998ceb97ec1
- Full Text :
- https://doi.org/10.1111/1755-0998.12597