1. rehh 2.0: a reimplementation of the R package rehh to detect positive selection from haplotype structure.
- Author
-
Gautier, Mathieu, Klassmann, Alexander, and Vitalis, Renaud
- Subjects
- *
R (Computer program language) , *HAPLOTYPES , *BIG data , *HOMOZYGOSITY , *FOOTPRINTS , *NUCLEOTIDE sequencing - Abstract
Identifying genomic regions with unusually high local haplotype homozygosity represents a powerful strategy to characterize candidate genes responding to natural or artificial positive selection. To that end, statistics measuring the extent of haplotype homozygosity within (e.g. EHH, iHS) and between (Rsb or XP- EHH) populations have been proposed in the literature. The rehh package for r was previously developed to facilitate genome-wide scans of selection, based on the analysis of long-range haplotypes. However, its performance was not sufficient to cope with the growing size of available data sets. Here, we propose a major upgrade of the rehh package, which includes an improved processing of the input files, a faster algorithm to enumerate haplotypes, as well as multithreading. As illustrated with the analysis of large human haplotype data sets, these improvements decrease the computation time by more than one order of magnitude. This new version of rehh will thus allow performing iHS-, Rsb- or XP- EHH-based scans on large data sets. The package rehh 2.0 is available from the CRAN repository () together with help files and a detailed manual. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF