1. Unravelling population structure heterogeneity within the genome of the malaria vector Anopheles gambiae
- Author
-
Luisa D. P. Rona, George K. Christophides, Katie Willis, Melina Campos, Robert M. MacCallum, Wellcome Trust, National Institutes of Health, and The Royal Society
- Subjects
Genome evolution ,Species complex ,Bioinformatics ,Population genetics ,Anopheles gambiae ,Population ,Mosquito Vectors ,T-SNE ,QH426-470 ,Genome ,Gene flow ,03 medical and health sciences ,0302 clinical medicine ,Anopheles ,parasitic diseases ,Visualization method ,Genetics ,Animals ,Guinea-Bissau ,education ,11 Medical and Health Sciences ,030304 developmental biology ,Islands ,0303 health sciences ,education.field_of_study ,biology ,06 Biological Sciences ,biology.organism_classification ,Whole-genome analysis ,Malaria ,Evolutionary biology ,Africa ,08 Information and Computing Sciences ,Chromosomal inversions ,030217 neurology & neurosurgery ,TP248.13-248.65 ,Research Article ,Biotechnology - Abstract
Background Whole genome re-sequencing provides powerful data for population genomic studies, allowing robust inferences of population structure, gene flow and evolutionary history. For the major malaria vector in Africa, Anopheles gambiae, other genetic aspects such as selection and adaptation are also important. In the present study, we explore population genetic variation from genome-wide sequencing of 765 An. gambiae and An. coluzzii specimens collected from across Africa. We used t-SNE, a recently popularized dimensionality reduction method, to create a 2D-map of An. gambiae and An. coluzzii genes that reflect their population structure similarities. Results The map allows intuitive navigation among genes distributed throughout the so-called “mainland” and numerous surrounding “island-like” gene clusters. These gene clusters of various sizes correspond predominantly to low recombination genomic regions such as inversions and centromeres, and also to recent selective sweeps. Because this mosquito species complex has been studied extensively, we were able to support our interpretations with previously published findings. Several novel observations and hypotheses are also made, including selective sweeps and a multi-locus selection event in Guinea-Bissau, a known intense hybridization zone between An. gambiae and An. coluzzii. Conclusions Our results present a rich dataset that could be utilized in functional investigations aiming to shed light onto An. gambiae s.l genome evolution and eventual speciation. In addition, the methodology presented here can be used to further characterize other species not so well studied as An. gambiae, shortening the time required to progress from field sampling to the identification of genes and genomic regions under unique evolutionary processes.
- Published
- 2021