1. The complex genetic architecture of the metabolome
- Author
-
Eva K. F. Chan, Heather C. Rowe, Daniel J. Kliebenstein, Bjarne Gram Hansen, and Copenhaver, Gregory P
- Subjects
0106 biological sciences ,Cancer Research ,lcsh:QH426-470 ,Population ,Arabidopsis ,Inheritance Patterns ,Genome-wide association study ,Single-nucleotide polymorphism ,Quantitative trait locus ,Biology ,Genetics and Genomics/Complex Traits ,01 natural sciences ,Genetics and Genomics/Plant Genetics and Gene Expression ,Linkage Disequilibrium ,Plant Biology/Plant Biochemistry and Physiology ,Evolutionary Biology/Plant Genomes and Evolution ,03 medical and health sciences ,Computational Biology/Metabolic Networks ,Genetic variation ,Metabolome ,Genetics ,2.1 Biological and endogenous factors ,Aetiology ,education ,Molecular Biology ,Genetics (clinical) ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Genetic association ,Nutrition ,2. Zero hunger ,0303 health sciences ,education.field_of_study ,Cell Biology/Plant Cell Biology ,Human Genome ,Chromosome Mapping ,Chemical Biology/Chemical Biology of the Cell ,Genetic architecture ,lcsh:Genetics ,Biochemistry/Bioinformatics ,Metabolic Networks and Pathways ,010606 plant biology & botany ,Research Article ,Genome-Wide Association Study ,Developmental Biology - Abstract
Discovering links between the genotype of an organism and its metabolite levels can increase our understanding of metabolism, its controls, and the indirect effects of metabolism on other quantitative traits. Recent technological advances in both DNA sequencing and metabolite profiling allow the use of broad-spectrum, untargeted metabolite profiling to generate phenotypic data for genome-wide association studies that investigate quantitative genetic control of metabolism within species. We conducted a genome-wide association study of natural variation in plant metabolism using the results of untargeted metabolite analyses performed on a collection of wild Arabidopsis thaliana accessions. Testing 327 metabolites against >200,000 single nucleotide polymorphisms identified numerous genotype–metabolite associations distributed non-randomly within the genome. These clusters of genotype–metabolite associations (hotspots) included regions of the A. thaliana genome previously identified as subject to recent strong positive selection (selective sweeps) and regions showing trans-linkage to these putative sweeps, suggesting that these selective forces have impacted genome-wide control of A. thaliana metabolism. Comparing the metabolic variation detected within this collection of wild accessions to a laboratory-derived population of recombinant inbred lines (derived from two of the accessions used in this study) showed that the higher level of genetic variation present within the wild accessions did not correspond to higher variance in metabolic phenotypes, suggesting that evolutionary constraints limit metabolic variation. While a major goal of genome-wide association studies is to develop catalogues of intraspecific variation, the results of multiple independent experiments performed for this study showed that the genotype–metabolite associations identified are sensitive to environmental fluctuations. Thus, studies of intraspecific variation conducted via genome-wide association will require analyses of genotype by environment interaction. Interestingly, the network structure of metabolite linkages was also sensitive to environmental differences, suggesting that key aspects of network architecture are malleable., Author Summary Understanding how genetic variation can control phenotypic variation is a fundamental goal of modern biology. We combined genome-wide association mapping with metabolomics in the plant Arabidopsis thaliana to explore how species-wide genetic variation controls metabolism. We identified numerous naturally-variable genes that may influence plant metabolism, often clustering in “hotspots.” These hotspots were proximal to selective sweeps, regions of the genome showing decreased diversity possibly from a strong selective advantage of specific variants within the region. This suggests that metabolism may be connected to the selective advantage. Interestingly, metabolite variation in wild Arabidopsis is highly constrained despite the significant genetic variation, thus providing the plant un-sampled metabolic space if the environment shifts. The observed structuring of genetic and metabolic variation suggests individual convergence upon similar phenotypes via different genotypes, possibly intra-specific parallel evolution. This phenotypic convergence couples with a pattern of genotype—phenotype association consistent with metabolite variation largely controlled by numerous small effect genetic variants. This supports the supposition that large magnitude variation is likely unstable in a complex and interconnected metabolism. If this pattern proves generally applicable to other species, it could present a significant hurdle to identifying genes controlling metabolic trait variation via genome-wide association studies.
- Published
- 2010