1. Predicting Fitness-Related Traits Using Gene Expression and Machine Learning.
- Author
-
Henry, Georgia A and Stinchcombe, John R
- Abstract
Evolution by natural selection occurs at its most basic through the change in frequencies of alleles; connecting those genomic targets to phenotypic selection is an important goal for evolutionary biology in the genomics era. The relative abundance of gene products expressed in a tissue can be considered a phenotype intermediate to the genes and genomic regulatory elements themselves and more traditionally measured macroscopic phenotypic traits such as flowering time, size, or growth. The high dimensionality, low sample size nature of transcriptomic sequence data is a double-edged sword, however, as it provides abundant information but makes traditional statistics difficult. Machine learning (ML) has many features which handle high-dimensional data well and is thus useful in genetic sequence applications. Here, we examined the association of fitness components with gene expression data in Ipomoea hederacea (Ivyleaf morning glory) grown under field conditions. We combine the results of two different ML approaches and find evidence that expression of photosynthesis-related genes is likely under selection. We also find that genes related to stress and light responses were overall important in predicting fitness. With this study, we demonstrate the utility of ML models for smaller samples and their potential application for understanding natural selection. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF