Codon Bias in Human Genes
- Publication Year :
- 2022
- Publisher :
- University of Guelph, 2022.
Abstract
- Codon bias describes the tendency to use certain synonymous codons to encode amino acids. It is well established that codon bias varies between organisms and plays a role in gene expression and co-translational folding. Understanding codon bias is important because a better understanding of gene expression and translation mechanics may allow for more efficient recombinant protein production, and could ultimately improve the ability to create synthetic genes. Human genes were investigated to elucidate the connection between their codon bias and the subsequent impact on structure, function, and tissue-specific expression levels. Analysis was performed by representing human genes according to their codon bias, then clustering together genes that have a similar codon bias. Gene clusters were studied to see whether genes that use similar codons are statistically more likely to share other properties. Clustering was performed using a novel data-driven approach to a simple clustering algorithm called anchor clustering. Anchor clustering was used because it is fast and deterministic, two qualities that other approaches can struggle with when clustering data in high-dimensional spaces. To study the connection between gene product structure and codon bias, clusters were analysed according to their likelihood of containing intrinsically disordered proteins. Because structure and function are so closely related, clusters were also analysed for GO term overrepresentation. Finally, clusters were examined through the lens of tissue-specific gene expression by incorporating expression information at the mRNA and protein levels. The analyses revealed an association between codon usage and the propensity of a gene product to be intrinsically disordered, while the functional analyses revealed that codon bias is associated with cell cycle regulation and cell type differentiation.
Expression analysis revealed that in humans there may be a codon bias associated with highly expressed genes irrespective of tissue, as well as tissue-specific codon biases in the cortex, testis, and liver. Some of the tissue-specific findings have been reported by other groups, but this investigation distinguishes between an organism-wide codon bias associated with high expression and particular codon biases associated with high expression in individual tissues. In addition, this work builds on the current knowledge of codon bias by determining whether findings previously evaluated only at the mRNA level also appear at the protein concentration level. The results suggest that codon harmonization can be improved further by seeking to replicate the codon bias of the tissue in which a gene could be highly expressed.
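The abstract does not state how genes were represented according to their codon bias. As a minimal illustrative sketch only, one common choice is to describe each coding sequence by the relative frequency of every codon among its synonymous codons, using the standard genetic code. The function name `codon_bias_vector` and the normalisation are assumptions made for this example, not the method used in the thesis, and the anchor clustering step is not reproduced here.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks stop codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_bias_vector(cds):
    """Hypothetical helper: map each of the 64 codons to its share of the
    codons that encode the same amino acid in the given coding sequence."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    # Count only recognised codons (skips triplets containing N, etc.).
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Total observations per amino acid, used to normalise synonymous codons.
    aa_totals = Counter()
    for codon, n in counts.items():
        aa_totals[CODON_TABLE[codon]] += n

    # Amino acids absent from the sequence get 0.0 for all of their codons.
    return {
        codon: (counts[codon] / aa_totals[aa] if aa_totals[aa] else 0.0)
        for codon, aa in CODON_TABLE.items()
    }


vec = codon_bias_vector("ATGCTGCTGCTCAAA")
print(round(vec["CTG"], 3), round(vec["CTC"], 3))
```

Vectors of this kind place each gene in a 64-dimensional space, which is the sort of high-dimensional setting the abstract refers to when motivating a fast, deterministic clustering method.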
- Subjects :
- Disordered proteins
tRNA concentration
Elongation rate
Codon bias
Bioinformatics
Genetic code
Unsupervised Learning
Tissue specific expression
Coding sequence
Codon harmonization
Clustering
Intrinsically unstructured proteins
Human gene analysis
Data driven
Algorithm initialization
Ribosomal elongation
Gene translation
Protein concentration
Protein folding
Codon
tRNA
Human genome
Evolutionary computation
mRNA expression
Packing problems
Anchor clustering
Amino acid
Enrichment
Genetic algorithm
Co-translational folding
Protein structure
Gene expression
Codon usage
Gene function
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.od.......453..aa6d6d86ad3b7d1fe768f4e28874a175