Back to Search Start Over

An entropy-based study on the mutational landscape of SARS-CoV-2 in USA: Comparing different variants and revealing co-mutational behavior of proteins.

Authors :
Santoni, Daniele
Source :
Gene. Sep2024, Vol. 922, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

[Display omitted] • SARS-CoV-2 mutational landscape over time is shown through an entropy-based approach. • Entropy and Hellinger distance provide information about mutational viral trajectory. • Profiles coming from different functional protein classes show a different behaviour. • Structural and accessory proteins mostly show non uniform and high profiles. • A mutational phylogenetic tree is built to infer functional link among proteins. COVID-19 emergency has pushed the international scientific community to use every resource to combat the spread of the virus, to understand its biology and predict its possible evolution in terms of new variants. Since the first SARS-CoV-2 virus nucleotide and amino acid sequences were made available, information theory was used to study how viral information content was changing over time and then trace the evolution of its mutational landscape. In this work we analyzed SARS-CoV-2 sequences collected mainly in the USA in a period from March 2020 until December 2022 and computed mutation profiles of viral proteins over time through an entropy-based approach using Shannon Entropy and Hellinger distance. This representation allows an at-a-glance view of the mutational landscape of viral proteins over time and can provide new insights on the evolution of the virus from different points of view. Non-structural proteins typically showed flat mutation profiles, characterized by a very low Average mutation Entropy, while accessory and structural proteins showed mostly non uniform and high mutation profiles, often coupled with the predominance of variants. Interestingly NSP2 protein, whose function is currently still debated, falls in the same branch of NSP14 and NSP10 in the phylogenetic tree of mutations constructed through correlations of mutation profiles, suggesting a co-evolution of those proteins and a possible functional link with each other. To the best of our knowledge this is the first study based on a massive amount of data (n = 107,939,973) that analyzes from an entropy point of view the mutational landscape of SARS-CoV-2 over time and depicts a mutational temporal profile of each protein of the virus. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
03781119
Volume :
922
Database :
Academic Search Index
Journal :
Gene
Publication Type :
Academic Journal
Accession number :
177749120
Full Text :
https://doi.org/10.1016/j.gene.2024.148556