Back to Search
Start Over
Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates
- Source :
- PLoS ONE, Vol 16, Iss 9, p e0249254 (2021), PLoS ONE
- Publication Year :
- 2021
- Publisher :
- Public Library of Science (PLoS), 2021.
-
Abstract
- Due to the widespread of the COVID-19 pandemic, the SARS-CoV-2 genome is evolving in diverse human populations. Several studies already reported different strains and an increase in the mutation rate. Particularly, mutations in SARS-CoV-2 spike-glycoprotein are of great interest as it mediates infection in human and recently approved mRNA vaccines are designed to induce immune responses against it. We analyzed 1,036,030 SARS-CoV-2 genome assemblies and 30,806 NGS datasets from GISAID and European Nucleotide Archive (ENA) focusing on non-synonymous mutations in the spike protein. Only around 2.5% of the samples contained the wild-type spike protein with no variation from the reference. Among the spike protein mutants, we confirmed a low mutation rate exhibiting less than 10 non-synonymous mutations in 99.6% of the analyzed sequences, but the mean and median number of spike protein mutations per sample increased over time. 5,472 distinct variants were found in total. The majority of the observed variants were recurrent, but only 21 and 14 recurrent variants were found in at least 1% of the mutant genome assemblies and NGS samples, respectively. Further, we found high-confidence subclonal variants in about 2.6% of the NGS data sets with mutant spike protein, which might indicate co-infection with various SARS-CoV-2 strains and/or intra-host evolution. Lastly, some variants might have an effect on antibody binding or T-cell recognition. These findings demonstrate the continuous importance of monitoring SARS-CoV-2 sequences for an early detection of variants that require adaptations in preventive and therapeutic strategies.
- Subjects :
- RNA viruses
Mutation rate
Coronaviruses
Epidemiology
Molecular biology
T-Lymphocytes
Mutant
Gene Identification and Analysis
medicine.disease_cause
Genome
White Blood Cells
Database and Informatics Methods
Sequencing techniques
Mutation Rate
Animal Cells
DNA sequencing
Pathology and laboratory medicine
Genetics
Mutation
Multidisciplinary
T Cells
Microbial Mutation
High-Throughput Nucleotide Sequencing
Genomics
Medical microbiology
Viruses
Spike Glycoprotein, Coronavirus
Medicine
SARS CoV 2
Pathogens
Cellular Types
Transcriptome Analysis
Sequence Analysis
Research Article
Next-Generation Sequencing
SARS coronavirus
Bioinformatics
Immune Cells
Science
Immunology
Protein domain
Sequence alignment
Genome, Viral
Biology
Microbiology
Antibodies
Protein Domains
medicine
Humans
Mutation Detection
Pandemics
Medicine and health sciences
Blood Cells
Biology and life sciences
SARS-CoV-2
Organisms
Viral pathogens
Computational Biology
COVID-19
Cell Biology
Genome Analysis
Microbial pathogens
Research and analysis methods
Molecular biology techniques
Sequence Alignment
Subjects
Details
- ISSN :
- 19326203
- Volume :
- 16
- Database :
- OpenAIRE
- Journal :
- PLOS ONE
- Accession number :
- edsair.doi.dedup.....76fb7e6bd956666948451823d42722de
- Full Text :
- https://doi.org/10.1371/journal.pone.0249254