1. Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
- Author
-
Sébastien Lhomme, Justine Latour, Nicolas Jeanne, Pauline Trémeaux, Noémie Ranger, Marion Migueres, Gérald Salin, Cécile Donnadieu, Jacques Izopet, Laboratoire Virologie [CHU Toulouse], Institut Fédératif de Biologie (IFB), Centre Hospitalier Universitaire de Toulouse (CHU Toulouse)-Centre Hospitalier Universitaire de Toulouse (CHU Toulouse)-Pôle Biologie [CHU Toulouse], Centre Hospitalier Universitaire de Toulouse (CHU Toulouse), Institut Toulousain des Maladies Infectieuses et Inflammatoires (Infinity), Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Génome et Transcriptome - Plateforme Génomique ( GeT-PlaGe), Plateforme Génome & Transcriptome (GET), Génopole Toulouse Midi-Pyrénées [Auzeville] (GENOTOUL), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Ecole Nationale Vétérinaire de Toulouse (ENVT), Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Génopole Toulouse Midi-Pyrénées [Auzeville] (GENOTOUL), Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), and Benson-Rumiz, Alicia
- Subjects
[SDV.MHEP] Life Sciences [q-bio]/Human health and pathology ,Genotype ,clade ,SARS-CoV-2 ,S1 domain ,Illumina sequencing ,COVID-19 ,Sequence Analysis, DNA ,PacBio SMRT sequencing ,lineage ,Microbiology ,Article ,QR1-502 ,Infectious Diseases ,[SDV.MHEP.MI]Life Sciences [q-bio]/Human health and pathology/Infectious diseases ,Virology ,Mutation ,Spike Glycoprotein, Coronavirus ,[SDV.MHEP.MI] Life Sciences [q-bio]/Human health and pathology/Infectious diseases ,Humans ,Protein Interaction Domains and Motifs ,[SDV.MHEP]Life Sciences [q-bio]/Human health and pathology - Abstract
International audience; The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the causal agent of the COVID-19 pandemic that emerged in late 2019. The outbreak of variants with mutations in the region encoding the spike protein S1 sub-unit that can make them more resistant to neutralizing or monoclonal antibodies is the main point of the current monitoring. This study examines the feasibility of predicting the variant lineage and monitoring the appearance of reported mutations by sequencing only the region encoding the S1 domain by Pacific Bioscience Single Molecule Real-Time sequencing (PacBio SMRT). Using the PacBio SMRT system, we successfully sequenced 186 of the 200 samples previously sequenced with the Illumina COVIDSeq (whole genome) system. PacBio SMRT detected mutations in the S1 domain that were missed by the COVIDseq system in 27/186 samples (14.5%), due to amplification failure. These missing positions included mutations that are decisive for lineage assignation, such as G142D (n = 11), N501Y (n = 6), or E484K (n = 2). The lineage of 172/186 (92.5%) samples was accurately determined by analyzing the region encoding the S1 domain with a pipeline that uses key positions in S1. Thus, the PacBio SMRT protocol is appropriate for determining virus lineages and detecting key mutations.
- Published
- 2021