1. Reply to Grigoriev et al., 'Sequences of SARS-CoV-2 'Hybrids' with the Human Genome: Signs of Non-coding RNA?'
- Author
-
Luopin Wang, Srishti Chakravorty, Michail S. Lionakis, Jorge L Trujillo-Ochoa, Behdad Afzali, Carmen Mirabelli, Majid Kazemian, Christiane E. Wobus, Daniel Chauss, Bingyu Yan, Matthew R. Olson, and Dhaneshwar Kumar
- Subjects
RNA, Untranslated ,biology ,Genome, Human ,SARS-CoV-2 ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Immunology ,COVID-19 ,Computational biology ,Genome, Viral ,Non-coding RNA ,Microbiology ,DNA sequencing ,Viral gene ,Transcriptome ,Virology ,Insect Science ,biology.protein ,Humans ,RNA, Viral ,Human genome ,Polymerase ,Sequence (medicine) - Abstract
High throughput sequencing reads from virally infected cells provide detailed information about both the infected host cells and invading viruses (1). For example, RNA-sequencing techniques from infected cells contains reads that unequivocally align to either the host or the viral transcriptomes, enabling quantification of host and viral gene expressions (2). Occasionally, there are reads with split characteristics, having one part (e.g., the 5' end) unambiguously matching the host and another part (e.g., the 3' end) clearly matching the viral genomes. The split characteristic with unambiguous matching on either part is the key here, typically requiring convincing stretches of sequence matches such as >30bp that we used in our analysis (3). Such reads are termed host-virus chimeric reads (HVCRs). Indeed, HVCRs that surpass statistical reproducibility and signal-to-noise standards might carry novel insights into the biology of host-virus interactions (4, 5). Thus, it is important to unambiguously detect statistically rigorous and biologically relevant HVCRs. We and others have shown that detection of relevant HVCRs is complicated by unfaithful reverse-transcriptase and polymerase enzymes that template-switch during typical high throughput sequencing library preparation protocols (6-9).
- Published
- 2021