Michael J. Bocek, Andrew Fire, Yuri I. Wolf, Yongjie Wang, Mart Krupovic, Valerian V. Dolja, Shuang Wu, Darius Kazlauskas, Sukrit Silas, Eugene V. Koonin, National Library of Medicine (NLM), National Institutes of Health [Bethesda] (NIH)-National Center for Biotechnology Information (NCBI), Department of Cellular and Molecular Pharmacology [San Francisco] (CMP), University of California [San Francisco] (UCSF), University of California-University of California, College of Food Science and Technology [Shangai], Shanghai Ocean University, Qingdao National Laboratory for Marine Science and Technology, Institute of Biotechnology [Vilnius], Life Science Center [Vilnius], Vilnius University [Vilnius]-Vilnius University [Vilnius], Virologie des archées - Archaeal Virology, Institut Pasteur [Paris], Department of Pathology [Stanford], Stanford Medicine, Stanford University-Stanford University, Department of Botany and Plant Pathology, Oregon State University (OSU), Y.I.W. and E.V.K. are supported through the Intramural Research Program of the US National Institutes of Health (National Library of Medicine). S.S. is a Damon Runyon Fellow supported by the Damon Runyon Cancer Research Foundation (DRG-(2352-19)). A.F. was supported by National Institutes of Health awards R01GM37706 and R35GM130366. M.K. was supported by l’Agence Nationale de la Recherche grant ANR-17-CE15-0005-01 (ENVIRA). D.K. was funded by the European Social Fund under no. 09.3.3-LMT-K-712 ‘Development of Competences of Scientists, other Researchers and Students through Practical Research Activities’ measure. Y.W. was supported by the National Natural Science Foundation of China (nos. 41376135, 31570112 and 41876195)., We thank N. Yutin for providing protein multiple-sequence analysis for DNA virus search., ANR-17-CE15-0005,ENVIRA,Remodelage de la membrane cytoplasmique par les virus enveloppés d'archées(2017), University of California [San Francisco] (UC San Francisco), University of California (UC)-University of California (UC), and Institut Pasteur [Paris] (IP)
RNA viruses in aquatic environments remain poorly studied. Here, we analysed the RNA virome from approximately 10 l water from Yangshan Deep-Water Harbour near the Yangtze River estuary in China and identified more than 4,500 distinct RNA viruses, doubling the previously known set of viruses. Phylogenomic analysis identified several major lineages, roughly, at the taxonomic ranks of class, order and family. The 719-member-strong Yangshan virus assemblage is the sister clade to the expansive class Alsuviricetes and consists of viruses with simple genomes that typically encode only RNA-dependent RNA polymerase (RdRP), capping enzyme and capsid protein. Several clades within the Yangshan assemblage independently evolved domain permutation in the RdRP. Another previously unknown clade shares ancestry with Potyviridae, the largest known plant virus family. The ‘Aquatic picorna-like viruses/Marnaviridae’ clade was greatly expanded, with more than 800 added viruses. Several RdRP-linked protein domains not previously detected in any RNA viruses were identified, such as the small ubiquitin-like modifier (SUMO) domain, phospholipase A2 and PrsW-family protease domain. Multiple viruses utilize alternative genetic codes implying protist (especially ciliate) hosts. The results reveal a vast RNA virome that includes many previously unknown groups. However, phylogenetic analysis of the RdRPs supports the previously established five-branch structure of the RNA virus evolutionary tree, with no additional phyla., Metagenomic analysis of a single RNA virome from the Yangshan Deep-Water Harbour in China enabled the recovery of more than 4,500 distinct RNA viruses, doubling the known set of RNA viruses to date, and provided insights into their biology.