Royo-Llonch, Marta, Sánchez, Pablo, Ruiz-González, Clara, Salazar, Guillem, Pedrós-Alió, Carlos, Sebastián, Marta, Labadie, Karine, Paoli, Lucas, M. Ibarbalz, Federico, Zinger, Lucie, Churcheward, Benjamin, Babin, Marcel, Bork, Peer, Boss, Emmanuel, Cochrane, Guy, de Vargas, Colomban, Gorsky, Gabriel, Grimsley, Nigel, Guidi, Lionel, Hingamp, Pascal, Iudicone, Daniele, Jaillon, Olivier, Kandels, Stefanie, Not, Fabrice, Ogata, Hiroyuki, Pesant, Stéphane, Poulton, Nicole, Raes, Jeroen, Sardet, Christian, Speich, Sabrina, Setmmann, Lars, Sullivan, Matthew B., Chaffron, Samuel, Eveillard, Damien, Karsenti, Eric, Sunagawa, Shinichi, Wincker, Patrick, Karp-Boss, Lee, Bowler, Chris, Acinas, Silvia G., Agencia Estatal de Investigación (España), Ministerio de Ciencia, Innovación y Universidades (España), Centre National de la Recherche Scientifique (France), European Commission, European Molecular Biology Laboratory, Centro de Investigaciones Biológicas (CSIC), Consejo Superior de Investigaciones Científicas [Madrid] (CSIC), Institute of Marine Sciences / Institut de Ciències del Mar [Barcelona] (ICM), Department of Biosystems Science and Engineering [ETH Zürich] (D-BSSE), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich] (ETH Zürich), Centro Nacional de Biotecnología [Madrid] (CNB-CSIC), Universidad de Granada = University of Granada (UGR), Genoscope - Centre national de séquençage [Evry] (GENOSCOPE), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA), Institute of Microbiology and Swiss Institute of Bioinformatics, Institut de biologie de l'Ecole Normale Supérieure (IBENS), École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires] (CONICET), Institut de biologie de l'ENS Paris (IBENS), Département de Biologie - ENS Paris, Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Laboratoire des Sciences du Numérique de Nantes (LS2N), Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-École Centrale de Nantes (Nantes Univ - ECN), Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Nantes université - UFR des Sciences et des Techniques (Nantes univ - UFR ST), Nantes Université - pôle Sciences et technologie, Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Nantes Université - pôle Sciences et technologie, Nantes Université (Nantes Univ), Global Oceans Systems Ecology & Evolution - Tara Oceans (GOSEE), Université de Perpignan Via Domitia (UPVD)-École Pratique des Hautes Études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Aix Marseille Université (AMU)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université d'Évry-Val-d'Essonne (UEVE)-Université de Toulon (UTLN)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut de Recherche pour le Développement (IRD [France-Nord])-Ecole Normale Supérieure Paris-Saclay (ENS Paris Saclay)-European Molecular Biology Laboratory (EMBL)-École Centrale de Nantes (Nantes Univ - ECN), Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Université australe du Chili, European Molecular Biology Laboratory [Heidelberg] (EMBL), Génomique métabolique (UMR 8030), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université d'Évry-Val-d'Essonne (UEVE)-Centre National de la Recherche Scientifique (CNRS), University of Maine, ANR-11-BTBR-0008,OCEANOMICS,Biotechnologies et bioressources pour la valorisation des écosystèmes marins planctoniques(2011), ANR-10-INBS-0009,France-Génomique,Organisation et montée en puissance d'une Infrastructure Nationale de Génomique(2010), ANR-10-LABX-0054,MEMOLIFE,Memory in living systems: an integrated approach(2010), ANR-11-IDEX-0001,Amidex,INITIATIVE D'EXCELLENCE AIX MARSEILLE UNIVERSITE(2011), and European Project: 862923,AtlantECO
14 pages, 6 figures, additional information https://doi.org/10.1038/s41564-021-00979-9.-- Data availability: Accession numbers for the data used and generated in this study can be found in Supplementary Table 12, which includes the Arctic MAGs Catalogue and their functional annotation (European Bioinformatics Institute BioStudies ID: S-BSST451) and the co-assembly of metagenomic samples used to generate the metagenomic bins (European Nucleotide Archive PRJEB41575). Accession numbers for the metagenomic and metatranscriptomic samples used in the fragment recruitment analyses can be found in Supplementary Table 13. Publicly available datasets used in this study include the following: CheckM v.1.0.11 (https://github.com/Ecogenomics/CheckM/releases/tag/v1.1.0), GTDB release 89 (https://data.gtdb.ecogenomic.org/releases/release89/), SILVA 132 (https://www.arb-silva.de/documentation/release-132/), KEGG release 89.1 (https://www.genome.jp/kegg/docs/relnote.html) and Pfam database release 31.0 (http://ftp.ebi.ac.uk/pub/databases/Pfam/releases/Pfam31.0/). Source data are provided with this paper, The role of the Arctic Ocean ecosystem in climate regulation may depend on the responses of marine microorganisms to environmental change. We applied genome-resolved metagenomics to 41 Arctic seawater samples, collected at various depths in different seasons during the Tara Oceans Polar Circle expedition, to evaluate the ecology, metabolic potential and activity of resident bacteria and archaea. We assembled 530 metagenome-assembled genomes (MAGs) to form the Arctic MAGs catalogue comprising 526 species. A total of 441 MAGs belonged to species that have not previously been reported and 299 genomes showed an exclusively polar distribution. Most Arctic MAGs have large genomes and the potential for fast generation times, both of which may enable adaptation to a copiotrophic lifestyle in nutrient-rich waters. We identified 38 habitat generalists and 111 specialists in the Arctic Ocean. We also found a general prevalence of 14 mixotrophs, while chemolithoautotrophs were mostly present in the mesopelagic layer during spring and autumn. We revealed 62 MAGs classified as key Arctic species, found only in the Arctic Ocean, showing the highest gene expression values and predicted to have habitat-specific traits. The Artic MAGs catalogue will inform our understanding of polar microorganisms that drive global biogeochemical cycles, This work acknowledges the ‘Severo Ochoa Centre of Excellence’ accreditation (CEX2019-000928-S). We thank the commitment of the following sponsors and research funding agencies: the Spanish Ministry of Economy and Competitiveness (project MAGGY, grant no. CTM2017-87736-R and Polar EcoGen PID2020-116489RB-I00), Horizon 2020-Research and Innovation Framework Programme (Atlantic ECOsystems assessment, forecasting & sustainability, grant no. H2020-BG-2019-2), Centre National de la Recherche Scientifique (in particular Groupement de Recherche GDR3280 and the Research Federation for the study of Global Ocean Systems Ecology and Evolution, FR2022/Tara Oceans-GOSEE), European Molecular Biology Laboratory, Genoscope/Commissariat à l’Énergie Atomique et aux Énergies Alternatives, the French Ministry of Research and the French Government’s ‘Investissements d’Avenir’ programmes OCEANOMICS (project no. ANR-11-BTBR-0008), FRANCE GENOMIQUE (project no. ANR-10-INBS-09-08), MEMO LIFE (project no. ANR-10-LABX-54), Paris Sciences et Lettres University (project no. ANR-11-IDEX-0001-02), Eidgenössische Technische Hochschule Zürich and Helmut Horten Foundation, the Swiss National Foundation (project no. 205321_184955), MEXT/JSPS/KAKENHI (project nos. 16H06429, 16K21723, 16H06437 and 18H02279)