Back to Search Start Over

Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space

Authors :
Michael C. Schatz
Anthony A. Philippakis
Enis Afgan
Eric Banks
Vincent J. Carey
Robert J. Carroll
Alessandro Culotti
Kyle Ellrott
Jeremy Goecks
Robert L. Grossman
Ira M. Hall
Kasper D. Hansen
Jonathan Lawson
Jeffrey T. Leek
Anne O’Donnell Luria
Stephen Mosher
Martin Morgan
Anton Nekrutenko
Brian D. O’Connor
Kevin Osborn
Benedict Paten
Candace Patterson
Frederick J. Tan
Casey Overby Taylor
Jennifer Vessio
Levi Waldron
Ting Wang
Kristin Wuichet
Alexander Baumann
Andrew Rula
Anton Kovalsy
Clare Bernard
Derek Caetano-Anollés
Geraldine A. Van der Auwera
Justin Canas
Kaan Yuksel
Kate Herman
M. Morgan Taylor
Marianie Simeon
Michael Baumann
Qi Wang
Robert Title
Ruchi Munshi
Sushma Chaluvadi
Valerie Reeves
William Disman
Salin Thomas
Allie Hajian
Elizabeth Kiernan
Namrata Gupta
Trish Vosburg
Ludwig Geistlinger
Marcel Ramos
Sehyun Oh
Dave Rogers
Frances McDade
Mim Hastie
Nitesh Turaga
Alexander Ostrovsky
Alexandru Mahmoud
Dannon Baker
Dave Clements
Katherine E.L. Cox
Keith Suderman
Nataliya Kucher
Sergey Golitsynskiy
Samantha Zarate
Sarah J. Wheelan
Kai Kammers
Ana Stevens
Carolyn Hutter
Christopher Wellington
Elena M. Ghanaim
Ken L. Wiley, Jr.
Shurjo K. Sen
Valentina Di Francesco
Deni s Yuen
Brian Walsh
Luke Sargent
Vahid Jalili
John Chilton
Lori Shepherd
B.J. Stubbs
Ash O’Farrell
Benton A. Vizzier, Jr.
Charles Overbeck
Charles Reid
David Charles Steinberg
Elizabeth A. Sheets
Julian Lucas
Lon Blauvelt
Louise Cabansay
Noah Warren
Brian Hannafious
Tim Harris
Radhika Reddy
Eric Torstenson
M. Katie Banasiewicz
Haley J. Abel
Jason Walker
Source :
Cell Genomics, Vol 2, Iss 1, Pp 100085- (2022)
Publication Year :
2022
Publisher :
Elsevier, 2022.

Abstract

Summary: The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types.

Details

Language :
English
ISSN :
2666979X
Volume :
2
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Cell Genomics
Publication Type :
Academic Journal
Accession number :
edsdoj.95ac1623e901491c821d128f529da364
Document Type :
article
Full Text :
https://doi.org/10.1016/j.xgen.2021.100085