1. St. Jude Cloud: A Pediatric Cancer Genomic Data-Sharing Ecosystem
- Author
-
Zhaoming Wang, J. Robert Michael, Darrell Gentry, Suzanne J. Baker, Jobin Sunny, S M Ashiqul Islam, Clay McLeod, David W. Ellison, Michael A. Dyer, Mark R. Wilkinson, Jinghui Zhang, Ludmil B. Alexandrov, Chaitanya Bangur, Bob Davidson, Singer Ma, Geralyn Miller, Pamella Tater, Yong Cheng, Arthur Chiao, Alexander M. Gout, Tuan Nguyen, James R. Downing, Edgar Sioson, Gang Wu, Delaram Rahbarinia, Ed Suh, Xiaotu Ma, Shaohua Lei, Yutaka Yasui, Andrew Frantz, Kirby Birch, Scott G. Foy, Nedra Robison, Kim E. Nichols, Aman Patel, Richard Daly, Alberto S. Pappo, Naina Thangaraj, Xin Zhou, Leslie L. Robison, Matthew Lear, Vijay Kandali, Christopher P. Meyer, David Finkelstein, Stephanie Wiggins, Tracy Ard, Irina McGuire, Yu Liu, Samuel W. Brady, Gregory T. Armstrong, Liqing Tian, Charles G. Mullighan, Brent A. Orr, Ti-Cheng Chang, Keith Perry, Michael Macias, Shuoguo Wang, Lance E. Palmer, Soheil Meshinchi, Carmen L. Wilson, James McMurry, Andrew Swistak, Michael Rusch, Scott Newman, Leigh Tanner, Madison Treadway, Xing Tang, Omar Serang, Jian Wang, Andrew Thrasher, Rahul Mudunuri, Mitchell J. Weiss, and Michael N. Edmonson
- Subjects
0301 basic medicine ,Genomic data ,MEDLINE ,Cloud computing ,Anemia, Sickle Cell ,Article ,03 medical and health sciences ,0302 clinical medicine ,Neoplasms ,Humans ,Medicine ,Child ,Ecosystem ,Information Dissemination ,business.industry ,Cancer ,Genomics ,Cloud Computing ,Hospitals, Pediatric ,medicine.disease ,Pediatric cancer ,Data science ,Treatment efficacy ,Data sharing ,030104 developmental biology ,Workflow ,Oncology ,030220 oncology & carcinogenesis ,business - Abstract
Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem—Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community—enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. Significance: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community. This article is highlighted in the In This Issue feature, p. 995
- Published
- 2021