A data processing pipeline for the AACR project GENIE biopharma collaborative data with the {genieBPC} R package.

Authors :
Lavery JA
Brown S
Curry MA
Martin A
Sjoberg DD
Whiting K
Source :
Bioinformatics (Oxford, England) [Bioinformatics] 2023 Jan 01; Vol. 39 (1).
Publication Year :
2023

Abstract

Motivation: Data from the American Association for Cancer Research Project Genomics Evidence Neoplasia Information Exchange Biopharma Collaborative (GENIE BPC) represent comprehensive clinical data linked to high-throughput sequencing data, providing a multi-institution, pan-cancer, publicly available data repository. GENIE BPC data provide detailed demographic, clinical, treatment, genomic and outcome data for patients with cancer. These data result in a unique observational database of molecularly characterized tumors with comprehensive clinical annotation that can be used for health outcomes and precision medicine research in oncology. Due to the inherently complex structure of the multiple phenomic and genomic datasets, the use of these data requires a robust process for data integration and preparation in order to build analytic models.

Results: We present the {genieBPC} package, a user-friendly data processing pipeline to facilitate the creation of analytic cohorts from the GENIE BPC data that are ready for clinico-genomic modeling and analyses.

Availability and Implementation: {genieBPC} is available on CRAN and GitHub.

(© The Author(s) 2022. Published by Oxford University Press.)

Details

Language :
English
ISSN :
1367-4811
Volume :
39
Issue :
1
Database :
MEDLINE
Journal :
Bioinformatics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
36519837
Full Text :
https://doi.org/10.1093/bioinformatics/btac796