Exploring Drivers of Gene Expression in The Cancer Genome Atlas

Authors :
Paul L. Auer
Hallgeir Rui
Michael J. Flister
Andrea Rau
Génétique Animale et Biologie Intégrative (GABI)
Institut National de la Recherche Agronomique (INRA)-AgroParisTech
Université Paris Saclay (COmUE)
University of Wisconsin - Milwaukee
Department of Physiology
Medical College of Wisconsin
European Project: 267196,EC:FP7:PEOPLE,FP7-PEOPLE-2010-COFUND,AGREENSKILLS(2012)
Source :
Bioinformatics, Oxford University Press (OUP), 2019, 35 (1), pp.62-68. ⟨10.1093/bioinformatics/bty551⟩
Publication Year :
2017
Publisher :
Cold Spring Harbor Laboratory, 2017.

Abstract

Motivation: The Cancer Genome Atlas (TCGA) has greatly advanced cancer research by generating, curating and publicly releasing deeply measured molecular data from thousands of tumor samples. In particular, gene expression measures, both within and across cancer types, have been used to determine the genes and proteins that are active in tumor cells.

Results: To more thoroughly investigate the behavior of gene expression in TCGA tumor samples, we introduce a statistical framework for partitioning the variation in gene expression due to a variety of molecular variables, including somatic mutations, transcription factors (TFs), microRNAs, copy number alterations, methylation and germ-line genetic variation. As proof-of-principle, we identify and validate specific TFs that influence the expression of PTPN14 in breast cancer cells.

Availability and implementation: We provide a freely available, user-friendly, browseable, interactive web-based application for exploring the results of our transcriptome-wide analyses across 17 different cancers in TCGA at http://ls-shiny-prod.uwm.edu/edge_in_tcga. All TCGA Open Access tier data are available at the Broad Institute GDAC Firehose and were downloaded using the TCGA2STAT R package. TCGA Controlled Access tier data are available through the Genomic Data Commons (GDC). R scripts used to download, format and analyze the data and produce the interactive R/Shiny web app have been made available on GitHub at https://github.com/andreamrau/EDGE-in-TCGA.

Details

Language :
English
ISSN :
1367-4803 and 1367-4811
Database :
OpenAIRE
Journal :
Bioinformatics, Oxford University Press (OUP), 2019, 35 (1), pp.62-68. ⟨10.1093/bioinformatics/bty551⟩
Accession number :
edsair.doi.dedup.....1e4c3924b7d7b588c0e29684794b36be
Full Text :
https://doi.org/10.1101/227926