Back to Search
Start Over
Co-fuse: a new class discovery analysis tool to identify and prioritize recurrent fusion genes from RNA-sequencing data
- Source :
- Molecular Genetics and Genomics. 293:1217-1229
- Publication Year :
- 2018
- Publisher :
- Springer Science and Business Media LLC, 2018.
-
Abstract
- Recurrent oncogenic fusion genes play a critical role in the development of various cancers and diseases and provide, in some cases, excellent therapeutic targets. To date, analysis tools that can identify and compare recurrent fusion genes across multiple samples have not been available to researchers. To address this deficiency, we developed Co-occurrence Fusion (Co-fuse), a new and easy to use software tool that enables biologists to merge RNA-seq information, allowing them to identify recurrent fusion genes, without the need for exhaustive data processing. Notably, Co-fuse is based on pattern mining and statistical analysis which enables the identification of hidden patterns of recurrent fusion genes. In this report, we show that Co-fuse can be used to identify 2 distinct groups within a set of 49 leukemic cell lines based on their recurrent fusion genes: a multiple myeloma (MM) samples-enriched cluster and an acute myeloid leukemia (AML) samples-enriched cluster. Our experimental results further demonstrate that Co-fuse can identify known driver fusion genes (e.g., IGH-MYC, IGH-WHSC1) in MM, when compared to AML samples, indicating the potential of Co-fuse to aid the discovery of yet unknown driver fusion genes through cohort comparisons. Additionally, using a 272 primary glioma sample RNA-seq dataset, Co-fuse was able to validate recurrent fusion genes, further demonstrating the power of this analysis tool to identify recurrent fusion genes. Taken together, Co-fuse is a powerful new analysis tool that can be readily applied to large RNA-seq datasets, and may lead to the discovery of new disease subgroups and potentially new driver genes, for which, targeted therapies could be developed. The Co-fuse R source code is publicly available at https://github.com/sakrapee/co-fuse .
- Subjects :
- 0301 basic medicine
Oncogene Proteins, Fusion
Sequence Analysis, RNA
Software tool
Sequencing data
Computational Biology
RNA
Genomics
General Medicine
Computational biology
Biology
Human genetics
Fusion gene
Leukemia, Myeloid, Acute
03 medical and health sciences
030104 developmental biology
New disease
Databases, Genetic
Genetics
Humans
Analysis tools
Molecular Biology
Gene
Software
Subjects
Details
- ISSN :
- 16174623 and 16174615
- Volume :
- 293
- Database :
- OpenAIRE
- Journal :
- Molecular Genetics and Genomics
- Accession number :
- edsair.doi.dedup.....c32a80482561c43a7f602a2760b4f664
- Full Text :
- https://doi.org/10.1007/s00438-018-1454-1