Back to Search Start Over

Transcript-targeted analysis reveals isoform alterations and double-hop fusions in breast cancer

Authors :
Shinichi Namba
Noriko Maeda
Hiroyuki Mano
Fumishi Kishigami
Mizuo Ando
Shinya Kojima
Katsushige Kawase
Satoshi Inoue
Ueno Toshihide
Yosuke Tanaka
Kenya Kobayashi
Shusuke Kawashima
Togashi Yosuke
Tomoko Ogawa
Masahito Kawazu
Yuichi Shiraishi
Shoichi Hazama
Source :
Communications Biology, Vol 4, Iss 1, Pp 1-16 (2021), Communications Biology
Publication Year :
2021
Publisher :
Nature Portfolio, 2021.

Abstract

Although transcriptome alteration is an essential driver of carcinogenesis, the effects of chromosomal structural alterations on the cancer transcriptome are not yet fully understood. Short-read transcript sequencing has prevented researchers from directly exploring full-length transcripts, forcing them to focus on individual splice sites. Here, we develop a pipeline for Multi-Sample long-read Transcriptome Assembly (MuSTA), which enables construction of a transcriptome from long-read sequence data. Using the constructed transcriptome as a reference, we analyze RNA extracted from 22 clinical breast cancer specimens. We identify a comprehensive set of subtype-specific and differentially used isoforms, which extended our knowledge of isoform regulation to unannotated isoforms including a short form TNS3. We also find that the exon–intron structure of fusion transcripts depends on their genomic context, and we identify double-hop fusion transcripts that are transcribed from complex structural rearrangements. For example, a double-hop fusion results in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and is thought to protect fetus from maternal rejection; expression is elevated in several TCGA samples with ERVFRD-1 fusions. Our analyses provide direct evidence that full-length transcript sequencing of clinical samples can add to our understanding of cancer biology and genomics in general.<br />Namba et al develop a new pipeline called MuSTA to enable the efficient assembly of transcriptome from long-read sequencing data of breast cancer samples. This method enables the authors to discover subtype-specific isoforms, find that fusion transcript structures depend on their genomic context and identify a double-hop fusion that results in aberrant expression of an endogenous retroviral gene.

Details

Language :
English
ISSN :
23993642
Volume :
4
Issue :
1
Database :
OpenAIRE
Journal :
Communications Biology
Accession number :
edsair.doi.dedup.....53f542b716ab266fa9e42efaa345e653