Back to Search Start Over

The gene expression classifier ALLCatchR identifies B-precursor ALL subtypes and underlying developmental trajectories across age

Authors :
Thomas Beder
Björn-Thore Hansen
Alina M. Hartmann
Johannes Zimmermann
Eric Amelunxen
Nadine Wolgast
Wencke Walter
Marketa Zaliova
Željko Antić
Philippe Chouvarine
Lorenz Bartsch
Malwine Barz
Miriam Bultmann
Johanna Horns
Sonja Bendig
Jan Kässens
Christoph Kaleta
Gunnar Cario
Martin Schrappe
Martin Neumann
Nicola Gökbuget
Anke Katharina Bergmann
Jan Trka
Claudia Haferlach
Monika Brüggemann
Claudia D. Baldus
Lorenz Bastian
Publication Year :
2023
Publisher :
Cold Spring Harbor Laboratory, 2023.

Abstract

Current classifications (WHO-HAEM5 / ICC) define up to 26 molecular B-cell precursor acute lymphoblastic leukemia (BCP-ALL) disease subtypes, which are defined by genomic driver aberrations and corresponding gene expression signatures. Identification of driver aberrations by RNA-Seq is well established, while systematic approaches for gene expression analysis are less advanced. Therefore, we developed ALLCatchR, a machine learning based classifier using RNA-Seq expression data to allocate BCP-ALL samples to 21 defined molecular subtypes. Trained on n=1,869 transcriptome profiles with established subtype definitions (4 cohorts; 55% pediatric / 45% adult), ALLCatchR allowed subtype allocation in 3 independent hold-out cohorts (n=1,018; 75% pediatric / 25% adult) with 95.7% accuracy (averaged sensitivity across subtypes: 91.1% / specificity: 99.8%). ‘High confidence predictions’ were achieved in 84.6% of samples with 99.7% accuracy. Only 1.2% of samples remained ‘unclassified’. ALLCatchR outperformed existing tools and identified novel candidates in previously unassigned samples. We established a novel RNA-Seq reference of human B-lymphopoiesis. Implementation in ALLCatchR enabled projection of BCP-ALL samples to this trajectory, which identified shared patterns of proximity of BCP-ALL subtypes to normal lymphopoiesis stages. ALLCatchR sustains RNA-Seq routine application in BCP-ALL diagnostics with systematic gene expression analysis for accurate subtype allocations and novel insights into underlying developmental trajectories.

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........2c352f050053db7a1641511221b3ef54