Back to Search Start Over

MultiCapsNet: A General Framework for Data Integration and Interpretable Classification.

Authors :
Wang, Lifei
Miao, Xuexia
Nie, Rui
Zhang, Zhang
Zhang, Jiang
Cai, Jun
Source :
Frontiers in Genetics; 1/27/2022, Vol. 13, p1-12, 12p
Publication Year :
2022

Abstract

The latest progresses of experimental biology have generated a large number of data with different formats and lengths. Deep learning is an ideal tool to deal with complex datasets, but its inherent "black box" nature needs more interpretability. At the same time, traditional interpretable machine learning methods, such as linear regression or random forest, could only deal with numerical features instead of modular features often encountered in the biological field. Here, we present MultiCapsNet (https://github.com/wanglf19/MultiCapsNet), a new deep learning model built on CapsNet and scCapsNet, which possesses the merits such as easy data integration and high model interpretability. To demonstrate the ability of this model as an interpretable classifier to deal with modular inputs, we test MultiCapsNet on three datasets with different data type and application scenarios. Firstly, on the labeled variant call dataset, MultiCapsNet shows a similar classification performance with neural network model, and provides importance scores for data sources directly without an extra importance determination step required by the neural network model. The importance scores generated by these two models are highly correlated. Secondly, on single cell RNA sequence (scRNA-seq) dataset, MultiCapsNet integrates information about protein-protein interaction (PPI), and protein-DNA interaction (PDI). The classification accuracy of MultiCapsNet is comparable to the neural network and random forest model. Meanwhile, MultiCapsNet reveals how each transcription factor (TF) or PPI cluster node contributes to classification of cell type. Thirdly, we made a comparison between MultiCapsNet and SCENIC. The results show several cell type relevant TFs identified by both methods, further proving the validity and interpretability of the MultiCapsNet. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
16648021
Volume :
13
Database :
Complementary Index
Journal :
Frontiers in Genetics
Publication Type :
Academic Journal
Accession number :
154941972
Full Text :
https://doi.org/10.3389/fgene.2021.767602