Back to Search Start Over

Application of a Deep Matrix Factorization Model on Integrated Gene Expression Data

Authors :
Jialiang Yang
Muhammad Afzal
Pravinkumar Patil
Muhammad Abdul Qadir Najmul Ikram
Julio Montes-Avila
Xiang Zhen Kong
Francesco Bertagna , Domenico Albano, Elisabetta Cerudelli, Maria Gazzilli, Raffaele Giubbini
A. C. Iliopoulos, G. Beis, P. Apostolou
Giorgio Treglia
Xuesi Ma, Baohang Xi, Yi Zhang, Lijuan Zhu, Xin Sui
Qingyong Li
Sainath B. Zangade
I. Papasotiriou
Source :
Current Bioinformatics. 15:359-367
Publication Year :
2020
Publisher :
Bentham Science Publishers Ltd., 2020.

Abstract

Background: Non-negative Matrix Factorization (NMF) has been extensively used in gene expression data. However, most NMF-based methods have single-layer structures, which may achieve poor performance for complex data. Deep learning, with its carefully designed hierarchical structure, has shown significant advantages in learning data features. Objective: In bioinformatics, on the one hand, to discover differentially expressed genes in gene expression data; on the other hand, to obtain higher sample clustering results. It can provide the reference value for the prevention and treatment of cancer. Method: In this paper, we apply a deep NMF method called Deep Semi-NMF on the integrated gene expression data. In each layer, the coefficient matrix is directly decomposed into the basic and coefficient matrix of the next layer. We apply this factorization model on The Cancer Genome Atlas (TCGA) genomic data. Results: The experimental results demonstrate the superiority of Deep Semi-NMF method in identifying differentially expressed genes and clustering samples. Conclusion: The Deep Semi-NMF model decomposes a matrix into multiple matrices and multiplies them to form a matrix. It can also improve the clustering performance of samples while digging out more accurate key genes for disease treatment.

Details

ISSN :
15748936
Volume :
15
Database :
OpenAIRE
Journal :
Current Bioinformatics
Accession number :
edsair.doi...........e7011930dce3cb1ad688e136f51e6c7c