
Comparative Study of three Clustering Algorithms for Microarray Data

Authors :
Noveenaa Pious
Dicky John Davis G
Source :
Global Journal of Science Frontier Research. :11-17
Publication Year :
2022
Publisher :
Global Journals, 2022.

Abstract

High-throughput genomic data analysis is becoming an increasingly integral part of biomedical research. The information derived from gene expression analysis helps in determining the treatment modality given to the patient. However, the volume of data is very large and too complex to examine manually. Unsupervised machine learning algorithms handle unlabelled data by clustering it to reveal the underlying structure and behaviour of its patterns. Clustering microarray data examines differentially expressed genes by grouping them according to the similarity of their expression values. In this study, we aim to identify the best clustering algorithm for gene expression data across various clinical conditions. The study was carried out on three gene expression datasets: severe acute respiratory syndrome, amyotrophic lateral sclerosis and Parkinson’s disease. Differentially expressed genes were identified at three p-value thresholds (0.01, 0.05 and 0.001), and the largest number of genes was retrieved at p-value 0.05. We applied three clustering algorithms, namely hierarchical clustering, k-means clustering and fuzzy clustering, to the differentially expressed genes of the three diseases. The performance of the three algorithms was evaluated using an internal validity index, and hierarchical clustering was found to perform best for gene expression data.
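
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not name the internal validity index, so the silhouette score is assumed here. The synthetic matrix, the average linkage, the fuzzifier `m=2.0`, and the helper names `fuzzy_cmeans_labels` and `compare_clusterings` are likewise illustrative assumptions. Hierarchical and k-means clustering come from scikit-learn; fuzzy c-means is written out in NumPy because scikit-learn does not provide it.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.metrics import silhouette_score


def fuzzy_cmeans_labels(X, k, m=2.0, n_iter=100, seed=0):
    """Minimal fuzzy c-means (hypothetical helper); returns hard labels
    taken from each row's highest membership."""
    rng = np.random.default_rng(seed)
    # Random initial memberships, each row summing to 1.
    U = rng.dirichlet(np.ones(k), size=X.shape[0])
    for _ in range(n_iter):
        W = U ** m
        # Membership-weighted cluster centers, shape (k, n_features).
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance of every row to every center; epsilon avoids division by zero.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-12
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return U.argmax(axis=1)


def compare_clusterings(X, k):
    """Score three clusterings of the rows of X with the silhouette index
    (assumed stand-in for the study's internal validity index)."""
    labelings = {
        "hierarchical": AgglomerativeClustering(n_clusters=k, linkage="average").fit_predict(X),
        "k-means": KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X),
        "fuzzy c-means": fuzzy_cmeans_labels(X, k),
    }
    return {name: float(silhouette_score(X, labels)) for name, labels in labelings.items()}


# Synthetic stand-in for an expression matrix: rows are genes, columns are samples.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(30, 6)) for c in (-3.0, 0.0, 3.0)])
scores = compare_clusterings(X, k=3)
for name, score in scores.items():
    print(f"{name}: silhouette = {score:.3f}")
```

A higher silhouette score indicates tighter, better-separated clusters. On real expression data the ranking of the three methods would depend on the dataset, the distance measure, and the number of clusters chosen.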

Subjects

Subjects :
General Medicine

Details

ISSN :
2249-4626 and 0975-5896
Database :
OpenAIRE
Journal :
Global Journal of Science Frontier Research
Accession number :
edsair.doi...........f20cf1a9890dcd89a3df50635e2766dd