Back to Search Start Over

Information Theory and Voting Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures

Authors :
Faisal Saeed
Ammar Abdo
Naomie Salim
Source :
Molecular Informatics. 32:591-598
Publication Year :
2013
Publisher :
Wiley, 2013.

Abstract

Many consensus clustering methods have been applied in different areas such as pattern recognition, machine learning, information theory and bioinformatics. However, few methods have been used for chemical compounds clustering. In this paper, an information theory and voting based algorithm (Adaptive Cumulative Voting-based Aggregation Algorithm A-CVAA) was examined for combining multiple clusterings of chemical structures. The effectiveness of clusterings was evaluated based on the ability of the clustering method to separate active from inactive molecules in each cluster, and the results were compared with Ward's method. The chemical dataset MDL Drug Data Report (MDDR) and the Maximum Unbiased Validation (MUV) dataset were used. Experiments suggest that the adaptive cumulative voting-based consensus method can improve the effectiveness of combining multiple clusterings of chemical structures.

Details

ISSN :
18681743
Volume :
32
Database :
OpenAIRE
Journal :
Molecular Informatics
Accession number :
edsair.doi.dedup.....e0a6290abb40dc2c114ad5ba4c7c58b9