Back to Search Start Over

An automatic tool to analyze and cluster macromolecular conformations based on self-organizing maps.

Authors :
Bouvier G
Desdouits N
Ferber M
Blondel A
Nilges M
Source :
Bioinformatics (Oxford, England) [Bioinformatics] 2015 May 01; Vol. 31 (9), pp. 1490-2. Date of Electronic Publication: 2014 Dec 26.
Publication Year :
2015

Abstract

Motivation: Sampling the conformational space of biological macromolecules generates large sets of data with considerable complexity. Data-mining techniques, such as clustering, can extract meaningful information. Among them, the self-organizing maps (SOMs) algorithm has shown great promise; in particular since its computation time rises only linearly with the size of the data set. Whereas SOMs are generally used with few neurons, we investigate here their behavior with large numbers of neurons.<br />Results: We present here a python library implementing the full SOM analysis workflow. Large SOMs can readily be applied on heavy data sets. Coupled with visualization tools they have very interesting properties. Descriptors for each conformation of a trajectory are calculated and mapped onto a 3D landscape, the U-matrix, reporting the distance between neighboring neurons. To delineate clusters, we developed the flooding algorithm, which hierarchically identifies local basins of the U-matrix from the global minimum to the maximum.<br />Availability and Implementation: The python implementation of the SOM library is freely available on github: https://github.com/bougui505/SOM.<br />Contact: michael.nilges@pasteur.fr or guillaume.bouvier@pasteur.fr<br />Supplementary Information: Supplementary data are available at Bioinformatics online.<br /> (© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1367-4811
Volume :
31
Issue :
9
Database :
MEDLINE
Journal :
Bioinformatics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
25543048
Full Text :
https://doi.org/10.1093/bioinformatics/btu849