Chemical structure-aware molecular image representation learning.
- Source :
- Briefings in Bioinformatics; Nov2023, Vol. 24 Issue 6, p1-10, 10p
- Publication Year :
- 2023
Abstract
- Current methods of molecular image-based drug discovery face two major challenges: (1) working effectively in the absence of labels, and (2) capturing chemical structure from images, which encode it only implicitly. Given that chemical structures (such as nitrogen atoms, benzene rings and double bonds) are explicitly encoded by molecular graphs, we leverage self-supervised contrastive learning to transfer chemical knowledge from graphs to images. Specifically, we propose a novel Contrastive Graph-Image Pre-training (CGIP) framework for molecular representation learning, which learns explicit information in graphs and implicit information in images from large-scale unlabeled molecules via carefully designed intra- and inter-modal contrastive learning. We evaluate CGIP in multiple experimental settings (molecular property prediction, cross-modal retrieval and distribution similarity). The results show that CGIP achieves state-of-the-art performance on all 12 benchmark datasets and that it transfers chemical knowledge from graphs to molecular images, enabling the image encoder to perceive chemical structures in images. We hope this simple and effective framework will inspire people to think about the value of images for molecular representation learning. [ABSTRACT FROM AUTHOR]
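The abstract does not give the form of the inter-modal contrastive objective. As a rough illustration only, the sketch below uses a symmetric InfoNCE-style loss, a common choice for aligning two modalities, and should not be read as the CGIP loss itself. The function names, the temperature of 0.1, and the random embeddings are all assumptions made for the example. Row i of the graph embeddings and row i of the image embeddings are taken to describe the same molecule.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    """Mean of -log softmax(row i)[i]: the positive pair sits on the diagonal."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the row maximum for numerical stability
        log_denominator = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_denominator - row[i]
    return total / len(logits)


def inter_modal_loss(graph_embs, image_embs, temperature=0.1):
    """Symmetric InfoNCE-style loss (assumed form, not taken from the paper)."""
    g = [normalize(v) for v in graph_embs]
    im = [normalize(v) for v in image_embs]
    # Cosine similarity between every graph and every image, scaled by temperature.
    logits = [[sum(a * b for a, b in zip(gv, iv)) / temperature for iv in im]
              for gv in g]
    transposed = [list(col) for col in zip(*logits)]
    # Average the graph-to-image and image-to-graph directions.
    return 0.5 * (cross_entropy_diagonal(logits) + cross_entropy_diagonal(transposed))


# Hypothetical data: 8 molecules with 16-dimensional embeddings per modality.
random.seed(0)
graph_embs = [[random.gauss(0, 1) for _ in range(16)] for _ in range(8)]
paired = [[x + 0.01 * random.gauss(0, 1) for x in v] for v in graph_embs]
unrelated = [[random.gauss(0, 1) for _ in range(16)] for _ in range(8)]

loss_paired = inter_modal_loss(graph_embs, paired)
loss_unrelated = inter_modal_loss(graph_embs, unrelated)
print(loss_paired, loss_unrelated)
```

In this toy setup, image embeddings that closely track their graph counterparts yield a lower loss than unrelated ones. Minimizing such a loss is what would push an image encoder toward the structure captured by the graph encoder.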
Details
- Language :
- English
- ISSN :
- 14675463
- Volume :
- 24
- Issue :
- 6
- Database :
- Complementary Index
- Journal :
- Briefings in Bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 173782151
- Full Text :
- https://doi.org/10.1093/bib/bbad404