1. Deep learning-enhanced drug discovery: innovative molecule clustering and interaction prediction through graph analysis
- Author
- Akcora, Cuneyt (Computer Science), Cardona, Silvia (Microbiology), Hu, Pingzhao, Leung, Carson, and Hadipour, Hamid
- Abstract
Motivation: Efficient drug discovery requires an approach that integrates molecular feature analysis with accurate compound-protein interaction (CPI) prediction. This study introduces models that combine deep learning (DL) techniques for molecular feature engineering with new CPI prediction methods. This integration addresses the need for detailed analysis of molecular datasets and for predicting interactions between novel compounds and proteins, thereby enhancing drug discovery.
Methods and Results:
Chapter 3 - Molecular Clustering and Feature Analysis: The framework implements a feature engineering scheme focused on molecule-specific atomic and bonding information. It uses principal component analysis (PCA) to encode this information and a variational autoencoder (VAE)-based method to embed both global chemical properties and local features. This approach enabled the clustering of a large dataset of over 47,000 molecules. Applying the K-means method to the VAE embeddings, with an embedding size of 32, identified 50 distinct molecular clusters. These clusters were visualized with t-distributed Stochastic Neighbor Embedding (t-SNE), showing that the framework groups molecules effectively based on their complex features.
Chapter 4 - CPI Prediction with GraphBAN: For CPI prediction, the study introduces GraphBAN, a novel inductive approach based on graph knowledge distillation (KD). It incorporates a deep bilinear attention network (BAN) and a KD module for graph analysis, enabling the alignment of interaction features across different distributions. GraphBAN supports both transductive and inductive link prediction in a bipartite graph of CPIs. On three benchmark datasets, GraphBAN outperformed six baseline models. The results show that it can predict interactions between unseen compounds a
- Published
- 2023
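
To illustrate the clustering step described in the abstract (K-means over 32-dimensional embeddings, yielding 50 clusters), here is a minimal sketch. It is not the thesis implementation: it assumes the VAE embeddings are already available as a NumPy array, and random vectors stand in for them. The function name `kmeans`, its parameters, and the sample size of 2,000 are hypothetical choices made for this example. The t-SNE visualization is omitted.

```python
import numpy as np


def kmeans(embeddings, n_clusters, n_iters=50, seed=0):
    """Plain Lloyd's algorithm (hypothetical helper); returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from randomly chosen, distinct data points.
    idx = rng.choice(len(embeddings), size=n_clusters, replace=False)
    centroids = embeddings[idx].copy()
    labels = np.zeros(len(embeddings), dtype=int)
    for it in range(n_iters):
        # Squared Euclidean distance from every point to every centroid.
        diffs = embeddings[:, None, :] - centroids[None, :, :]
        dists = (diffs ** 2).sum(axis=2)
        new_labels = dists.argmin(axis=1)
        # Stop once the assignments no longer change.
        if it > 0 and np.array_equal(new_labels, labels):
            break
        labels = new_labels
        for k in range(n_clusters):
            members = embeddings[labels == k]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[k] = members.mean(axis=0)
    return labels, centroids


# Assumption: random 32-dimensional vectors stand in for the VAE embeddings.
rng = np.random.default_rng(42)
embeddings = rng.normal(size=(2000, 32))
labels, centroids = kmeans(embeddings, n_clusters=50)
```

Each entry of `labels` is the cluster index assigned to the corresponding embedding, and `centroids` holds one 32-dimensional center per cluster. In practice a library implementation with a better initialisation scheme would likely be preferable; the loop above only shows the assign-then-update structure of the method.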