Back to Search Start Over

MF-SuP-pKa: Multi-fidelity modeling with subgraph pooling mechanism for pKa prediction

Authors :
Jialu Wu
Yue Wan
Zhenxing Wu
Shengyu Zhang
Dongsheng Cao
Chang-Yu Hsieh
Tingjun Hou
Source :
Acta Pharmaceutica Sinica B, Vol 13, Iss 6, Pp 2572-2584 (2023)
Publication Year :
2023
Publisher :
Elsevier, 2023.

Abstract

Acid-base dissociation constant (pKa) is a key physicochemical parameter in chemical science, especially in organic synthesis and drug discovery. Current methodologies for pKa prediction still suffer from limited applicability domain and lack of chemical insight. Here we present MF-SuP-pKa (multi-fidelity modeling with subgraph pooling for pKa prediction), a novel pKa prediction model that utilizes subgraph pooling, multi-fidelity learning and data augmentation. In our model, a knowledge-aware subgraph pooling strategy was designed to capture the local and global environments around the ionization sites for micro-pKa prediction. To overcome the scarcity of accurate pKa data, low-fidelity data (computational pKa) was used to fit the high-fidelity data (experimental pKa) through transfer learning. The final MF-SuP-pKa model was constructed by pre-training on the augmented ChEMBL data set and fine-tuning on the DataWarrior data set. Extensive evaluation on the DataWarrior data set and three benchmark data sets shows that MF-SuP-pKa achieves superior performances to the state-of-the-art pKa prediction models while requires much less high-fidelity training data. Compared with Attentive FP, MF-SuP-pKa achieves 23.83% and 20.12% improvement in terms of mean absolute error (MAE) on the acidic and basic sets, respectively.

Details

Language :
English
ISSN :
22113835
Volume :
13
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Acta Pharmaceutica Sinica B
Publication Type :
Academic Journal
Accession number :
edsdoj.0e9b2d370de74671970f23b33e9bc0d1
Document Type :
article
Full Text :
https://doi.org/10.1016/j.apsb.2022.11.010