High-Throughput Screening and Prediction of Nucleophilicity of Amines Using Machine Learning and DFT Calculations.

Authors :
Li X
Zhong H
Yang H
Li L
Wang Q
Source :
Journal of chemical information and modeling [J Chem Inf Model] 2024 Aug 26; Vol. 64 (16), pp. 6361-6368. Date of Electronic Publication: 2024 Aug 08.
Publication Year :
2024

Abstract

The nucleophilic index (N_Nu) is a key parameter in the screening of amine catalysts. Amines are numerous and structurally diverse, yet only a limited number exhibit an N_Nu value exceeding 4.0 eV, which makes them potential nucleophiles in chemical reactions. To address this, we propose a computational method that quickly identifies amines with high N_Nu values by combining machine learning (ML) with high-throughput density functional theory (DFT) calculations. We began by training ML models, exploring molecular fingerprint methods, and developing quantitative structure-activity relationship (QSAR) models for well-known amines, based on N_Nu values derived from DFT calculations. Using Shapley Additive Explanation (SHAP) plots, we identified the five critical substructures that significantly affect the N_Nu values of amines. These findings were applied to generate 4920 novel hypothetical amines designed for high N_Nu values. The QSAR models were then used to predict the N_Nu values of 259 well-known and 4920 hypothetical amines, which identified five novel hypothetical amines with exceptional N_Nu values (>4.55 eV). The enhanced N_Nu values of these novel amines were validated by DFT calculations. One novel hypothetical amine, H1, exhibits an unprecedentedly high N_Nu value of 5.36 eV, surpassing the maximum value (5.35 eV) observed among well-established amines. This strategy accelerates the discovery of highly nucleophilic amines by combining ML predictions with DFT calculations.
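
The screening step described in the abstract (predict the nucleophilic index for each candidate, keep those above a cutoff, and pass the shortlist on for DFT validation) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' code: the function name `screen_candidates`, the amine labels, and the predicted values are all hypothetical placeholders, and the `predict` callable merely stands in for a trained QSAR model. Only the two cutoffs (4.0 eV and 4.55 eV) are taken from the abstract.

```python
from typing import Callable, Iterable, List, Tuple

# Cutoffs quoted in the abstract, in eV.
NUCLEOPHILE_CUTOFF_EV = 4.0    # above this, an amine is a potential nucleophile
EXCEPTIONAL_CUTOFF_EV = 4.55   # above this, the abstract calls the value exceptional


def screen_candidates(
    candidates: Iterable[str],
    predict: Callable[[str], float],
    cutoff: float = EXCEPTIONAL_CUTOFF_EV,
) -> List[Tuple[str, float]]:
    """Return (label, predicted index) pairs strictly above `cutoff`, highest first.

    `predict` is a stand-in for a trained QSAR model (hypothetical here).
    """
    scored = [(label, predict(label)) for label in candidates]
    hits = [(label, value) for label, value in scored if value > cutoff]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Placeholder predictions for illustration; these are NOT values from the paper.
toy_predictions = {"amine_A": 3.2, "amine_B": 4.6, "amine_C": 4.1, "amine_D": 5.0}

# Iterating the dict yields its keys; dict.get plays the role of the model.
shortlist = screen_candidates(toy_predictions, toy_predictions.get)
print(shortlist)
```

In a workflow of this kind, the shortlist would be the set of candidates sent on to DFT for confirmation, so the expensive calculation runs only on the few structures the model ranks highest.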

Details

Language :
English
ISSN :
1549-960X
Volume :
64
Issue :
16
Database :
MEDLINE
Journal :
Journal of chemical information and modeling
Publication Type :
Academic Journal
Accession number :
39116323
Full Text :
https://doi.org/10.1021/acs.jcim.4c00724