Back to Search Start Over

When SMILES Smiles, Practicality Judgment and Yield Prediction of Chemical Reaction via Deep Chemical Language Processing

Authors :
Shu Jiang
Zhuosheng Zhang
Hai Zhao
Jiangtong Li
Yang Yang
Bao-Liang Lu
Ning Xia
Source :
IEEE Access, Vol 9, Pp 85071-85083 (2021)
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

Simplified Molecular Input Line Entry System (SMILES) provides a text-based encoding method to describe the structure of chemical species and formulize general chemical reactions. Considering that chemical reactions have been represented in a language form, we present a symbol only model to generally predict the yield of organic synthesis reaction without considering complex quantum physical modeling or chemistry knowledge. Our model is the first deep neural network application that treats chemical reaction text segments as embedding representation to the most recent deep natural language processing. Experimental results show our model can effectively predict chemical reactions, which achieves a high accuracy of 99.76% on practicality judgment and the Root Mean Square Error (RMSE) is around 0.2 for yield prediction. Our work shows the great potential for automatic yield prediction for organic reactions under general conditions and further applications in synthesis path prediction with the least modeling cost.

Details

Language :
English
ISSN :
21693536
Volume :
9
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.fc82ce5472db433cbdcbd4ea00f3310a
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2021.3083838