Back to Search
Start Over
Comparative Studies on Resampling Techniques in Machine Learning and Deep Learning Models for Drug-Target Interaction Prediction.
- Source :
- Molecules; Feb2023, Vol. 28 Issue 4, p1663, 22p
- Publication Year :
- 2023
-
Abstract
- The prediction of drug-target interactions (DTIs) is a vital step in drug discovery. The success of machine learning and deep learning methods in accurately predicting DTIs plays a huge role in drug discovery. However, when dealing with learning algorithms, the datasets used are usually highly dimensional and extremely imbalanced. To solve this issue, the dataset must be resampled accordingly. In this paper, we have compared several data resampling techniques to overcome class imbalance in machine learning methods as well as to study the effectiveness of deep learning methods in overcoming class imbalance in DTI prediction in terms of binary classification using ten (10) cancer-related activity classes from BindingDB. It is found that the use of Random Undersampling (RUS) in predicting DTIs severely affects the performance of a model, especially when the dataset is highly imbalanced, thus, rendering RUS unreliable. It is also found that SVM-SMOTE can be used as a go-to resampling method when paired with the Random Forest and Gaussian Naïve Bayes classifiers, whereby a high F1 score is recorded for all activity classes that are severely and moderately imbalanced. Additionally, the deep learning method called Multilayer Perceptron recorded high F1 scores for all activity classes even when no resampling method was applied. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 14203049
- Volume :
- 28
- Issue :
- 4
- Database :
- Complementary Index
- Journal :
- Molecules
- Publication Type :
- Academic Journal
- Accession number :
- 162378125
- Full Text :
- https://doi.org/10.3390/molecules28041663