Back to Search Start Over

Smart Information Retrieval: Domain Knowledge Centric Optimization Approach

Authors :
Abduladem Aljamel
Taha Osman
Giovanni Acampora
Autilia Vitiello
Ziqi Zhang
Source :
IEEE Access, Vol 7, Pp 4167-4183 (2019)
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

In the age of the Internet of Things, online data have witnessed a significant growth in terms of volume and diversity, and research into information retrieval has become one of the important research themes in the Internet-oriented data science research. This paper introduces a novel domain knowledge centric methodology aimed at improving the accuracy of using machine learning methods for relation extraction from text data, which is critical to the accuracy and efficiency of information retrieval-based applications, including recommender systems and sentiment analysis. The proposed methodology makes a significant contribution to the processes of domain knowledge-based relation extraction including interrogating Linked Open Datasets to generate the relation classification training data, addressing the imbalanced classification in the training datasets, determining the probability threshold of the best learning algorithm, and establishing the optimum parameters for genetic algorithms, which were utilized to optimize the feature selection for the learning algorithms. The experimental evaluation of the proposed methodology reveals that the adopted machine-learning algorithms exhibit higher precision and recall in relation extraction in the reduced feature space optimized by our implementation. The considered machine learning includes support vector machine, perceptron algorithm uneven margin, and K-nearest neighbors. The outcome is verified by comparing against the random mutation hill-climbing optimization algorithm using Wilcoxon signed-rank statistical analysis.

Details

Language :
English
ISSN :
21693536
Volume :
7
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.9a0c32d98b48e8b27ed932e65bd88f
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2018.2885640