Back to Search Start Over

Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning

Authors :
Jiaxin Tang
Yang Chen
Guozhen She
Yang Xu
Kewei Sha
Xin Wang
Yi Wang
Zhenhua Zhang
Pan Hui
Source :
Applied Sciences, Vol 11, Iss 15, p 6912 (2021)
Publication Year :
2021
Publisher :
MDPI AG, 2021.

Abstract

Google Scholar has been a widely used platform for academic performance evaluation and citation analysis. The issue about the mis-configuration of author profiles may seriously damage the reliability of the data, and thus affect the accuracy of analysis. Therefore, it is important to detect the mis-configured author profiles. Dealing with this issue is challenging because the scale of the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar’s author profiles in the field of computer science and compare the mis-configured author profiles with the reliable ones. Then, we propose an integrated model that utilizes machine learning and node embedding to automatically detect mis-configured author profiles. Additionally, we conduct two application case studies based on the data of Google Scholar, i.e., outstanding scholar searching and university ranking, to demonstrate how the improved dataset after filtering out the mis-configured author profiles will change the results. The two case studies validate the importance and meaningfulness of the detection of mis-configured author profiles.

Details

Language :
English
ISSN :
20763417
Volume :
11
Issue :
15
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.7392c671ccc480491a07fe586b7f48c
Document Type :
article
Full Text :
https://doi.org/10.3390/app11156912