Full text

Turn on search term navigation

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Google Scholar has been a widely used platform for academic performance evaluation and citation analysis. The issue about the mis-configuration of author profiles may seriously damage the reliability of the data, and thus affect the accuracy of analysis. Therefore, it is important to detect the mis-configured author profiles. Dealing with this issue is challenging because the scale of the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar’s author profiles in the field of computer science and compare the mis-configured author profiles with the reliable ones. Then, we propose an integrated model that utilizes machine learning and node embedding to automatically detect mis-configured author profiles. Additionally, we conduct two application case studies based on the data of Google Scholar, i.e., outstanding scholar searching and university ranking, to demonstrate how the improved dataset after filtering out the mis-configured author profiles will change the results. The two case studies validate the importance and meaningfulness of the detection of mis-configured author profiles.

Details

Title
Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning
Author
Tang, Jiaxin 1 ; Chen, Yang 1   VIAFID ORCID Logo  ; She, Guozhen 1 ; Xu, Yang 2 ; Sha, Kewei 3 ; Wang, Xin 1 ; Wang, Yi 4 ; Zhang, Zhenhua 5 ; Pan, Hui 6 

 School of Computer Science, Fudan University, Shanghai 200433, China; [email protected] (J.T.); [email protected] (G.S.); [email protected] (Y.X.); [email protected] (X.W.); Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai 200433, China 
 School of Computer Science, Fudan University, Shanghai 200433, China; [email protected] (J.T.); [email protected] (G.S.); [email protected] (Y.X.); [email protected] (X.W.) 
 Department of Computing Sciences, University of Houston-Clear Lake, Houston, TX 77058, USA; [email protected] 
 Peng Cheng Laboratory, Shenzhen 518055, China; [email protected]; Institute of Future Networks, Southern University of Science and Technology, Shenzhen 518055, China 
 Meituan, Beijing 100102, China; [email protected] 
 Department of Computer Science, University of Helsinki, 00014 Helsinki, Finland; [email protected]; Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong 
First page
6912
Publication year
2021
Publication date
2021
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2558625308
Copyright
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.