Abstract

Although several large knowledge graphs have been proposed in the scholarly field, such graphs are limited with respect to several data quality dimensions such as accuracy and coverage. In this article, we present methods for enhancing the Microsoft Academic Knowledge Graph (MAKG), a recently published large-scale knowledge graph containing metadata about scientific publications and associated authors, venues, and affiliations. Based on a qualitative analysis of the MAKG, we address three aspects. First, we adopt and evaluate unsupervised approaches for large-scale author name disambiguation. Second, we develop and evaluate methods for tagging publications by their discipline and by keywords, facilitating enhanced search and recommendation of publications and associated entities. Third, we compute and evaluate embeddings for all 239 million publications, 243 million authors, 49,000 journals, and 16,000 conference entities in the MAKG based on several state-of-the-art embedding techniques. Finally, we provide statistics for the updated MAKG. Our final MAKG is publicly available at https://makg.org and can be used for the search or recommendation of scholarly entities, as well as enhanced scientific impact quantification.

Details

Title
The Microsoft Academic Knowledge Graph enhanced: Author name disambiguation, publication classification, and embeddings
Author
Färber, Michael  VIAFID ORCID Logo  ; Ao, Lin  VIAFID ORCID Logo 
Pages
51-98
Section
Research Articles
Publication year
2022
Publication date
Winter 2022
Publisher
MIT Press Journals, The
e-ISSN
26413337
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2893948350
Copyright
© 2022. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.