Full text

Turn on search term navigation

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In the era of Big Data, integrating information from multiple sources has proven valuable in various fields. To ensure a high-quality supply of multi-source data, repairing different types of errors in the multi-source data becomes critical. This paper categorizes errors in multi-source data into entity information overlapping, attribute value conflicts, and attribute value inconsistencies. We first summarize existing repairing methods for these errors and then examine and review the study of the detection and repair of compound-type errors in multi-source data. Finally, we indicate further research directions in multi-source data repair.

Details

Title
Multi-Source Data Repairing: A Comprehensive Survey
Author
Chen, Ye 1   VIAFID ORCID Logo  ; Duan, Haoyang 2 ; Zhang, Hengtong 3 ; Zhang, Hua 2 ; Wang, Hongzhi 4   VIAFID ORCID Logo  ; Dai, Guojun 2 

 Hangzhou Dianzi University, Hangzhou 310018, China; Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China; Jubang Group Co., Ltd., Yueqing 325600, China 
 Hangzhou Dianzi University, Hangzhou 310018, China 
 Tencent AI Lab, Shenzhen 518054, China 
 Harbin Institute of Technology, Harbin 150001, China 
First page
2314
Publication year
2023
Publication date
2023
Publisher
MDPI AG
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2819475592
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.