Full text

Turn on search term navigation

© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Anonymization has the potential to foster the sharing of medical data. State-of-the-art methods use mathematical models to modify data to reduce privacy risks. However, the degree of protection must be balanced against the impact on statistical properties. We studied an extreme case of this trade-off: the statistical validity of an open medical dataset based on the German National Pandemic Cohort Network (NAPKON), which was prepared for publication using a strong anonymization procedure. Descriptive statistics and results of regression analyses were compared before and after anonymization of multiple variants of the original dataset. Despite significant differences in value distributions, the statistical bias was found to be small in all cases. In the regression analyses, the median absolute deviations of the estimated adjusted odds ratios for different sample sizes ranged from 0.01 [minimum = 0, maximum = 0.58] to 0.52 [minimum = 0.25, maximum = 0.91]. Disproportionate impact on the statistical properties of data is a common argument against the use of anonymization. Our analysis demonstrates that anonymization can actually preserve validity of statistical results in relatively low-dimensional data.

Details

Title
Statistical biases due to anonymization evaluated in an open clinical dataset from COVID-19 patients
Author
Koll, Carolin E. M. 1 ; Pütz, Sina M. 1 ; Meurers, Thierry 2 ; Lee, Chin Huang 1 ; Kohls, Mirjam 3 ; Stellbrink, Christoph 4 ; Thibeault, Charlotte 5 ; Reinke, Lennart 6 ; Steinbrecher, Sarah 5 ; Schreiber, Stefan 6 ; Mitrov, Lazar 1 ; Frank, Sandra 7 ; Miljukov, Olga 3 ; Erber, Johanna 8 ; Hellmuth, Johannes C. 9   VIAFID ORCID Logo  ; Reese, Jens-Peter 3 ; Steinbeis, Fridolin 5 ; Bahmer, Thomas 10 ; Hagen, Marina 11 ; Meybohm, Patrick 12 ; Hansch, Stefan 13 ; Vadász, István 14 ; Krist, Lilian 15 ; Jiru-Hillmann, Steffi 3 ; Prasser, Fabian 2 ; Vehreschild, Jörg Janne 16 ; Witzke, O.

 University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany (ROR: https://ror.org/00rcxh774) (GRID: grid.6190.e) (ISNI: 0000 0000 8580 3777) 
 Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany (ROR: https://ror.org/0493xsw21) (GRID: grid.484013.a) (ISNI: 0000 0004 6879 971X) 
 University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany (ROR: https://ror.org/00fbnyb24) (GRID: grid.8379.5) (ISNI: 0000 0001 1958 8658) 
 Department of Cardiology and Intensive Care Medicine, Bielefeld Medical Centre, Medical Faculty OWL, University of Bielefeld, Bielefeld, Germany (ROR: https://ror.org/02hpadn98) (GRID: grid.7491.b) (ISNI: 0000 0001 0944 9128) 
 Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany (ROR: https://ror.org/001w7jn25) (GRID: grid.6363.0) (ISNI: 0000 0001 2218 4662) 
 Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany (ROR: https://ror.org/01tvm6f46) (GRID: grid.412468.d) (ISNI: 0000 0004 0646 2097) 
 Department of Anesthesiology, University Hospital of Ludwig-Maximilians-University (LMU), Munich, Germany (ROR: https://ror.org/05591te55) (GRID: grid.5252.0) (ISNI: 0000 0004 1936 973X); Department of Medicine III, University Hospital, LMU Munich, Munich, Germany (ROR: https://ror.org/05591te55) (GRID: grid.5252.0) (ISNI: 0000 0004 1936 973X) 
 Technical University of Munich, School of Medicine, University Hospital rechts der Isar, Department of Internal Medicine II, Munich, Germany (ROR: https://ror.org/02kkvpp62) (GRID: grid.6936.a) (ISNI: 0000000123222966) 
 Department of Medicine III, University Hospital, LMU Munich, Munich, Germany (ROR: https://ror.org/05591te55) (GRID: grid.5252.0) (ISNI: 0000 0004 1936 973X); COVID-19 Registry of the LMU Munich (CORKUM), University Hospital, LMU Munich, Munich, Germany (ROR: https://ror.org/05591te55) (GRID: grid.5252.0) (ISNI: 0000 0004 1936 973X) 
10  Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany (ROR: https://ror.org/01tvm6f46) (GRID: grid.412468.d) (ISNI: 0000 0004 0646 2097); Airway Research Center North (ARCN), German Center for Lung Research (DZL), Großhansdorf, Germany (ROR: https://ror.org/03dx11k66) (GRID: grid.452624.3) 
11  Department II for Internal Medicine, Hematology/Oncology, University Hospital Frankfurt, Frankfurt am Main, Germany (ROR: https://ror.org/03f6n9m15) (GRID: grid.411088.4) (ISNI: 0000 0004 0578 8220) 
12  Department of Anaesthesiology, Intensive Care, Emergency and Pain Medicine, University Hospital Wuerzburg, Wuerzburg, Germany (ROR: https://ror.org/03pvr2g57) (GRID: grid.411760.5) (ISNI: 0000 0001 1378 7891) 
13  Department of Infection Prevention and Infectious Diseases, University Hospital Regensburg, Regensburg, Germany (ROR: https://ror.org/01226dv09) (GRID: grid.411941.8) (ISNI: 0000 0000 9194 7179) 
14  Department of Internal Medicine, Justus Liebig University, Universities of Giessen and Marburg Lung Center (UGMLC), Member of the German Center for Lung Research (DZL), Giessen, Germany (ROR: https://ror.org/045f0ws19) (GRID: grid.440517.3); The Cardio-Pulmonary Institute (CPI), Giessen, Germany (ROR: https://ror.org/04ckbty56) (GRID: grid.511808.5) 
15  Institute of Social Medicine, Epidemiology and Health Economics, Charité-Universitätsmedizin Berlin, Berlin, Germany (ROR: https://ror.org/001w7jn25) (GRID: grid.6363.0) (ISNI: 0000 0001 2218 4662) 
16  University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany (ROR: https://ror.org/00rcxh774) (GRID: grid.6190.e) (ISNI: 0000 0000 8580 3777); Department II for Internal Medicine, Hematology/Oncology, University Hospital Frankfurt, Frankfurt am Main, Germany (ROR: https://ror.org/03f6n9m15) (GRID: grid.411088.4) (ISNI: 0000 0004 0578 8220); German Centre for Infection Research (DZIF), partner site Bonn-Cologne, Cologne, Germany (ROR: https://ror.org/028s4q594) (GRID: grid.452463.2) 
Pages
776
Section
Analysis
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2756517457
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.