Abstract

The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework transforms the data into a new space where it is imperceptible to a manual or computational inspection. However, encoded data should preserve the semantics of the original data such that deep learning models can be trained effectively. This paper hypothesizes the characteristics of the desired encoding framework and then exploits random projections and random quantum encoding to realize this framework for dense and longitudinal or time-series data. Experimental evaluation highlights that models trained on encoded time-series data effectively uphold the information bottleneck principle and hence, exhibit lesser information leakage from trained models.

Healthcare data democratization is often hampered by privacy constraints governing the sensitive healthcare data. Here, the authors show that encoding healthcare data could be a potential solution for achieving healthcare democratization within the context of deep learning.

Details

Title
Data encoding for healthcare data democratization and information leakage prevention
Author
Thakur, Anshul 1   VIAFID ORCID Logo  ; Zhu, Tingting 1   VIAFID ORCID Logo  ; Abrol, Vinayak 2 ; Armstrong, Jacob 1 ; Wang, Yujiang 3   VIAFID ORCID Logo  ; Clifton, David A. 3 

 University of Oxford, Department of Engineering Science, Oxfordshire, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948) 
 IIIT Delhi, Infosys Centre for AI, Delhi, India (GRID:grid.417967.a) (ISNI:0000 0004 0558 8755) 
 University of Oxford, Department of Engineering Science, Oxfordshire, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948); Oxford Suzhou Centre for Advanced Research, Suzhou, China (GRID:grid.4991.5) 
Pages
1582
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2929306653
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.