Abstract

Type 2 diabetes mellitus (T2DM) is a prevalent health challenge faced by countries worldwide. In this study, we propose a novel large language multimodal models (LLMMs) framework incorporating multimodal data from clinical notes and laboratory results for diabetes risk prediction. We collected five years of electronic health records (EHRs) dating from 2017 to 2021 from a Taiwan hospital database. This dataset included 1,420,596 clinical notes, 387,392 laboratory results, and more than 1505 laboratory test items. Our method combined a text embedding encoder and multi-head attention layer to learn laboratory values, and utilized a deep neural network (DNN) module to merge blood features with chronic disease semantics into a latent space. In our experiments, we observed that integrating clinical notes with predictions based on textual laboratory values significantly enhanced the predictive capability of the unimodal model in the early detection of T2DM. Moreover, we achieved an area greater than 0.70 under the receiver operating characteristic curve (AUC) for new-onset T2DM prediction, demonstrating the effectiveness of leveraging textual laboratory data for training and inference in LLMs and improving the accuracy of new-onset diabetes prediction.

Details

Title
Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records
Author
Ding, Jun-En 1 ; Thao, Phan Nguyen Minh 2 ; Peng, Wen-Chih 2 ; Wang, Jian-Zhe 2 ; Chug, Chun-Cheng 2 ; Hsieh, Min-Chen 2 ; Tseng, Yun-Chien 2 ; Chen, Ling 3 ; Luo, Dongsheng 4 ; Wu, Chenwei 5 ; Wang, Chi-Te 6 ; Hsu, Chih-Ho 7 ; Chen, Yi-Tui 8 ; Chen, Pei-Fu 9 ; Liu, Feng 1 ; Hung, Fang-Ming 10 

 Stevens Institute of Technology, School of Systems and Enterprises, Hoboken, USA (GRID:grid.217309.e) (ISNI:0000 0001 2180 0654) 
 National Yang Ming Chiao Tung University, Department of Computer Science, Hsinchu City, Taiwan (GRID:grid.260539.b) (ISNI:0000 0001 2059 7017) 
 National Yang Ming Chiao Tung University, Institute of Hospital and Health Care Administration, Taipei City, Taiwan (GRID:grid.260539.b) (ISNI:0000 0001 2059 7017) 
 Florida International University, School of Computing and Information Science, Miami, USA (GRID:grid.65456.34) (ISNI:0000 0001 2110 1845) 
 University of Michigan, Electrical Engineering and Computer Science, Ann Arbor, USA (GRID:grid.214458.e) (ISNI:0000 0004 1936 7347) 
 Far Eastern Memorial Hospital, Center of Artificial Intelligence, New Taipei City, Taiwan (GRID:grid.414746.4) (ISNI:0000 0004 0604 4784) 
 Far Eastern Memorial Hospital, Department of Surgery, New Taipei City, Taiwan (GRID:grid.414746.4) (ISNI:0000 0004 0604 4784) 
 National Taipei University of Nursing and Health Sciences, Smart Healthcare Interdisciplinary College, Taipei City, Taiwan (GRID:grid.412146.4) (ISNI:0000 0004 0573 0416) 
 Far Eastern Memorial Hospital, Department of Anesthesiology, New Taipei City, Taiwan (GRID:grid.414746.4) (ISNI:0000 0004 0604 4784); Yuan Ze University, Department of Electrical Engineering, Taoyuan, Taiwan (GRID:grid.413050.3) (ISNI:0000 0004 1770 3669) 
10  Far Eastern Memorial Hospital, Surgical Trauma Intensive Care Unit, New Taipei City, Taiwan (GRID:grid.414746.4) (ISNI:0000 0004 0604 4784); National Taipei University of Nursing and Health Sciences, Smart Healthcare Interdisciplinary College, Taipei City, Taiwan (GRID:grid.412146.4) (ISNI:0000 0004 0573 0416) 
Pages
20774
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3101008237
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.