Full text

Turn on search term navigation

© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Mental health is a major global concern, yet findings remain fragmented across studies and databases, hindering integrative understanding and clinical translation. To address this gap, we present the Mental Disorders Knowledge Graph (MDKG)—a large-scale, contextualized knowledge graph built using large language models to unify evidence from biomedical literature and curated databases. MDKG comprises over 10 million relations, including nearly 1 million novel associations absent from existing resources. By structurally encoding contextual features such as conditionality, demographic factors, and co-occurring clinical attributes, the graph enables more nuanced interpretation and rapid expert validation, reducing evaluation time by up to 70%. Applied to predictive modeling in the UK Biobank, MDKG-enhanced representations yielded significant gains in predictive performance across multiple mental disorders. As a scalable and semantically enriched resource, MDKG offers a powerful foundation for accelerating psychiatric research and enabling interpretable, data-driven clinical insights.

Understanding the pathophysiological pathways of mental disorders and identifying reliable biomarkers remain challenging. This study introduces a large-scale knowledge graph tailored to mental disorders to improve knowledge discovery, disease prediction, and clinical validation

Details

Title
Large language model powered knowledge graph construction for mental health exploration
Author
Gao, Shan 1 ; Yu, Kaixian 2 ; Yang, Yue 3 ; Yu, Sheng 4   VIAFID ORCID Logo  ; Shi, Chenglong 5 ; Wang, Xueqin 6   VIAFID ORCID Logo  ; Tang, Niansheng 1 ; Zhu, Hongtu 7   VIAFID ORCID Logo 

 Yunnan Key Laboratory of Statistical Modeling and Data Analysis, Yunnan University, Kunming, Yunnan, China (ROR: https://ror.org/0040axw97) (GRID: grid.440773.3) (ISNI: 0000 0000 9342 2456) 
 Insilicom LLC, Tallahassee, FL, USA (GRID: grid.524822.b) 
 Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208) 
 Department of Statistics and Data Science, Tsinghua University, Beijing, China (ROR: https://ror.org/03cve4549) (GRID: grid.12527.33) (ISNI: 0000 0001 0662 3178) 
 The Second Affiliated Hospital of Kunming Medical University, Kunming Medical University, Kunming, Yunnan, China (ROR: https://ror.org/038c3w259) (GRID: grid.285847.4) (ISNI: 0000 0000 9588 0960) 
 School of Management, University of Science and Technology of China, Hefei, Anhui, China (ROR: https://ror.org/04c4dkn09) (GRID: grid.59053.3a) (ISNI: 0000000121679639) 
 Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208); Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208); Biomedical Research Imaging Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208); Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208); Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (ROR: https://ror.org/0130frc33) (GRID: grid.10698.36) (ISNI: 0000 0001 2248 3208) 
Pages
7526
Section
Article
Publication year
2025
Publication date
2025
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3239225211
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.