Content area

Abstract

As artificial intelligence (AI) increasingly integrates into scientific research, explainability has become a cornerstone for ensuring reliability and innovation in discovery processes. This review offers a forward-looking integration of explainable AI (XAI)-based research paradigms, encompassing small domain-specific models, large language models (LLMs), and agent-based large-small model collaboration. For domain-specific models, we introduce a knowledge-oriented taxonomy categorizing methods into knowledge-agnostic, knowledge-based, knowledge-infused, and knowledge-verified approaches, emphasizing the balance between domain knowledge and innovative insights. For LLMs, we examine three strategies for integrating domain knowledge—prompt engineering, retrieval-augmented generation, and supervised fine-tuning—along with advances in explainability, including local, global, and conversation-based explanations. We also envision future agent-based model collaborations within automated laboratories, stressing the need for context-aware explanations tailored to research goals. Additionally, we discuss the unique characteristics and limitations of both explainable small domain-specific models and LLMs in the realm of scientific discovery. Finally, we highlight methodological challenges, potential pitfalls, and the necessity of rigorous validation to ensure XAI’s transformative role in accelerating scientific discovery and reshaping research paradigms.

Details

10000008
Title
Empowering scientific discovery with explainable small domain-specific and large language models
Author
Yu, Hengjie 1 ; Wang, Yizhi 2 ; Cheng, Tao 3 ; Yan, Yan 4 ; Dawson, Kenneth A. 5 ; Li, Sam F. Y. 6 ; Zheng, Yefeng 1 ; Jin, Yaochu 1 

 Westlake University, School of Engineering, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315); Westlake Institute for Advanced Study, Institute of Advanced Technology, Hangzhou, China (GRID:grid.511490.8) 
 Westlake University, School of Engineering, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315) 
 University College London, SpaceTimeLab, Department of Civil, Environmental and Geomatic Engineering, London, UK (GRID:grid.83440.3b) (ISNI:0000 0001 2190 1201) 
 University College Dublin, Centre for BioNano Interactions, School of Chemistry, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743); UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, School of Biomolecular and Biomedical Science, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743) 
 University College Dublin, Centre for BioNano Interactions, School of Chemistry, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743) 
 National University of Singapore, Department of Chemistry, Singapore, Singapore (GRID:grid.428397.3) (ISNI:0000 0004 0385 0924) 
Publication title
Volume
58
Issue
12
Pages
371
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
Dordrecht
Country of publication
Netherlands
ISSN
02692821
e-ISSN
15737462
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-10-08
Milestone dates
2025-08-14 (Registration); 2025-08-14 (Accepted)
Publication history
 
 
   First posting date
08 Oct 2025
ProQuest document ID
3258735895
Document URL
https://www.proquest.com/scholarly-journals/empowering-scientific-discovery-with-explainable/docview/3258735895/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-12-06
Database
ProQuest One Academic