Content area
As artificial intelligence (AI) increasingly integrates into scientific research, explainability has become a cornerstone for ensuring reliability and innovation in discovery processes. This review offers a forward-looking integration of explainable AI (XAI)-based research paradigms, encompassing small domain-specific models, large language models (LLMs), and agent-based large-small model collaboration. For domain-specific models, we introduce a knowledge-oriented taxonomy categorizing methods into knowledge-agnostic, knowledge-based, knowledge-infused, and knowledge-verified approaches, emphasizing the balance between domain knowledge and innovative insights. For LLMs, we examine three strategies for integrating domain knowledge—prompt engineering, retrieval-augmented generation, and supervised fine-tuning—along with advances in explainability, including local, global, and conversation-based explanations. We also envision future agent-based model collaborations within automated laboratories, stressing the need for context-aware explanations tailored to research goals. Additionally, we discuss the unique characteristics and limitations of both explainable small domain-specific models and LLMs in the realm of scientific discovery. Finally, we highlight methodological challenges, potential pitfalls, and the necessity of rigorous validation to ensure XAI’s transformative role in accelerating scientific discovery and reshaping research paradigms.
Details
Reliability;
Knowledge;
Classification;
Discovery;
Collaboration;
Research;
Methodological problems;
Models;
Taxonomy;
Adoption of innovations;
Prompt engineering;
Laboratories;
Artificial intelligence;
Validity;
Hypotheses;
Innovations;
Medical research;
Paradigms;
Global local relationship;
Agent-based models;
Scientists;
Agnosticism;
Retrieval;
Knowledge management;
Researchers;
Automation;
Explainable artificial intelligence;
Large language models;
Empowerment;
Language modeling
1 Westlake University, School of Engineering, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315); Westlake Institute for Advanced Study, Institute of Advanced Technology, Hangzhou, China (GRID:grid.511490.8)
2 Westlake University, School of Engineering, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315)
3 University College London, SpaceTimeLab, Department of Civil, Environmental and Geomatic Engineering, London, UK (GRID:grid.83440.3b) (ISNI:0000 0001 2190 1201)
4 University College Dublin, Centre for BioNano Interactions, School of Chemistry, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743); UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, School of Biomolecular and Biomedical Science, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743)
5 University College Dublin, Centre for BioNano Interactions, School of Chemistry, Dublin 4, Ireland (GRID:grid.7886.1) (ISNI:0000 0001 0768 2743)
6 National University of Singapore, Department of Chemistry, Singapore, Singapore (GRID:grid.428397.3) (ISNI:0000 0004 0385 0924)