Content area

Abstract

Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment and accelerate research, helping scientists to generate hypotheses, design experiments, collect and interpret large datasets, and gain insights that might not have been possible using traditional scientific methods alone. Here we examine breakthroughs over the past decade that include self-supervised learning, which allows models to be trained on vast amounts of unlabelled data, and geometric deep learning, which leverages knowledge about the structure of scientific data to enhance model accuracy and efficiency. Generative AI methods can create designs, such as small-molecule drugs and proteins, by analysing diverse data modalities, including images and sequences. We discuss how these methods can help scientists throughout the scientific process and the central issues that remain despite such advances. Both developers and users of AI toolsneed a better understanding of when such approaches need improvement, and challenges posed by poor data quality and stewardship remain. These issues cut across scientific disciplines and require developing foundational algorithmic approaches that can contribute to scientific understanding or acquire it autonomously, making them critical areas of focus for AI innovation.

Details

Title
Scientific discovery in the age of artificial intelligence
Author
Wang, Hanchen 1 ; Fu, Tianfan 2 ; Du, Yuanqi 3 ; Gao, Wenhao 4 ; Huang, Kexin 5 ; Liu, Ziming; Chandak, Payal; Liu, Shengchao; Van Katwyk, Peter; Deac, Andreea; Anandkumar, Anima; Bergen, Karianne; Gomes, Carla P; Ho, Shirley; Kohli, Pushmeet; Lasenby, Joan; Leskovec, Jure; Liu, Tie-Yan; Manrai, Arjun; Marks, Debora; Ramsundar, Bharath; Song, Le; Sun, Jimeng; Tang, Jian; Veli&ccaronković, Petar; Welling, Max; Zhang, Linfeng; Coley, Connor W; Bengio, Yoshua; Zitnik, Marinka

 Department of Engineering, University of Cambridge, Cambridge, UK. Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA Present address: Department of Research and Early Development, Genentech Inc, South San Francisco, CA, USA. Present address: Department of Computer Science, Stanford University, Stanford, CA, USA 
 Department of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, USA 
 Department of Computer Science, Cornell University, Ithaca, NY, USA 
 Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA 
 Department of Computer Science, Stanford University, Stanford, CA, USA 
Pages
47-60
Section
Review
Publication year
2023
Publication date
Aug 3, 2023
Publisher
Nature Publishing Group
ISSN
00280836
e-ISSN
14764687
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2846299618
Copyright
Copyright Nature Publishing Group Aug 3, 2023