Content area

Abstract

This paper explores Machine Learning (ML) parameterizations for radiative transfer in the ICOsahedral Nonhydrostatic weather and climate model (ICON) and investigates the achieved ML model speed‐up with ICON running on graphics processing units (GPUs). Five ML models, with varying complexity and size, are coupled to ICON; more specifically, a multilayer perceptron (MLP), a Unet model, a bidirectional recurrent neural network with long short‐term memory (BiLSTM), a vision transformer (ViT), and a random forest (RF) as a baseline. The ML parameterizations are coupled to the ICON code that includes OpenACC compiler directives to enable GPU support. The coupling is done with the PyTorch‐Fortran coupler developed at NVIDIA. The most accurate model is the BiLSTM with a physics‐informed normalization strategy, a penalty for the heating rates during training, a Gaussian smoothing as postprocessing and a simplified computation of the fluxes at the upper levels to ensure stability of the ICON model top. The presented setup enables stable aquaplanet simulations with ICON for several weeks at a resolution of about 80 km and compares well with the physics‐based default radiative transfer parameterization, ecRad. Our results indicate that the compute requirements of the ML models that can ensure the stability of ICON are comparable to GPU optimized classical physics parameterizations in terms of memory consumption and computational speed.

Details

1009240
Business indexing term
Title
Revisiting Machine Learning Approaches for Short‐ and Longwave Radiation Inference in Weather and Climate Models
Author
Bertoli, Guillaume 1   VIAFID ORCID Logo  ; Mohebi, Salman 2 ; Ozdemir, Firat 2   VIAFID ORCID Logo  ; Jucker, Jonas 3 ; Rüdisühli, Stefan 4   VIAFID ORCID Logo  ; Perez‐Cruz, Fernando 5 ; Salzmann, Mathieu 2 ; Schemm, Sebastian 6   VIAFID ORCID Logo 

 Institute for Atmospheric and Climate Science, ETH Zurich, Zurich, Switzerland, Learning the Earth with Artificial Intelligence and Physics Center, Columbia University, New York, NY, USA 
 Swiss Data Science Center, ETH Zurich and EPFL, Zurich, Switzerland 
 Center for Climate Systems Modeling, ETH Zurich, Zurich, Switzerland 
 Institute for Atmospheric and Climate Science, ETH Zurich, Zurich, Switzerland, Now at Meteomatics, St. Gallen, Switzerland 
 Swiss Data Science Center, ETH Zurich and EPFL, Zurich, Switzerland, Computer Science Department, ETH Zurich, Zurich, Switzerland, Now at Bank for International Settlements (BIS), Basel, Switzerland 
 Department of Applied Mathematics and Theoretical Physics, Cambridge University, Cambridge, UK 
Publication title
Volume
17
Issue
9
Number of pages
25
Publication year
2025
Publication date
Sep 1, 2025
Section
Research Article
Publisher
John Wiley & Sons, Inc.
Place of publication
Washington
Country of publication
United States
Publication subject
e-ISSN
19422466
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-09-23
Milestone dates
2025-08-08 (manuscriptRevised); 2025-09-23 (publishedOnlineFinalForm); 2025-01-17 (manuscriptReceived); 2025-08-17 (manuscriptAccepted)
Publication history
 
 
   First posting date
23 Sep 2025
ProQuest document ID
3254545794
Document URL
https://www.proquest.com/scholarly-journals/revisiting-machine-learning-approaches-short/docview/3254545794/se-2?accountid=208611
Copyright
© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-09-26
Database
ProQuest One Academic