Abstract

Air pollution, specifically PM2.5, has become a significant global concern owing to its detrimental impacts on public health. Even so, the high-resolution monitoring of air pollution is still a challenge on a global scale. To cope with this, machine learning (ML) techniques have been utilized to infer the concentration of air pollutants at a fine scale. In this study, we propose DeepAir, a learning framework for estimating PM2.5 concentrations at a fine scale with sparsely distributed observations. DeepAir integrates a pre-trained convolutional neural network with the LightGBM method. This framework estimates the PM2.5 concentration of a given patch, utilizing a synergy of geographical information, meteorological conditions, and satellite observations. We select California as the focal region and train the model with data from 2014 to 2017 provided by 130 PM2.5 observation stations in the state. Upon training, the model can be applied to estimate the daily PM2.5 concentrations at 1 km resolution across California. Our methodology meticulously incorporates meteorological variables, with a particular emphasis on wildfire propagation, and contemplates the complex interplay of various features. To ascertain the efficacy of our model, we employ the 10-fold cross-validation technique, which confirms that our model surpasses traditional ML and standalone deep learning methods.

Details

Title
DeepAir: deep learning and satellite imagery to estimate high-resolution PM2.5 at scale
Author
Guo, Wenxuan 1   VIAFID ORCID Logo  ; Hu, Zhaoping 1 ; Ling, Jin 2 ; Xu, Yanyan 3   VIAFID ORCID Logo  ; Gonzalez, Marta C 4 

 MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University , Shanghai 200240, People’s Republic of China 
 Energy Technologies Area , Lawrence Berkeley National Laboratory, Berkeley, CA 94720, United States of America 
 MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University , Shanghai 200240, People’s Republic of China; Energy Technologies Area , Lawrence Berkeley National Laboratory, Berkeley, CA 94720, United States of America; Department of City and Regional Planning, University of California , Berkeley, CA 94720, United States of America 
 Energy Technologies Area , Lawrence Berkeley National Laboratory, Berkeley, CA 94720, United States of America; Department of City and Regional Planning, University of California , Berkeley, CA 94720, United States of America; Department of Civil and Environmental Engineering, University of California , Berkeley, CA 94720, United States of America 
First page
015057
Publication year
2025
Publication date
Mar 2025
Publisher
IOP Publishing
e-ISSN
26322153
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3174084418
Copyright
© 2025 The Author(s). Published by IOP Publishing Ltd. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.