Abstract

Poor air quality due to large amounts of human activity shows the need to increase public awareness and alertness by building a system predicting future pollutant concentrations. This research creates a prediction system using the LightGBM algorithm for PM2.5 and CO2 pollutant parameters with an additional parameter reduction method using PCA to increase prediction accuracy. The number of valid datasets is 918 for each of the five parameters at each measurement station, with data gaps filled using median values so that they can be used for predictions. The prediction results show that the best accuracy for PM2.5 is at the Deli station, which uses PCA with a MAPE of 21.5%, and for CO2, it is achieved at the Deli station without PCA with a MAPE of 4.8%. Based on its accuracy, PCA is less suitable if there are outliers in the dataset, but PCA is ideal for homogeneous datasets. Overall, the prediction results based on accuracy for PM2.5 are in the feasible category, and for CO2, they are in the accurate and very accurate category. To optimize prediction results, especially in the long term, it is necessary to retrain with a complete and up-to-date dataset to better suit air conditions.

Details

Title
Prediction of PM2.5 and CO2 concentrations using the PCA-LightGBM method in Bandung, Indonesia
Author
Adiwidya, A S 1 ; Romadhony, A 2 ; Chandra, I 3 ; Sukmawati, A N D 1 ; Sholihah, H M 1 ; Islamiah, D U 1 ; Rinaldi, A 1 

 Engineering Physics, School of Electrical Engineering, Telkom University , Bandung, Indonesia 
 Informatics, School of Informatics, Telkom University , Bandung, Indonesia 
 Engineering Physics, School of Electrical Engineering, Telkom University , Bandung, Indonesia; Center of Excellence of Sustainable Energy and Climate Change, Telkom University , Bandung, Indonesia 
First page
012004
Publication year
2025
Publication date
Feb 2025
Publisher
IOP Publishing
ISSN
17426588
e-ISSN
17426596
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3168381117
Copyright
Published under licence by IOP Publishing Ltd. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.