Abstract

Accurate traffic flow prediction (TFP) is vital for efficient and sustainable transportation management and the development of intelligent traffic systems. However, missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision. This study introduces REPTF-TMDI, a novel method that combines a Reduced Error Pruning Tree Forest (REPTree Forest) with a newly proposed Time-based Missing Data Imputation (TMDI) approach. The REPTree Forest, an ensemble learning approach, is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urban mobility solutions. Meanwhile, the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered. The proposed method was evaluated using hourly traffic flow data from a major U.S. roadway spanning 2012–2018, incorporating temporal features (e.g., hour, day, month, year, weekday), holiday indicator, and weather conditions (temperature, rain, snow, and cloud coverage). Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76% improvement in terms of correlation coefficient (R). Furthermore, REPTree Forest achieved improvements of 68.62% in RMSE and 70.52% in MAE compared to existing state-of-the-art models. These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy, even in the presence of missing data, thereby contributing to the broader objectives of sustainable urban transportation systems.

Details

Title
A Novel Reduced Error Pruning Tree Forest with Time-Based Missing Data Imputation (REPTF-TMDI) for Traffic Flow Prediction
Author
Dogan, Yunus 1 ; Tuysuzoglu, Goksu 1 ; Elife Ozturk Kiyak 2 ; Ghasemkhani, Bita 3 ; Kokten, Ulas Birant 4 ; Utku, Semih 1 ; Birant, Derya 1 

 Department of Computer Engineering, Dokuz Eylul University, Izmir, 35390, Turkey 
 Independent Researcher, Izmir, 35140, Turkey 
 Graduate School of Natural and Applied Sciences, Dokuz Eylul University, Izmir, 35390, Turkey 
 Department of Computer Engineering, Dokuz Eylul University, Izmir, 35390, Turkey, Information Technologies Research and Application Center (DEBTAM), Dokuz Eylul University, Izmir, 35390, Turkey 
Pages
1677-1715
Section
ARTICLE
Publication year
2025
Publication date
2025
Publisher
Tech Science Press
ISSN
1526-1492
e-ISSN
1526-1506
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3246599195
Copyright
© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.