Open Access iconOpen Access

ARTICLE

A Novel Reduced Error Pruning Tree Forest with Time-Based Missing Data Imputation (REPTF-TMDI) for Traffic Flow Prediction

Yunus Dogan1, Goksu Tuysuzoglu1, Elife Ozturk Kiyak2, Bita Ghasemkhani3, Kokten Ulas Birant1,4, Semih Utku1, Derya Birant1,*

1 Department of Computer Engineering, Dokuz Eylul University, Izmir, 35390, Turkey
2 Independent Researcher, Izmir, 35140, Turkey
3 Graduate School of Natural and Applied Sciences, Dokuz Eylul University, Izmir, 35390, Turkey
4 Information Technologies Research and Application Center (DEBTAM), Dokuz Eylul University, Izmir, 35390, Turkey

* Corresponding Author: Derya Birant. Email: email

Computer Modeling in Engineering & Sciences 2025, 144(2), 1677-1715. https://doi.org/10.32604/cmes.2025.069255

Abstract

Accurate traffic flow prediction (TFP) is vital for efficient and sustainable transportation management and the development of intelligent traffic systems. However, missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision. This study introduces REPTF-TMDI, a novel method that combines a Reduced Error Pruning Tree Forest (REPTree Forest) with a newly proposed Time-based Missing Data Imputation (TMDI) approach. The REPTree Forest, an ensemble learning approach, is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urban mobility solutions. Meanwhile, the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered. The proposed method was evaluated using hourly traffic flow data from a major U.S. roadway spanning 2012–2018, incorporating temporal features (e.g., hour, day, month, year, weekday), holiday indicator, and weather conditions (temperature, rain, snow, and cloud coverage). Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76% improvement in terms of correlation coefficient (R). Furthermore, REPTree Forest achieved improvements of 68.62% in RMSE and 70.52% in MAE compared to existing state-of-the-art models. These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy, even in the presence of missing data, thereby contributing to the broader objectives of sustainable urban transportation systems.

Keywords

Machine learning; traffic flow prediction; missing data imputation; reduced error pruning tree (REPTree); sustainable transportation systems; traffic management; artificial intelligence

Cite This Article

APA Style
Dogan, Y., Tuysuzoglu, G., Kiyak, E.O., Ghasemkhani, B., Birant, K.U. et al. (2025). A Novel Reduced Error Pruning Tree Forest with Time-Based Missing Data Imputation (REPTF-TMDI) for Traffic Flow Prediction. Computer Modeling in Engineering & Sciences, 144(2), 1677–1715. https://doi.org/10.32604/cmes.2025.069255
Vancouver Style
Dogan Y, Tuysuzoglu G, Kiyak EO, Ghasemkhani B, Birant KU, Utku S, et al. A Novel Reduced Error Pruning Tree Forest with Time-Based Missing Data Imputation (REPTF-TMDI) for Traffic Flow Prediction. Comput Model Eng Sci. 2025;144(2):1677–1715. https://doi.org/10.32604/cmes.2025.069255
IEEE Style
Y. Dogan et al., “A Novel Reduced Error Pruning Tree Forest with Time-Based Missing Data Imputation (REPTF-TMDI) for Traffic Flow Prediction,” Comput. Model. Eng. Sci., vol. 144, no. 2, pp. 1677–1715, 2025. https://doi.org/10.32604/cmes.2025.069255



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 8450

    View

  • 7890

    Download

  • 0

    Like

Share Link