Home / Journals / CMES / Online First / doi:10.32604/cmes.2025.073513
Special Issues
Table of Content

Open Access

ARTICLE

XGBoost-Based Active Learning for Wildfire Risk Prediction

Hongrong Wang1,2, Hang Geng1,*, Jing Yuan1, Wen Zhang2, Hanmin Sheng1, Qiuhua Wang3, Xinjian Li4,5
1 School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
2 Sichuan Provincial Institute of Forest and Grassland Survey and Planning, Chengdu, 610081, China
3 Southwest Forestry University, Kunming, 650224, China
4 Guangxi University, Nanning, 530004, China
5 Guangxi Institute of Industrial Technology for Space-Time Information Co., Ltd., Nanning, 530201, China
* Corresponding Author: Hang Geng. Email: email
(This article belongs to the Special Issue: Incomplete Data Test, Analysis and Fusion Under Complex Environments)

Computer Modeling in Engineering & Sciences https://doi.org/10.32604/cmes.2025.073513

Received 19 September 2025; Accepted 15 October 2025; Published online 10 November 2025

Abstract

Machine learning has emerged as a key approach in wildfire risk prediction research. However, in practical applications, the scarcity of data for specific regions often hinders model performance, with models trained on region-specific data struggling to generalize due to differences in data distributions. While traditional methods based on expert knowledge tend to generalize better across regions, they are limited in leveraging multi-source data effectively, resulting in suboptimal predictive accuracy. This paper addresses this challenge by exploring how accumulated domain expertise in wildfire prediction can reduce model reliance on large volumes of high-quality data. An active learning algorithm is proposed based on XGBoost for wildfire risk assessment that autonomously identifies low-confidence predictions and seeks re-labeling through a human-in-the-loop or physics-based correction approach. This corrected data is reintegrated into the model, effectively preventing catastrophic forgetting. Experimental results demonstrate that the proposed human-in-the-loop approach significantly enhances labeling accuracy, predictive performance, and preserves the model’s ability to generalize. These findings highlight the value of incorporating human expertise into machine learning models, offering a practical solution to mitigate data quality challenges and improve model reliability in wildfire risk prediction.

Keywords

Risk prediction; active learning; XGBoost; spatio-temporal division
  • 265

    View

  • 59

    Download

  • 0

    Like

Share Link