TY - EJOU AU - Wang, Hongrong AU - Geng, Hang AU - Yuan, Jing AU - Zhang, Wen AU - Sheng, Hanmin AU - Wang, Qiuhua AU - Li, Xinjian TI - XGBoost-Based Active Learning for Wildfire Risk Prediction T2 - Computer Modeling in Engineering \& Sciences PY - 2025 VL - 145 IS - 3 SN - 1526-1506 AB - Machine learning has emerged as a key approach in wildfire risk prediction research. However, in practical applications, the scarcity of data for specific regions often hinders model performance, with models trained on region-specific data struggling to generalize due to differences in data distributions. While traditional methods based on expert knowledge tend to generalize better across regions, they are limited in leveraging multi-source data effectively, resulting in suboptimal predictive accuracy. This paper addresses this challenge by exploring how accumulated domain expertise in wildfire prediction can reduce model reliance on large volumes of high-quality data. An active learning algorithm is proposed based on XGBoost for wildfire risk assessment that autonomously identifies low-confidence predictions and seeks re-labeling through a human-in-the-loop or physics-based correction approach. This corrected data is reintegrated into the model, effectively preventing catastrophic forgetting. Experimental results demonstrate that the proposed human-in-the-loop approach significantly enhances labeling accuracy, predictive performance, and preserves the model’s ability to generalize. These findings highlight the value of incorporating human expertise into machine learning models, offering a practical solution to mitigate data quality challenges and improve model reliability in wildfire risk prediction. KW - Risk prediction; active learning; XGBoost; spatio-temporal division DO - 10.32604/cmes.2025.073513