Open Access
ARTICLE
XGBoost-Based Active Learning for Wildfire Risk Prediction
1 School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
2 Sichuan Provincial Institute of Forest and Grassland Survey and Planning, Chengdu, 610081, China
3 Southwest Forestry University, Kunming, 650224, China
4 Guangxi University, Nanning, 530004, China
5 Guangxi Institute of Industrial Technology for Space-Time Information Co., Ltd., Nanning, 530201, China
* Corresponding Author: Hang Geng. Email:
(This article belongs to the Special Issue: Incomplete Data Test, Analysis and Fusion Under Complex Environments)
Computer Modeling in Engineering & Sciences 2025, 145(3), 3701-3721. https://doi.org/10.32604/cmes.2025.073513
Received 19 September 2025; Accepted 15 October 2025; Issue published 23 December 2025
Abstract
Machine learning has emerged as a key approach in wildfire risk prediction research. However, in practical applications, the scarcity of data for specific regions often hinders model performance, with models trained on region-specific data struggling to generalize due to differences in data distributions. While traditional methods based on expert knowledge tend to generalize better across regions, they are limited in leveraging multi-source data effectively, resulting in suboptimal predictive accuracy. This paper addresses this challenge by exploring how accumulated domain expertise in wildfire prediction can reduce model reliance on large volumes of high-quality data. An active learning algorithm is proposed based on XGBoost for wildfire risk assessment that autonomously identifies low-confidence predictions and seeks re-labeling through a human-in-the-loop or physics-based correction approach. This corrected data is reintegrated into the model, effectively preventing catastrophic forgetting. Experimental results demonstrate that the proposed human-in-the-loop approach significantly enhances labeling accuracy, predictive performance, and preserves the model’s ability to generalize. These findings highlight the value of incorporating human expertise into machine learning models, offering a practical solution to mitigate data quality challenges and improve model reliability in wildfire risk prediction.Keywords
Cite This Article
Copyright © 2025 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools