Open Access
ARTICLE
Predicting Tropical Cyclone Genesis Location Using STAG-Net: A Spatio-Temporal Attention-Gated Network
1 Department of Information Technology, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
2 Department of Software Engineering, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
3 Department of Computer Science, COMSATS University Islamabad, Vehari Campus, Vehari, Pakistan
4 Department of Artificial Intelligence, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
5 Center for Advanced Analytics, CoE for Artificial Intelligence, Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Selangor, Malaysia
6 Artificial Intelligence and Data Analytics Laboratory (AIDA), College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia
7 Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
* Corresponding Author: Rab Nawaz Bashir. Email:
(This article belongs to the Special Issue: Applied Machine Learning for FAIR and Responsible Modelling)
Computer Modeling in Engineering & Sciences 2026, 147(2), 38 https://doi.org/10.32604/cmes.2026.078569
Received 04 January 2026; Accepted 23 March 2026; Issue published 27 May 2026
Abstract
Tropical Cyclone (TC) genesis forecasting is an important aspect of early warning systems, as it allows the adoption of early warnings and mitigation plans. However, existing methods often rely on binary classification or fail to capture the complex spatio-temporal dependencies that govern TC formation. To address this limitation, this study introduces STAG-Net, a novel Spatio-Temporal Attention-Gated Network designed to directly predict the geographical coordinates of TC genesis. The model uses multivariate variables of meteorological factors such as u-wind, v-wind, relative humidity, temperature, and large-scale dynamic features using a Convolutional Neural Network (CNN), Gated Recurrent Units (GRUs), and a channel-wise attention mechanism in identifying both spatial and temporal characteristics. The methodology takes the initial tropical disturbance data as an input and obtains spatial features in the ERA5 reanalysis dataset that covers 37 isobaric pressure levels. The study also investigates the effect of grid resolution on prediction performance, as four grid sizes were compared, namelyKeywords
Tropical cyclone (TC) genesis forecasting is an important concern in atmospheric science and disaster risk management [1]. The term “cyclone genesis” is used to characterize the transition from a tropical disturbance to a tropical cyclone, as this is a critical phase that influences the storm’s future trajectory and development. It is imperative to ensure adequate forecasting of this phase at the earliest possible stage, as it provides significant lead time for proactive decision-making. This lead time can be effectively used to provide timely warnings and mobilise emergency response agencies in order to minimise potential socio-economic losses [2]. The importance of enhancing predictive capabilities is further underscored by the growth in the frequency and intensity of tropical cyclones, which may be worsened by the rising climate [3]. In numerous instances, conventional forecasting methodologies, which primarily rely on numerical weather prediction schemes [4] and heuristic warning criteria [5], are unable to identify weak or emerging disturbances that subsequently develop into strong cyclonic storms. This vulnerability is particularly evident in regions with sparse data or those located in low-latitude oceanic regions [6]. It is imperative to develop models that can leverage the extensive collection of climatic data and reveal the concealed forms associated with cyclogenesis. It would be beneficial to implement an alternative that is based on a data-driven approach to spatio-temporal atmospheric information and has the capacity to learn non-linear, multi-factorial relationships among the factors (pressure, temperature, wind, humidity, etc.). This field holds immense potential to transform cyclone early warning systems and improve climate resilience across vulnerable regions.
Traditional TC genesis prediction systems were based on both numerical weather prediction (NWP) [4] models and empirical indices [3] based on physical knowledge of atmospheric dynamics. Although these models have played an essential role in productive forecasting, they experience several limitations. Most pronouncedly, their coarse resolution fails to capture mesoscale structure and convective processes that are very crucial in the formation of cyclones. Furthermore, the traditional models require high computational power and have high forecasting error when lead times exceed 48 h [7]. In addition, often deterministic predictions do not provide the probabilistic meaning that is vital in risk-based decision-making. Even more sophisticated ensemble systems, like those applied in the S2S (Subseasonal to Seasonal) framework [8], do not perform well beyond a week’s timeframe.
Statistical models [9–11] have also been employed for the TC genesis forecasting task. These models are conventionally based on empirical indices and regression methods using environmental data such as vorticity, wind shear, and humidity. Although these models are computationally effective, they fail to reflect non-linear interactions and capture spatio-temporal relationships, which restrict their effectiveness in predicting the location of TC genesis.
In the past few years, several researchers have utilized the effectiveness of machine learning for cyclone-genesis prediction. Studies [12,13] map TC genesis into a binary classification problem. These systems are effective in distinguishing non-developing tropical depressions from developing ones and enabling early warnings. The study [14] presents TC-Pred, a deep learning model based on the transformer, for predicting TC genesis using ERA5 spatiotemporal data. It takes advantage of long-range dependency learning achieved via attention mechanisms. The method performs better than the CNN and Convolutional Long Short-Term Memory (ConvLSTM) benchmarks. In the study [15], a spatiotemporal deep learning model is suggested using ConvLSTM to forecast the genesis of a tropical cyclone (TC) based on environmental fields from ERA5 data. It poses TC genesis prediction as a binary classification task of genesis vs. non-genesis-related events. Although the performance was good, the weaknesses are regional imbalance and less generalization across the basins.
Although in the recent past there have been significant improvements in the prediction of TC genesis using machine learning, current models have several major limitations, which highlight the need for further improvement. A prominent shortcoming is the overwhelming focus on binary classification, where models merely determine whether a disturbance will develop into a TC within a fixed lead time. Such models are indeed effective in the early detection but fail in the ability to give the absolute position of genesis, which is essential in targeted early alerts, resource placements, and activity scheduling. TC genesis models, such as optimized Kalman Filter algorithm [12], TCGP-Net [15] use static or short-time temporal features to predict the genesis of occurrence at the current location of disturbance and do not consider the displacement of the disturbance location to a different spot within the field. Furthermore, the current TC genesis prediction studies do not focus much on the vertical structure of the atmosphere. In a majority of the reports, only a few isolated isobaric levels are assessed or averaged, disregarding the complete vertical structure that actually controls the cyclogenesis. It is possible that this simplification will cause the underrepresentation of features and lower model robustness. Thus, it becomes crucial to have a data-based model that allows prediction of the most likely genesis location while simultaneously studying all involved isobaric layers [16], offering a more spatially and vertically focused method of TC genesis forecasting.
To overcome these limitations, this study proposes a novel methodology for TC genesis location prediction. The proposed study utilizes ERA-5 reanalysis spatio-temporal data in grid format and predicts the coordinates of possible TC genesis. Pressure-level data for four different components (U-component of wind, V-component of wind, relative humidity, and temperature) across 37 different isobaric planes (1000 to 1 hPa) are extracted in four different grid sizes (
• A novel methodology, Spatio-Temporal Attention-Gated Network (STAG-Net), is proposed for TC genesis location prediction by utilizing wide temporal information as well as deep atmospheric data in spatio-temporal format from ERA5 reanalysis.
• The impact of varying spatial grid sizes (
• Conducted comprehensive experiments to analyze the contribution of four atmospheric components across 37 isobaric levels, identifying the most effective variable pair for enhancing prediction accuracy.
The rest of the study is organized as follows: Section 1.1 contains a survey of state-of-the-art research and its associated concerns. The methodology is described in Section 2. The evaluation of results is presented in Section 3. Lastly, Section 4 concludes the study by presenting the major findings and future directions.
Predicting TC genesis falls into three broad categories, including numerical forecasting techniques, empirical indices, and data-driven techniques. Physical or ECMWF [17,18] and GFS numerical models [19] can be used to calculate atmospheric dynamics by solving physical equations, although they usually have high computing costs and initialization problems. Other empirical indices, such as the Genesis Potential Index (GPI) [3], measure multiple environmental variables through a past trend but are not dynamic to changing climate conditions. However, an alternative data-based approach, which involves machine learning models specifically, has gained prominence due to being able to learn complex spatio-temporal patterns in machine learning models in reanalysis data-based approaches with a higher level of accuracy and operational efficiencies of machine learning models.
The statistical and machine learning models are a part of the data-driven models in predicting tropical cyclone genesis. The study by [20] used a statistical framework to determine important environmental factors of short-term TC genesis prediction. They have modeled the problem as a classification problem, where the validation was done using graphical model structure learning (PC algorithm) and logistic regression. However, the data imbalance and discretizations of statistical dependency are key limitations of statistical models. The study [21] formulated a statistical-dynamical tropical cyclone genesis guidance tool based on global model outputs by carrying out a multiple logistic regression. The problem of genesis prediction was conceptualized in the study as a classification problem and found major predictors in basins. Even though the forecasts were traditionally well-calibrated, they had the drawbacks of the biases in models and the prediction imperfection of regional specificity.
Although these developments have occurred, there are complex nonlinear and spatio-temporal relationships that can hardly be represented using traditional statistical models. Therefore, more advanced machine learning-based models have been applied to improve tropical cyclone genesis prediction.
1.1.2 Machine Learning-Based Models
Machine learning-based methods have been used for meteorological tasks. The study [22] used an ensemble approach for TC path prediction. The study [23] used a ConvGRU model for TC intensity prediction up to 24 h.
The study [24] proposes a machine learning methodology to define the possibility of mesoscale convective systems (MCSs) developing into a TC. Positioned as a classification task, it utilized AdaBoost as the classification algorithm, achieving an F1-score of 97.2 with a 6-h lead time. A major contribution is the combination of environmental and MCS-specific predictors, which provides better prediction than conventional methods such as the Genesis Potential Index (GPI).
The study [25] proposes a machine learning framework for long-term TC genesis prediction using Support Vector Machine (SVM) and AdaBoost algorithms. By accounting for five important meteorological factors, the model successfully maps input features to genesis outcomes as a classification task. It balances the data and training splits and provides better spatio-temporal precision and generalization.
The study [26] uses multiple environmental variables to forecast TC genesis with a Maximum Entropy (MaxEnt) machine learning model. The model, trained on reanalysis data and tested on historical observations, provides more spatially accurate potential genesis index values compared to conventional genesis indices. When applied to CMIP6 future climate scenarios, it demonstrates a statistically significant reduction in TC genesis prediction error, indicating a complex nonlinear correlation between potential intensity and TC genesis.
The study [27] put forward a multiscale regression model to predict the frequency of TC genesis (TCGF) in the Northern Hemisphere. The model represents variability of observed TCGF in 1960–2019 in six climate-related predictors at interannual, interdecadal, and global warming time scales. Together with CMIP6 climate data, it forecasts basin-scale climate change in the near future, which gives a hybrid statistical-dynamical model that has a better decadal predictivity than longer-range historical trajectory-based prediction methods.
The study [28] proposes a novel methodology that combines CNN and Long Short-Term Memory (LSTM) within a deep learning framework to predict typhoons. It combines 3D and 2D CNNs to get spatial information based on atmospheric and ocean surface data, and LSTM acquires time-dependent information. The model is assessed using three regional datasets, and it is found to be better than the conventional numerical, statistical, and machine learning approaches, and this offers an innovative spatio-temporal reservoir-based technique to typhoon prediction, disaster warning, and preparedness.
The study [29] introduces a deep learning model, TCGNet, that combines Convolutional Neural Networks (CNNs) with channel- and spatial-attention mechanisms to emulate tropical cyclone genesis (TCG) under climate change conditions. Based on two historical and CMIP6 datasets, TCGNet shows improved prediction ability and generalization ability compared to the conventional statistical models. It eliminates the need for manual feature selection, captures complex, spatio-temporal patterns in the environment, and extrapolates the lower frequency of TCG in the case of increased carbon emissions.
The study [15] establishes an enhanced spatiotemporal deep learning model, TC-Pred, in which TC intensity is predicted at high resolution. The most important contribution is a feature aggregation mechanism for multi-source environmental data and the incorporation of a convolutional transformer module into the sequence-to-sequence architecture to reduce sequential dependency problems. The task is positioned as a regression task by forecasting future TC intensities. The model uses the ConvGRU algorithm embedded within a deep neural network.
The study [30] leverage artificial intelligence to increase the predictability of extreme events on subseasonal to decadal timescales through the hybrid integration of artificial intelligence and climate models, along with best practices for robustness and interpretability.
The study [31] has proposed the XAI-GPI framework, which uses the combination of clustering, ensemble feature selection, and Shapley Additive Explanations (SHAP) to better estimate the genesis of tropical cyclones while providing transparent driver attribution. The study [32] suggests NeuralGCM, an ML-physics model to make efficient seasonal predictions of tropical cyclone activity with competitive performance. The study [33], the researchers suggest the use of uncertainty-sensitive cyclone forecasting using interpretability-driven white-box models, including decision trees and Bayesian rule lists.
Although the accuracy of tropical cyclone genesis prediction has been improved significantly with statistical and machine learning-based models, as shown in Table 1, they still have significant shortcomings. Most importantly, these models are binary classification models and do not directly predict TC genesis location. Additionally, these studies use a fixed grid size and fail to consider the best grid size. These gaps point to the need for a spatio-temporal modeling framework for predicting genesis coordinates as well as testing various grid sizes to identify a new grid size.

This section describes the proposed methodology, which can be divided into two phases: feature construction and model development. The feature construction phase is based on data extraction and feature engineering to generate spatio-temporal data. Afterwards, the model development phase makes use of these features to train a predictive model that can effectively predict the location of TC genesis.
In this study, TC genesis location prediction is formulated as a regression problem based on spatio-temporal data. Let
Let
•
•
•
The goal is to learn a regression function, as shown in Eq. (1):
that maps the input spatio-temporal features to the predicted genesis coordinates.
The feature construction stage starts with the temporal identification of tropical disturbances, followed by the extraction of the spatio-temporal representation around each disturbance to be used in predictive modeling.
2.2.1 Step 1: Tropical Disturbance Identification
Let
denote the CMA Best Track dataset [34], where each record captures the timestamp

These
where T represents time, and
2.2.2 Step 2: Spatial Grid Definition
For each detected disturbance,
where
2.2.3 Step 3: Multivariable, Multi-Level Atmospheric Data Extraction
From the ERA5 reanalysis dataset [35], we extract multiple variables (e.g., zonal wind
These four variables were chosen because they are representative of the basic dynamical and thermodynamical processes that control TC genesis prediction. Previous studies have unanimously found mid-level vorticity, wind structure, temperature stratification, and moisture availability to be dominant predictors of genesis [12]. Moreover, some key indices, such as vertical wind shear, tilting, and moisture convergence, can be diagnostically established from
For each variable
This yields a 4D feature tensor for each sample
where:
•
•
•
Each tensor

Figure 1: Representation of spatio-temporal feature extraction for a single atmospheric component.
2.2.4 Step 4: Single-Level Atmospheric Data Extraction
Sea surface temperature (SST) is extracted from the ERA5 single-level reanalysis dataset as a surface variable without vertical pressure dependence. Unlike multi-level atmospheric variables defined across pressure levels
For each tropical cyclone instance at time
The resulting spatial tensor is represented as:
To integrate SST into the temporal branch, the spatial grid is vectorized and concatenated with the temporal feature representation:
where T denotes the temporal length, and
The proposed STAG-Net (Spatio-Temporal Attention-Gated Network) predicts the TC genesis location by utilizing wide temporal information as well as deep atmospheric data in spatio-temporal format, as shown in Fig. 2. The proposed model is a hybrid network structure composed of a gated recurrent unit (GRU) for sequential temporal learning in the ground-level features of tropical disturbances and multi-channel CNNs with channel-wise attention for extracting rich representations from the 4D atmospheric features. STAG-Net also adds GRU-based temporal encoding over pressure levels to capture vertical dynamics. The model achieves its prediction goal by combining wide and deep feature representations using a fully connected fusion layer, which produces the predicted coordinates for the potential TC genesis area.

Figure 2: Proposed STAG-Net model for TC genesis location prediction.
2.3.1 Temporal Encoding via GRU
We use a two-layer GRU to process the temporally wide input as in Eq. (10):
where
2.3.2 Spatial Feature Encoding via CNN
Each variable
Convolutional Block: The initial input to the convolutional pipeline is denoted as
Channel-Wise Attention: At each block, attention is applied as in Eq. (13). The attention map
After 3 layers of Conv-Attention-Pooling, the spatial feature is flattened as in Eqs. (14) and (15):
The CNN-encoded features for each variable across T are:
Eq. (16) represents the use of a Gated Recurrent Unit (GRU) to process a sequence of feature vectors
2.3.3 Feature Fusion and Output Layer
We concatenate the wide and deep features from each variable as in Eq. (18):
This vector is passed through two dense layers as in Eqs. (19) and (20):
where
The model is trained using Mean Absolute Error (L1 loss) as in Eq. (21).
To assess the effectiveness of the proposed STAG-Net framework in predicting the location of TC genesis, a comprehensive set of experiments was carried out using spatio-temporal features based on ERA5 reanalysis data. The analysis aims to determine the effectiveness of the model in its capability to determine TC genesis coordinates ahead of time, given the initial tropical disturbance data.
The chosen hyperparameters represent an optimized training scheme of the STAG-Net model, as shown in Table 3. We use Adam optimizer because it is good at dealing with sparse gradients and a learning rate of 0.0007 to get stable convergence. The model is based on L1 Loss (MAE) to emphasize the avoidance of large errors. The model is trained for 128 epochs with 128 batch sizes, which is a trade-off between computational performance and learning stability. The data is split as 80:20 in terms of training and testing. The architectural building block consists of three convolutional layers containing a gradually increasing number of channels (64, 128, 256), Max Pooling for spatial down-sampling and attention layers that increase the discriminative power of the features after each convolution stage.

Evaluation metrics used in this study are defined as in Eqs. (22)–(24).
where:
•
•
•
• RSS represents the residual sum of squares.
• TSS is the total sum of squares.
3.3.1 Performance of Proposed Model
The performance of the proposed model, as shown in Table 4, indicates clear evidence that the proposed model ((u, v, r, t, wide) + attention) demonstrates strong predictive capability for TC genesis locations. Particularly, it has the lowest MAE for latitude (


Figure 3: Performance visualization of the STAG-Net model for component-wise fusion.
The performance of the proposed STAG-Net model was assessed using two real-time cases of TC genesis: MINDULLE (2016) and YAGI (2013). In the case of TC MINDULLE, the actual genesis coordinates were (
These findings indicate that STAG-Net has a high ability to forecast TC genesis locations with sub-degree error in latitude and longitude, as shown in Fig. 4. Fig. 5 shows the distribution of prediction errors, demonstrating right-skewed behavior, with most predictions clustered around the median (

Figure 4: Performance visualization of the STAG-Net model on real-time TC MINDULLE (2016) and YAGI (2013) genesis locations.

Figure 5: Distribution of tropical cyclone genesis location errors for test events.
To further examine the temporal robustness of the proposed model, the time gap between the start of the disturbance and the time taken to reach TC genesis was divided into four slots, i.e., 0–24 h, 24–48 h, 48–72 h, and more than 72 h. As can be seen from Table 5, the accuracy of the prediction gradually declines as the time interval increases. The lowest mean MAE is shown in the first 24 h (

3.3.2 Performance of Attention Layer
To evaluate the effectiveness of the channel-wise attention layer, we compare the results of the STAG-Net model with ((u, v, r, t, wide) + attention) and without (u, v, r, t, wide) attention layer, as shown in Table 4. The results indicate that the baseline model obtained
The Gradient-weighted Class Activation Mapping (Grad-CAM) visualization, as shown in Fig. 6, highlights the spatial regions that are most important for predicting cyclone genesis for the TC named “Cecil.” The activation map indicates an area of localized focus, which indicates that the model is perceiving physically meaningful structures in the atmosphere instead of diffuse structures.

Figure 6: Grad-CAM overlay showing spatial importance regions for cyclone genesis prediction.
The great reason for this enhancement is that channel-wise attention enables the model to focus on the most useful feature maps at each convolutional block. In the prediction of the location of TC genesis, various meteorological factors (such as components of wind, temperature, relative humidity, etc.) play different roles at different locations. These feature channels are adaptively adjusted by the attention mechanism, which favours significant features and suppresses irrelevant or noisy ones. Such a selective focus is a part of more effective representation learning and therefore leads to less prediction error and better generalization.
To evaluate the impact of grid size in the proposed approach, we compare the performance of multiple grid sizes. In our experimental evaluation, we compare four different grid sizes (


Figure 7: Performance visualization of grid size in the STAG-Net model.
The

Figure 8: Channel-wise attention maps for all grid sizes illustrating the model’s focus regions.
However, further increasing the grid size beyond 30
3.3.4 Comparison with Baseline Models
Most of the existing work on TC genesis prediction is presented in the context of a binary classification problem (i.e., genesis vs. non-genesis); therefore, these studies do not allow direct comparison with our coordinate prediction problem. Hence, we adjust the most suitable baseline models to fit our regression-based model in order to make a fair comparison.
We modified three baseline models: a temporal model (GRU) [36], Saf-Net [37], and the Dynamic Spatio-temporal model (DST) [16]. To ensure a fair comparison, only the last classification layers of the baseline models were changed, while the original feature extraction architectures were kept unchanged. Specifically, the sigmoid/softmax output layers were replaced with fully connected linear layers having two outputs (latitude and longitude), and the binary cross-entropy loss was replaced with Mean Squared Error (MSE). All convolutional, attention, and recurrent modules remained unchanged to maintain architectural consistency.
The results shown in Table 7 and Fig. 9 provide a clear indication that STAG-Net efficiently outperforms the baseline models with regard to all important performance metrics, including MAE, MSE, RMSE, and


Figure 9: performance comparison of STAG-Net model with other temporal and Spatio-temporal models.
The results of the study illustrate the efficiency of the proposed STAG-Net model for the prediction of TC genesis location. STAG-Net exceedingly performs better than the baseline temporal and spatio-temporal models in different evaluation metrics, namely MAE, MSE, RMSE, and
The addition of channel-wise attention mechanisms is another reason why the model has achieved better performance. Such attention modules help the model to focus on the most relevant feature maps by giving it importance scores, in this way minimizing the influence of irrelevant or noisy data. Consequently, the model shows a better performance in capturing more informative and discriminative features, which is very beneficial for the generalization ability.
Besides, the grid resolution of
The proposed model processes a dense tensor but is nonetheless computationally feasible for early-warning applications. For the case of

Although the proposed STAG-Net model shows positive results, there are several limitations. For example, the model is based on a fixed-grid resolution and does not dynamically adapt to different spatial scales of various tropical disturbances. Further, the model is likely to perform poorly in areas or seasons where past cyclone activity is limited due to insufficient training data. Addressing such issues in future work would improve the robustness and extensibility of the model.
Furthermore, the current work mainly focuses on multi-level atmospheric dynamics to study vertically structured predictors of cyclogenesis. While considering model dimensionality and computational cost, single-level thermodynamic spatial fields such as ocean heat content were not fully integrated into the spatial branch, which may limit the representation of oceanic energy contribution to genesis.
This study introduces STAG-Net, a novel Spatio-Temporal Attention-Gated Network, which combines multivariate meteorological inputs, i.e., u-wind, v-wind, relative humidity, temperature, and large-scale features, together with convolutional layers, GRU, and a channel-wise attention mechanism. The study utilized the initial tropical disturbance data to extract spatial information from the ERA5 reanalysis data store. The reanalysis data were extracted in the form of grids for 37 different isobaric planes and 4 different variables. Furthermore, the study also evaluates the impact of grid size and compares four different grid sizes
The performance obtained in the evaluation shows significant improvement compared with the benchmark DST model. Specifically, the Average Mean Absolute Error (MAE) decreased to
Future investigations will be aimed at methodological extensions to make the models more robust and applicable in practice. To start with, the dynamic grid resolution will be implemented with the help of a multi-scale input strategy, and coarse and fine-resolution grids will be selected dynamically based on the level of disturbances and environmental variability. This can be hierarchical CNNs or attention-based spatial scaling. Second, multi-source data fusion will use feature fusion or learning of a transformer to combine the satellite brightness temperature data and radar precipitation fields. Third, uncertainty quantification will be integrated either through Monte Carlo dropout or ensemble learning to compute the prediction confidence intervals. Finally, real-time data ingestion pipelines will be developed for operational forecast applications.
Acknowledgement: This research is supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2026R760), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. The authors are thankful for the support of the Artificial Intelligence & Data Analytics Lab (AIDA), CCIS Prince Sultan University, Riyadh, Saudi Arabia.
Funding Statement: This research is supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2026R760), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Author Contributions: Kalim Sattar led the research and conducted the design and methodology. Malik Muhammad Saad Missen supervised the methodological development. Syeda Zoupash Zahra carried out the result analysis. Najia Saher and Rab Nawaz Bashir contributed to manuscript writing and content organization. Oumaima Saidani, Shahid Kamal and Muhammad I. Khan validated the methodology and results. All authors reviewed and approved the final version of the manuscript.
Availability of Data and Materials: GitHub repository: https://github.com/chkalim/TC_Genesis.
Ethics Approval: Not applicable.
Conflicts of Interest: The authors declare no conflicts of interest.
References
1. Hon KK, Ballard R, Blake E, Bond S, Gile R, Halperin D, et al. Recent advances in operational tropical cyclone genesis forecast. Tropical Cyclone Res Rev. 2023;12(4):323–40. doi:10.1016/j.tcrr.2023.12.001. [Google Scholar] [CrossRef]
2. Hu L, Wen T, Shao Y, Wang Q, Fang W, Yang J, et al. Economic impacts of tropical cyclone-induced multiple hazards in China. Earths Future. 2023;11(9):e2023EF003622. doi:10.1029/2023ef003622. [Google Scholar] [CrossRef]
3. Camargo S, Sobel AH, Barnston AG, Emanuel KA. Tropical cyclone genesis potential index in climate models. Tellus A Dyn Meteorol Oceanogr. 2007;59(4):428–43. doi:10.1111/j.1600-0870.2007.00238.x. [Google Scholar] [CrossRef]
4. Kimura R. Numerical weather prediction. J Wind Eng Indust Aerod. 2002;90(12):1403–14. doi:10.1016/s0167-6105(02)00261-1. [Google Scholar] [CrossRef]
5. Nicholls N. Cognitive illusions, heuristics, and climate prediction. Bull Am Meteorol Soc. 1999;80(7):1385–98. doi:10.1175/1520-0477(1999)080<1385:cihacp>2.0.co;2. [Google Scholar] [CrossRef]
6. Cheung KKW. A review of ensemble forecasting techniques with a focus on tropical cyclone forecasting. Meteorol Appl. 2001;8(3):315–32. doi:10.1017/s1350482701003073. [Google Scholar] [CrossRef]
7. Chen R, Zhang W, Wang X. Machine learning in tropical cyclone forecast modeling: a review. Atmosphere. 2020;11(7):676. doi:10.3390/atmos11070676. [Google Scholar] [CrossRef]
8. Choi KS, Moon JY, Kim DW, Chu PS. Seasonal prediction of tropical cyclone genesis frequency over the western North Pacific using teleconnection patterns. Theor Appl Climatol. 2009;100(1–2):191–206. doi:10.1007/s00704-009-0182-1. [Google Scholar] [CrossRef]
9. Lee CY, Tippett MK, Camargo SJ, Sobel AH. Probabilistic multiple linear regression modeling for tropical cyclone intensity. Mon Weather Rev. 2015;143(3):933–54. [Google Scholar]
10. Wang Y, Rao Y, Tan ZM, Schönemann D. A statistical analysis of the effects of vertical wind shear on tropical cyclone intensity change over the western North Pacific. Mon Weather Rev. 2015;143(9):3434–53. doi:10.1175/mwr-d-15-0049.1. [Google Scholar] [CrossRef]
11. DeMaria M, Kaplan J. An updated statistical hurricane intensity prediction scheme (SHIPS) for the Atlantic and Eastern North Pacific basins. Weather Forecast. 1999;14(3):326–37. doi:10.1175/1520-0434(1999)014<0326:auship>2.0.co;2. [Google Scholar] [CrossRef]
12. Loi CL, Wu CC, Liang YC. Prediction of tropical cyclogenesis based on machine learning methods and its SHAP interpretation. J Adv Model Earth Syst. 2024;16(3):e2023MS003637. doi:10.1029/2023ms003637. [Google Scholar] [CrossRef]
13. Park MS, Kim M, Lee MI, Im J, Park S. Detection of tropical cyclone genesis via quantitative satellite ocean surface wind pattern and intensity analyses using decision trees. Remote Sens Environ. 2016;183:205–14. doi:10.1016/j.rse.2016.06.006. [Google Scholar] [CrossRef]
14. Qian Q, Chen Y, Jia X, Ma H, Dong W. Basin dependencies of tropical cyclone genesis environment and possible future changes revealed by machine learning methods. iScience. 2025;28(2):111714. doi:10.1016/j.isci.2024.111714. [Google Scholar] [PubMed] [CrossRef]
15. Mu B, Wang X, Yuan S, Chen Y, Wang G, Qin B, et al. A deep learning-based global tropical cyclogenesis prediction model and its interpretability analysis. Sci China Earth Sci. 2024;67(12):3671–95. doi:10.1007/s11430-023-1383-6. [Google Scholar] [CrossRef]
16. Sattar K, Malik Muhammad Saad M, Saher N, Nawaz Bashir R, Zoupash Zahra S, Faheem M, et al. Tropical cyclone intensity prediction using spatio-temporal data fusion. IEEE Access. 2025;13:70095–104. [Google Scholar]
17. Dias J, Gehne M, Kiladis GN, Sakaeda N, Bechtold P, Haiden T. Equatorial waves and the skill of NCEP and ECMWF numerical weather prediction systems. Mon Weather Rev. 2018;146(6):1763–84. doi:10.1175/mwr-d-17-0362.1. [Google Scholar] [CrossRef]
18. Navascués B, Calvo J, Morales G, Santos C, Callado A, Cansado A, et al. Long-term verification of HIRLAM and ECMWF forecasts over Southern Europe: history and perspectives of numerical weather prediction at AEMET. Atmos Res. 2013;125–126(3):20–33. doi:10.1016/j.atmosres.2013.01.010. [Google Scholar] [CrossRef]
19. López Gómez J, Troncoso Pastoriza F, Fariña EA, Eguía Oller P, Granada Álvarez E. Use of a numerical weather prediction model as a meteorological source for the estimation of heating demand in building thermal simulations. Sustain Cities Soc. 2020;62(3):102403. doi:10.1016/j.scs.2020.102403. [Google Scholar] [CrossRef]
20. Wijnands JS, Qian G, Kuleshov Y. Variable selection for tropical cyclogenesis predictive modeling. Mon Weather Rev. 2016;144(12):4605–19. doi:10.1175/mwr-d-16-0166.1. [Google Scholar] [CrossRef]
21. Halperin DJ, Hart RE, Fuelberg HE, Cossuth JH. The development and evaluation of a statistical–dynamical tropical cyclone genesis guidance tool. Weather Forecast. 2017;32(1):27–46. doi:10.1175/waf-d-16-0072.1. [Google Scholar] [CrossRef]
22. Sattar K, Zahra SZ, Faheem M, Missen MMS, Bashir RN, Abbas MZ. Stacked ensemble model for tropical cyclone path prediction. IEEE Access. 2023;11:69512–21. doi:10.1109/access.2023.3292907. [Google Scholar] [CrossRef]
23. Zhang Z, Yang X, Shi L, Wang B, Du Z, Zhang F, et al. A neural network framework for fine-grained tropical cyclone intensity prediction. Knowl Based Syst. 2022;241:108195. doi:10.1016/j.knosys.2022.108195. [Google Scholar] [CrossRef]
24. Zhang T, Lin W, Lin Y, Zhang M, Yu H, Cao K, et al. Prediction of tropical cyclone genesis from mesoscale convective systems using machine learning. Weather Forecast. 2019;34(4):1035–49. doi:10.1175/waf-d-18-0201.1. [Google Scholar] [CrossRef]
25. Wei M, Fang G, Ge Y. Tropical cyclone genesis prediction based on support vector machine considering effects of multiple meteorological parameters. J Wind Eng Ind Aerodynamics. 2023;243(1):105591. doi:10.1016/j.jweia.2023.105591. [Google Scholar] [CrossRef]
26. Qian Q, Jia X, Lin Y. Reduced tropical cyclone genesis in the future as predicted by a machine learning model. Earths Future. 2022;10(2):e2021EF002455. doi:10.1029/2021ef002455. [Google Scholar] [CrossRef]
27. Zhao J, Zhan R, Wang Y, Jiang L, Huang X. A multiscale-model-based near-term prediction of tropical cyclone genesis frequency in the northern hemisphere. J Geophy Res: Atmosph. 2022;127(22):e2022JD037267. doi:10.1029/2022jd037267. [Google Scholar] [CrossRef]
28. Chen R, Wang X, Zhang W, Zhu X, Li A, Yang C. A hybrid CNN-LSTM model for typhoon formation forecasting. GeoInformatica. 2019;23(3):375–96. doi:10.1007/s10707-019-00355-0. [Google Scholar] [CrossRef]
29. Tong B, He Y, Hu G, Duan Z, Chan P. AI-aided simulation of tropical cyclone genesis under climate change. Glob Planet Change. 2025;251(1–2):104809. doi:10.1016/j.gloplacha.2025.104809. [Google Scholar] [CrossRef]
30. Materia S, García LP, van Straaten C, Sungmin O, Mamalakis A, Cavicchia L, et al. Artificial intelligence for climate prediction of extremes: state of the art, challenges, and future perspectives. WIREs Clim Change. 2024;15(6):e914. doi:10.1002/wcc.914. [Google Scholar] [CrossRef]
31. Dainelli F, Pérez-Aracil J, Ascenso G, Scoccimarro E, Giuliani M, Sanz SS, et al. XAI-GPI: an interpretable and adaptive machine learning genesis index for tropical cyclones. J Geophy Res: Mach Learn Computat. 2025;2(4):e2025JH000865. doi:10.1029/2025JH000865. [Google Scholar] [CrossRef]
32. Zhang G, Rao M, Yuval J, Zhao M. Advancing seasonal prediction of tropical cyclone activity with a hybrid AI-physics climate model. Environ Res Lett. 2025;20(9):094031. doi:10.1088/1748-9326/adf864. [Google Scholar] [CrossRef]
33. Ferrero MA, Azzalini D, Giuliani M, Castelletti A, Cerutti F, Amigoni F. Trustworthy inductive knowledge for tropical cyclones formation detection. Cham, Switzerland: Springer Nature; 2025. p. 69–81. doi:10.1007/978-3-031-89366-7_5. [Google Scholar] [CrossRef]
34. Ying M, Zhang W, Yu H, Lu X, Feng J, Fan Y, et al. An overview of the China meteorological administration tropical cyclone database. J Atmos Ocean Technol. 2014;31(2):287–301. doi:10.1175/jtech-d-12-00119.1. [Google Scholar] [CrossRef]
35. Hersbach H, Bell B, Berrisford P, Hirahara S, Horányi A, Muñoz-Sabater J, et al. The ERA5 global reanalysis. Quart J Royal Meteorol Soc. 2020;146(730):1999–2049. doi:10.1002/qj.4803. [Google Scholar] [CrossRef]
36. Hafeez AB, Alonso E, Riaz A. DTC-TranGru: improving the performance of the next-DTC prediction model with transformer and GRU. In: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing. New York, NY, USA: ACM; 2024. p. 927–34. [Google Scholar]
37. Xu G, Lin K, Li X, Ye Y. SAF-Net: a spatio-temporal deep learning method for typhoon intensity prediction. Pattern Recognit Lett. 2022;155:121–7. [Google Scholar]
Cite This Article
Copyright © 2026 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools