Global Sub-national Impact-based Forecasting for Tropical Cyclones Using Open Data: Combining Machine Learning and Exposure-based Approaches

Moss, Federico; Mejova, Yelena; Kaltenbrunner, Andreas; Downing, Tristan; van den Homberg, Marc; Ndirangu, Pauline; Milano, Leonardo; Kalimeri, Kyriaki

doi:10.5194/egusphere-2026-1996

Preprints

https://doi.org/10.5194/egusphere-2026-1996

Preprints

15 Apr 2026

| 15 Apr 2026

Status: this preprint is open for discussion and under review for Natural Hazards and Earth System Sciences (NHESS).

Global Sub-national Impact-based Forecasting for Tropical Cyclones Using Open Data: Combining Machine Learning and Exposure-based Approaches

Federico Moss, Yelena Mejova, Andreas Kaltenbrunner, Tristan Downing, Marc van den Homberg, Pauline Ndirangu, Leonardo Milano, and Kyriaki Kalimeri

Abstract. Tropical cyclones (TCs) cause substantial and uneven impacts across regions, driven by differences in exposure and vulnerability. While anticipatory action (AA) systems aim to mitigate these impacts, they are typically based on hazard thresholds rather than predicted consequences, limiting their effectiveness and consistency. Impact-based forecasting offers a promising alternative, but existing approaches are often region-specific or rely on non-transferable data. In this study, we develop a global, sub-national impact-based forecasting framework that predicts affected-population fractions using only openly available data. The model integrates hazard, exposure, and contextual features within a two-stage XGBoost architecture and is evaluated across 780 historical TC events using decision-relevant metrics aligned with operational thresholds. Our results show that machine learning improves the detection and spatial localization of impacts, but does not outperform simpler exposure-based approaches in identifying severe events. This reveals a fundamental trade-off between coverage and conservative severity detection, suggesting that hybrid strategies combining both approaches are better suited for operational use. We position this system as a first-generation global benchmark for impact-based forecasting: it demonstrates the feasibility of transferable, sub-national predictions using open data, while clarifying the limitations that must be addressed for reliable deployment in anticipatory action systems.

Received: 08 Apr 2026 – Discussion started: 15 Apr 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Federico Moss, Yelena Mejova, Andreas Kaltenbrunner, Tristan Downing, Marc van den Homberg, Pauline Ndirangu, Leonardo Milano, and Kyriaki Kalimeri

Status: open (until 19 Jul 2026)

Post a comment Subscribe to comment alert

RC1: 'Comment on egusphere-2026-1996', Bernard Alan Racoma, 24 Jun 2026 reply

This manuscript presents a timely and well-executed study on global, sub-national impact-based forecasting for tropical cyclones using a two-stage XGBoost framework and openly available data. The work is particularly valuable in its integration of hazard, exposure, and contextual predictors at global scale, as well as its comparison against operationally relevant exposure-based baselines. The inclusion of decision-oriented evaluation (e.g., 0% vs. 15% thresholds), sensitivity analyses (temporal and geographic), and exploratory forecast-based experiments strengthens the contribution and demonstrates careful methodological consideration. Overall, the paper addresses an important problem and provides meaningful insights into the trade-offs between machine learning and simpler rule-based approaches.
At the same time, several aspects of the manuscript would benefit from clarification and further refinement. In particular, some methodological elements would be clearer with additional explanation, including the definition and use of key concepts such as the ‘impact fraction’, the selection of decision thresholds (0% and 15%), and the role of hyperparameters in the XGBoost models. Similarly, certain components—such as the description of the TIGGE forecasts, the operational interpretation of the two-stage framework, and the aggregation to ADM1 units—would benefit from more explicit documentation or justification, especially given the global and heterogeneous nature of the dataset.
Additionally, there are opportunities to strengthen the interpretation and presentation of results. For example, reporting class distributions more explicitly would improve understanding of model performance under strong class imbalance, and the conclusions and future work sections could be expanded to more clearly articulate the methodological contribution, novelty, and implications of the findings (e.g., the complementarity of machine learning and threshold-based approaches, and the role of rainfall-driven impacts).
I have also attached an annotated version of the manuscript with specific comments and suggestions provided directly in the text.
Overall, these are relatively minor revisions focused on clarity, transparency, and positioning, and I recommend the manuscript for publication pending minor revisions.

Reply

Citation: https://doi.org/10.5194/egusphere-2026-1996-RC1
RC2: 'Comment on egusphere-2026-1996', Anonymous Referee #2, 03 Jul 2026 reply

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2026/egusphere-2026-1996/egusphere-2026-1996-RC2-supplement.pdf
Reply

Citation: https://doi.org/10.5194/egusphere-2026-1996-RC2

Federico Moss, Yelena Mejova, Andreas Kaltenbrunner, Tristan Downing, Marc van den Homberg, Pauline Ndirangu, Leonardo Milano, and Kyriaki Kalimeri

Viewed

Total article views: 458 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
315	123	20	458	21	18

HTML: 315
PDF: 123
XML: 20
Total: 458
BibTeX: 21
EndNote: 18

Views and downloads (calculated since 15 Apr 2026)

Month	HTML	PDF	XML	Total
Apr 2026	152	43	11	206
May 2026	133	63	4	200
Jun 2026	19	5	3	27
Jul 2026	11	12	2	25

Cumulative views and downloads (calculated since 15 Apr 2026)

Month	HTML	PDF	XML	Total
Apr 2026	152	43	11	206
May 2026	133	63	4	200
Jun 2026	19	5	3	27
Jul 2026	11	12	2	25

Viewed (geographical distribution)

Total article views: 444 (including HTML, PDF, and XML) Thereof 444 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 16 Jul 2026

Short summary

This study explores how to better predict the real impacts of tropical cyclones on people, not just the strength of the storm. Using openly available global data, we developed a method to estimate how many people may be affected in different areas. We find that combining data-driven models with simple rules gives the most reliable results. This approach can help improve early warnings and support faster, more targeted disaster response, potentially reducing harm to vulnerable communities.


Total:	0
HTML:	0
PDF:	0
XML:	0