Forecasting threshold exceedance of atmospheric variables at a specific location
Abstract. Accurate short-term forecasting of extreme weather events is essential for early warning systems and disaster mitigation. This study compares two methodological approaches for predicting, at some given site, threshold exceedances of atmospheric variables such as temperature and wind speed: (i) direct probabilistic methods, which treat exceedance as a binary classification problem and (ii) full distribution probabilistic methods, which model the complete conditional probability law of the target variable. Using theoretical analysis and numerical simulations on a toy model, alongside real-world data from the MeteoNet dataset (2016–2018) for southeastern France, we demonstrate that the full distribution approach consistently outperforms the direct method for rare, extreme events.
This advantage arises because the full distribution approach can effectively learn the parameters of the conditional distribution even from moderate and mild intensity events, thus achieving better calibration and discrimination in the tails. We find that the specific parametric shape of the chosen distribution plays a secondary role compared to accurately capturing predictable shifts in its bulk properties (i.e., mean and variance). This suggests that extreme exceedances are primarily driven by significant conditional
displacements of the entire distribution, rather than by unpredictable, fat-tailed anomalies within a static climatology. Our results are validated for both strong surface wind speeds and intense hourly rainfall, with performance evaluated using proper scoring rules (Brier Score, logarithmic score) and deterministic skill scores (Peirce Skill Score, Critical Success Index, Heidke Skill Score).
These findings highlight the critical importance of modeling the full probability distribution for rare-event forecasting and provide practical guidance for improving extreme weather prediction in operational meteorology.