Preprints
https://doi.org/10.5194/egusphere-2026-199
https://doi.org/10.5194/egusphere-2026-199
06 Feb 2026
 | 06 Feb 2026
Status: this preprint is open for discussion and under review for Climate of the Past (CP).

From manual classification to large language models: assessing the quality and consistency of historical convective event records

Franck Schätz and Rüdiger Glaser

Abstract. Historical text sources represent a central, yet methodologically challenging basis for the reconstruction of convective weather events. This study examines the extent to which historical reports on thunderstorms and hailstorms contain reliable climatological information, despite heterogeneous sources, varying degrees of detail and linguistic diversity. Based on a corpus prepared using source criticism, qualitative descriptions are converted into structured evidence levels and intensity classes and analysed using statistical methods and a multilingual BERT language model.

The reconstructed time series show a distinctly stable seasonal signal with a dominant summer maximum that occurs independently of fluctuations in source density and is consistent both in the overall series and in a dense observation window. A comparison with modern observation data from the German Weather Service and with independent historical reconstructions shows a high degree of agreement in seasonal patterns despite different survey methods and time periods. Analysis of the intensity classes also shows that historical sources do not primarily document extreme events, but rather reflect a physically plausible ranking of event strengths.

The results of the automated classification prove that the language model reliably reproduces seasonal and intensity-related patterns and implicitly captures source-specific reporting patterns without levelling them. Overall, the study shows that AI-supported methods can extract robust climatological information from historical texts when processed using rigorous methods, thus opening up new perspectives for quantitative historical climate research.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Franck Schätz and Rüdiger Glaser

Status: open (until 03 Apr 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Franck Schätz and Rüdiger Glaser
Franck Schätz and Rüdiger Glaser
Metrics will be available soon.
Latest update: 06 Feb 2026
Download
Short summary
Could AI help us decode historical storm reports? This study develops a method for classifying storms and hail events. We have transformed vague descriptions into reliable climate data, showing that even ancient texts exhibit clear seasonal patterns. Training a language model shows that AI can automatically extract weather information while meeting scientific standards. Free tools reveal that summer is the peak season for storms.
Share