Technical note: Quantifying System Limits: Flood Oriented Operational Stress Testing under Intensified Precipitation Regimes in Central Europe

Reinert, Jens; Klopries, Elena-Maria; Hassenjürgen, Paul; Menz, Alexander; Hofmann, Julian; Brüll, Catrina; Klein, Angela; Schüttrumpf, Holger

doi:10.5194/egusphere-2025-6378

Preprints

https://doi.org/10.5194/egusphere-2025-6378

Preprints

25 Feb 2026

| 25 Feb 2026

Technical note: Quantifying System Limits: Flood Oriented Operational Stress Testing under Intensified Precipitation Regimes in Central Europe

Jens Reinert, Elena-Maria Klopries, Paul Hassenjürgen, Alexander Menz, Julian Hofmann, Catrina Brüll, Angela Klein, and Holger Schüttrumpf

Abstract. The rise of short-duration extreme rainfall events across Central Europe has exposed critical weaknesses in traditional flood risk management methods that depend on fixed return periods and design thresholds. In response, this paper introduces FLOOD-ST: a scenario-based operational stress testing framework designed to evaluate the functional limits and vulnerabilities of water infrastructure systems under realistic, high-impact flood scenarios.

FLOOD-ST combines synthetic rainfall archetypes, varying antecedent soil moisture conditions and infrastructure-specific scenarios to systematically evaluate system performance. By integrating elements of functional stress testing, the framework diagnoses weak points and illuminates cascading failure pathways across interconnected assets. Unlike traditional hazard mapping, FLOOD-ST does not aim to replace existing tools but complements them by offering a diagnostic view of system behaviour during extreme conditions. The framework is platform-independent, modular, and scalable, making it suitable for both urban and rural catchments, especially those characterised by high system complexity, short warning times, or critical interdependencies.

This paper describes the conceptual foundation, simulation workflow, and real-world applications of FLOOD-ST, demonstrating its potential to support adaptive planning, emergency preparedness, and climate-resilient water management.

Received: 19 Dec 2025 – Discussion started: 25 Feb 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Jens Reinert, Elena-Maria Klopries, Paul Hassenjürgen, Alexander Menz, Julian Hofmann, Catrina Brüll, Angela Klein, and Holger Schüttrumpf

Status: final response (author comments only)

RC1:
'Comment on egusphere-2025-6378', Anonymous Referee #1, 12 May 2026
The authors introduce the FLOOD-ST framework that moves away from conventional flood hazard mapping and towards evaluating system behavior, identifying weak points and their cascading effects under variable scenarios and conditions. The paper is well written and drives on a comprehensive methodology section. It addresses important issues in flood modeling and management and provides valuable insight into the topic. However, some aspects should be improved before publication.
General comments:
Restructure Introduction: The introduction (Motivation for Flood-Oriented Operational Stress Testing) should be thoroughly revised to follow a more linear structure. It is better to first describe the background and motivation, break down the main problem that is discussed in this paper and then briefly present how this submission adds to solving that problem. In particular, line 45-48 basically already present the main problem this submission tackles. However, the following lines circle back to give background to the already described problem. It would be better to change the order of reasoning here.

Furthermore, line 61-64 presents a small part of the applied methodology, but line 65 then presents the main focus of the submission. After that, more background and recent developments and papers are presented again. I understand that it is often impossible to have the introduction in strictly linear order, but this submission would heavily benefit from disentangling the state of research and the own contribution more precisely. The raised issues in lines 45 and following and line 61 and following can be transferred to most parts of the introduction.

I think the main contribution of this submission is that flood modeling, regardless of its purpose for risk assessment or mitigation planning, should take into account more variables than risk-based hazard mapping. These variables are mainly a broader range of rainfall archetypes, impact of infrastructure through clogging or other failure, runoff variabilities through antecedent soil moisture, and operational limits (see lines 46-48 and lines 151-154, and section 2.2). However, this key message seems a bit hidden in the submission, as it is framed within the FLOOD-ST framework. If the authors agree that this is the main contribution, then this should be more strongly stressed throughout the paper and the focus should slightly shift away from promoting FLOOD-ST itself and towards presenting the key messages.

The authors extend the frame of FLOOD-ST to multiple different areas, up to urban pluvial impacts or GIS-based methods (line 291). Yet, most of the stated stressors are evidently focused on flash floods and more refined modeling methods. Also the given examples seem to incorporate these test cases and methods and do not touch on urban pluvial impacts or GIS-based methods. Therefore, I suggest to put a more concise limitation (albeit more a practical, not a theoretical) on the framework. I do not see this as a weakness of FLOOD-ST, it is simply better to state and limit the application of FLOOD-ST to where it is strongest and of most value.

FLOOD-ST is a powerful framework, yet it does seem impractical to adopt it in its full extent. The presented examples are carried out by rather large project consortiums with high-end expertise and resources. It would be interesting to know how much of FLOOD-ST is applicable on a smaller project scale, also in terms of expertise, since the coupling between different system inputs is not straightforward. The authors should elaborate on when a user is starting to use FLOOD-ST or simply just adding one feature (e.g. soil moisture) to conventional risk hazard mapping.

Specific Comments:
Line 42: The term drainage is misleading here. In an urban context (which FLOOD-ST also aims to cover), drainage does not increase flood peaks. I assume the authors want to mention the drainage through small ditches which are typical at agricultural lands, but I think the term in itself can be misleading here. I suggest the authors to either omit the term drainage here, or further explain which type of drainage they are pointing at.
Line 83-90: While I agree with the statements here, the authors should provide and cite sources to support the statements.
Line 94-97: The definition of tipping points could better be moved to the methodology section.
Line 168-172: This is an important new distinction in FLOOD-ST which might even be worth mentioning in the Motivation chapter. The authors should comment on whether model calibration might not actually support the framework. Even if calibration can only be fulfilled for a specific set of variables, which might differ at different scenarios (e.g. antecedent soil moisture), it can still be used to validate the model representation of other, more fixed variables, e.g. topography, roughness etc.
Table 1 and Figure 1: Figure 1 repeats a lot of information from Table 1 (Output, Outcome). Furthermore, Basis in Table 1 seems to be Input in Figure 1. The authors should try to limit repetition, but keep consistency in their wording when describing FLOOD-ST.
Line 198-207: If I understand correctly, the authors suggest to keep duration and rainfall amount constant within one archetype. However, the authors should also give some suggestions on how many different archetypes for one model domain are advised and provide further guidelines on how to define one archetype.
Line 280 and Line 281: FLOOD-ST seems to promote testing different stressors, and explicitly mentions that investigating average antecedent moisture conditions are also of high importance (line 235-238) and that this even presents “the full diagnostic capability” of FLOOD-ST. However, both given examples seem to mainly focus on the extreme spectrum, e.g. only investigating AMC III. The authors should therefore comment, why FLOOD-ST was not fully applied in the given examples and whether fully applying FLOOD-ST is even manageable. Furthermore, I don’t understand why these examples are given here, when there is the explicit section 2.4 “Current Application”.
Line 287: The way that FLOOD-ST is presented does not necessarily cover pluvial impacts. Pluvial flooding is mainly dependent on rainfall amount and the capacity of the drainage system combined with soil infiltration. Since the drainage system is only very implicitly covered by FLOOD-ST (maybe within infrastructure), I think the authors would do better to exclude pluvial flooding. Otherwise, the impact of underground drainage systems should be discussed more in detail within the framework.
Lines 332-338: This is mainly a repetition of information that has been mentioned multiple times before. The authors could skip this. In general, the benefits of FLOOD-ST have been covered before in the submission, so the authors might rephrase also the title of this overall section (possibly “Co-benefits”).
Lines 361-363: This mainly starts to touch on my concern in the general comments. I do not think data quality is a specific limitation of FLOOD-ST, but to flood modeling in general. The authors should more consider whether FLOOD-ST is appropriate for simplified methods on a coarse scale and should add more on the required expertise for applying FLOOD-ST and whether this might present another aspect of limitation.
Line 394 states that the stress tests are carried out on the full hydrologic system, which conflicts to different statements in the manuscript that seem to hint that different individual of FLOOD-ST can also be applied. Again, if it is always on the full hydrologic system, then this would put high limits on expertise and resources.

Technical Comments:
Line 52: Avoid the repetition of the word short here. E.g. you could change to “quick hydrological responses with often only short-term discharge records” or “short hydrological response times with often only recent discharge records”.
Line 62: I think the authors should replace banking sector with financial sector, as mentioned later.
The authors should recheck their spellings when it comes to capitalization, e.g., line 83 “(Flash) Floods”, line 123 “Scenarios”, line 269 “Indicators”.
Line 280: While a “2021-like” scenario might be understood in Luxembourg, Belgium, the Netherlands and Germany, the authors should better explain the term and explicitly refer to the flood event from 2021.
Line 287: delete the comma between on and river, and add a comma before “or cascading”.
Citation: https://doi.org/10.5194/egusphere-2025-6378-RC1
- AC1: 'Reply on RC1', Jens Reinert, 13 May 2026
  
  Dear Anonymous Referee #1,
  We sincerely thank you for the careful reading of our Technical Note and for the constructive and detailed comments. We are pleased that you recognize the relevance of the proposed FLOOD-ST framework and the value of moving beyond conventional flood hazard mapping towards a more diagnostic assessment of system behavior, weak points, and cascading effects.
  Your comments are very helpful for sharpening the scope, structure, and practical interpretation of the manuscript. In particular, we appreciate the suggestion to clarify the main contribution more explicitly, to distinguish more clearly between the state of research and our own methodological contribution, and to better define the practical limits and implementation levels of FLOOD-ST.
  Below, we provide a response to the main points raised and indicate how we plan to revise the manuscript accordingly.
  General comment 1: Restructure Introduction
  We thank the referee for this important comment and agree that the Introduction would benefit from a clearer and more linear structure. In the current version, the background, problem formulation, methodological motivation, and contribution of the manuscript are partly interwoven. We agree that this makes the central argument less direct than it should be.
  In the revised manuscript, we will reorganize the Introduction by first presenting the broader background and motivation, namely the limitations of conventional flood hazard mapping under non-stationary and infrastructure-sensitive flood conditions. We will then define the specific methodological gap addressed by the Technical Note: the need for a structured diagnostic approach that evaluates system behavior, weak points, operational limits, and cascading effects under variable stress conditions. Finally, we will present FLOOD-ST as our proposed contribution to this problem.
  We will therefore revise the order of reasoning in the Introduction, especially around the passages currently located in lines 45–48 and 61–65. The revised Introduction will more clearly separate background, problem statement, state of research, and the own contribution of the manuscript.
  General comment 2: Main contribution and key message
  We fully agree with the referee that the central contribution should not be understood primarily as the promotion of FLOOD-ST as a label, but as the systematic extension of flood hazard and risk-oriented modelling towards scenario-based diagnostics of system behavior.
  In the revised manuscript, we will therefore emphasize more clearly that the main contribution lies in the structured consideration of multiple stressors and system states that are often insufficiently represented in conventional hazard mapping. These include, among others, different rainfall archetypes, antecedent soil moisture conditions, infrastructure performance and failure, clogging, operational limits, and cascading effects.
  We will revise the manuscript accordingly so that this key message becomes more visible throughout the paper. FLOOD-ST will be presented less as a stand-alone concept to be promoted and more as a framework that operationalizes this broader methodological shift from static hazard representation towards diagnostic stress testing of flood-prone systems.
  General comment 3: Scope and practical limitation of FLOOD-ST
  We thank the referee for this valuable suggestion. We agree that the manuscript currently presents FLOOD-ST in a broad conceptual way, while the examples and the most strongly developed methodological elements are primarily related to flash-flood-oriented and hydrodynamically explicit applications.
  In the revised manuscript, we will clarify that FLOOD-ST is conceptually modular and can in principle support different flood-generation mechanisms. However, we will also state more explicitly that the strongest practical basis of the present Technical Note lies in applications where rapidly changing hydrological conditions, infrastructure performance, hydraulic routing, operational controls, and cascading effects are of central importance. This particularly includes flash-flood-oriented applications in small to medium-sized catchments and urban or peri-urban environments with relevant hydraulic infrastructure.
  We will add a more explicit practical limitation to avoid overstating the current level of demonstrated applicability. In particular, we will clarify that applications to fully urban pluvial systems require an adequate representation of urban drainage systems, infiltration processes, surface flow paths, and, where relevant, surface–subsurface interactions. Where such components are not represented, FLOOD-ST should be applied only in a more limited diagnostic sense.
  General comment 4: Practical applicability and smaller project scales
  We agree that this is an important point. The presented examples are indeed embedded in larger research and project contexts, and a full implementation of FLOOD-ST may require substantial data, modelling expertise, and computational resources. We will therefore expand the discussion of practical applicability and scalability.
  In the revised manuscript, we plan to introduce different levels of FLOOD-ST implementation. A minimum implementation could consist of extending an existing flood hazard model by one or more explicitly defined stressors, such as antecedent soil moisture or altered rainfall duration, provided that the results are interpreted diagnostically in terms of system behavior and weak points. A more advanced implementation would systematically combine multiple stressors, such as rainfall archetypes, soil moisture states, infrastructure performance, and operational conditions. A full system-oriented FLOOD-ST application would additionally include coupled system components, operational rules, cascading effects, and object-based impact indicators.
  This clarification will also address the referee’s question of when a user is applying FLOOD-ST and when they are simply adding one feature to a conventional hazard mapping workflow. We will define FLOOD-ST not by the number of stressors alone, but by the diagnostic logic: the explicit use of scenario variations to identify system behavior, weak points, thresholds, operational limits, and impact pathways.
  Specific and technical comments
  We thank the referee for the detailed specific and technical comments. These suggestions are very helpful and will be addressed carefully during the revision of the manuscript.
  In particular, we will revise the potentially misleading use of the term “drainage”, add supporting references where requested, reconsider the placement of the tipping point definition, and clarify the role of calibration within FLOOD-ST. We will also harmonize the terminology between Table 1 and Figure 1, reduce unnecessary repetition, and provide clearer guidance on the selection and definition of rainfall archetypes.
  Furthermore, we will revise the discussion of the examples to make clear that they illustrate selected diagnostic pathways rather than a complete application of all FLOOD-ST dimensions. We will also clarify the treatment of pluvial impacts and specify that such applications require an adequate representation of relevant urban drainage and infiltration processes. The limitation section will be revised to better distinguish general limitations of flood modelling from FLOOD-ST-specific issues, particularly regarding expertise, data availability, model complexity, and scalability.
  The technical corrections regarding wording, terminology, capitalization, punctuation, and the explanation of the “2021-like” scenario will also be implemented in the revised manuscript.
  Once again, we thank Anonymous Referee #1 for the constructive and detailed assessment. The comments will help us to make the manuscript more focused, more consistent, and more useful for both scientific and applied flood risk communities. We believe that the proposed revisions will substantially improve the clarity of the Technical Note, particularly with regard to the main contribution, practical scope, scalability, and limitations of FLOOD-ST.
  
  Citation: https://doi.org/10.5194/egusphere-2025-6378-AC1
RC2:
'Comment on egusphere-2025-6378', Anonymous Referee #2, 27 May 2026
This Technical Note presents FLOOD-ST, a scenario-based operational stress testing framework for diagnosing functional limits, weak points, and cascading failure pathways. The topic is relevant, particularly given the increasing concern over short-duration extreme rainfall, compound flood drivers, and the limitations of return-period-based design approaches. The conceptual framing is promising, and the two application contexts suggest that the framework is grounded in ongoing practice. However, several methodological components require clarification. The manuscript would be strengthened by clarifying the scenario-generation logic, providing a more explicit interface with climate change.
Specific comments
A central component of FLOOD-ST is the definition of rainfall archetypes such as stationary cloudbursts, translating convective storms, and multi-cell compound events. This is conceptually useful, but the manuscript should explain how these archetypes are selected, parameterized, and justified for a given catchment. For example, how are rainfall depth, duration and spatial pattern determined? Are they based on observed events, expert judgment, or rainfall generators? This issue is important because the current formulation could be perceived as subjective. The authors note that future work may explore machine learning, and stochastic exploration, but the current framework would benefit from a clearer baseline method for selecting plausible but severe scenarios.

The manuscript motivates FLOOD-ST using the intensification of short-duration extreme rainfall. However, the connection between this motivation and the actual design of rainfall stressors remains underdeveloped. Recent work on nonstationary IDF curves, and sub-daily precipitation changes could provide a stronger quantitative basis for defining rainfall triggers. For example, nonstationary IDF information could be used to specify time-varying or climate-adjusted rainfall depths, or stress increments relative to current design standards. This would help connect FLOOD-ST to existing engineering practice while still allowing the framework to move beyond fixed return periods. The authors should clarify how nonstationary rainfall information can inform the selection of “plausible worst-case” or “future-relevant” scenarios. See the following paper for reference. https://doi.org/10.1002/wat2.1519

A following comment: the manuscript states that FLOOD-ST is climate-adaptive and can use future analogues, ensemble forecasts, or climate-adjusted storm archetypes. However, it is not clear how future climate change is actually incorporated into the framework.

The current framework relies on representative rainfall archetypes and synthetic rainfall generators. A potentially valuable addition would be stochastic storm transposition, especially for generating physically plausible high-impact rainfall scenarios while preserving observed storm space–time structure. This could address one of the manuscript’s key limitations: the potential subjectivity in defining representative storm archetypes. Stochastic storm transposition could generate a large library of transposed historical storms. This connection would not necessarily turn FLOOD-ST into a conventional probabilistic risk framework. Rather, stochastic storm transposition could serve as an objective scenario-generation layer for stress testing. The authors may consider discussing this option, where they discuss scenario discovery and stochastic exploration. See 1016/j.jhydrol.2026.135617 and 10.1016/j.jhydrol.2020.124816

The comparison in Table 1 is useful, but some contrasts appear overly simplified. For example, traditional flood modelling is described as “linear” and “deterministic,” and as neglecting flood peaks and timing. In practice, many hydrological and hydraulic models used in hazard mapping are nonlinear and explicitly simulate hydrographs, routing, and timing, even if their regulatory outputs are often summarized as return-period maps.

The authors should ensure consistency in the spelling and capitalization of FLOOD-ST/Flood-ST.
Citation: https://doi.org/10.5194/egusphere-2025-6378-RC2
- AC2: 'Reply on RC2', Jens Reinert, 27 May 2026
  
  Dear Anonymous Referee #2,
  We sincerely thank you for carefully reviewing our Technical Note and for your constructive and helpful comments. We are pleased that you recognise the relevance of FLOOD-ST as a scenario-based operational stress testing framework and acknowledge its potential for diagnosing functional limits, weak points, and cascading failure pathways under increasingly complex flood-generating conditions.
  Your comments are particularly helpful in identifying where the methodological description of FLOOD-ST should be made more explicit. We agree that the manuscript would benefit from a clearer explanation of the scenario-generation logic, a stronger connection to nonstationary rainfall information and climate change, and a more precise discussion of how rainfall archetypes can be selected and justified in practice.
  In the revised manuscript, we will therefore expand the description of rainfall archetype selection and parameterisation. We will clarify that archetypes should not be selected arbitrarily, but should be derived from a transparent combination of observed historical events, regional hydroclimatic characteristics, design rainfall statistics, expert knowledge, and, where available, rainfall generators or climate-adjusted precipitation information. We will also specify more clearly how rainfall depth, duration, temporal distribution, and spatial pattern can be defined for a given catchment and diagnostic question.
  We agree that the link between the motivation of intensified short-duration rainfall and the actual design of rainfall stressors should be strengthened. In response, we will revise the manuscript to explain how nonstationary IDF curves, sub-daily precipitation scaling, and climate-adjusted design rainfall information can be used to define plausible worst-case or future-relevant rainfall stressors. This will help connect FLOOD-ST more explicitly to existing engineering practice, while preserving its diagnostic purpose beyond fixed return-period-based hazard mapping. We thank the referee for the suggested reference, which we will consider in this context.
  We also agree that the incorporation of future climate change should be described more concretely. We will therefore clarify that climate change can enter FLOOD-ST at different levels, for example through adjusted rainfall depths or intensities, altered temporal rainfall patterns, modified antecedent moisture conditions, future analogue events, ensemble-based rainfall scenarios, or climate-adjusted storm archetypes. In addition, we will clarify that future-relevant stressors are not limited to the precipitation side. Changes in the catchment area itself, such as urban expansion, increasing surface sealing, changes in land use, modified drainage capacity, or shifts in the composition and vulnerability of the urban fabric, can also be incorporated as stressors or scenario dimensions. In this sense, FLOOD-ST can combine climate projections with catchment development scenarios to assess how future hydro-meteorological forcing and changing exposure or system conditions jointly affect flood system performance. We will make it clear that FLOOD-ST does not require a single fixed climate scenario but provides a framework for translating climate and catchment-change information into stressors relevant to system diagnostics.
  The suggestion to discuss stochastic storm transposition is very valuable. We agree that this approach could provide an objective and physically plausible scenario-generation layer by preserving the space–time structure of observed storms while exploring their potential impact under different spatial placements. We will add this option to the discussion of scenario discovery and stochastic exploration and clarify that such approaches can complement FLOOD-ST without turning it into a conventional probabilistic risk framework. Instead, they can support the generation of plausible high-impact stress scenarios for diagnostic stress testing. We will consider the suggested references in this context.
  We further thank the referee for pointing out that some contrasts in Table 1 may be overly simplified. We agree that traditional hydrological and hydraulic models are not inherently linear or purely deterministic, and that many models used in hazard mapping explicitly simulate hydrographs, routing, and timing. The intended contrast was not between model physics, but between the dominant use of such models in regulatory or planning contexts, where outputs are often condensed into return-period-based hazard maps. We will revise Table 1 accordingly to avoid overgeneralization and to distinguish more carefully between modelling capability and typical hazard-mapping practice.
  Finally, we will ensure consistent spelling and capitalization of FLOOD-ST throughout the manuscript.
  Once again, we thank Anonymous Referee #2 for the constructive assessment. We believe that the suggested revisions will substantially improve the methodological clarity of the Technical Note, especially with regard to scenario generation, climate-change integration, and the practical use of rainfall archetypes in FLOOD-ST.
  
  Citation: https://doi.org/10.5194/egusphere-2025-6378-AC2

Jens Reinert, Elena-Maria Klopries, Paul Hassenjürgen, Alexander Menz, Julian Hofmann, Catrina Brüll, Angela Klein, and Holger Schüttrumpf

Viewed

Total article views: 1,461 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
770	610	81	1,461	84	104

HTML: 770
PDF: 610
XML: 81
Total: 1,461
BibTeX: 84
EndNote: 104

Views and downloads (calculated since 25 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	228	80	16	324
Mar 2026	400	389	56	845
Apr 2026	63	72	1	136
May 2026	55	49	5	109
Jun 2026	11	6	2	19
Jul 2026	13	14	1	28

Cumulative views and downloads (calculated since 25 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	228	80	16	324
Mar 2026	400	389	56	845
Apr 2026	63	72	1	136
May 2026	55	49	5	109
Jun 2026	11	6	2	19
Jul 2026	13	14	1	28

Viewed (geographical distribution)

Total article views: 1,442 (including HTML, PDF, and XML) Thereof 1,442 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 28 Jul 2026

Short summary

Flood risk in increasing as intense rainfall events become more frequent and water management systems are stressed beyond their original design. This study develops a new way to examine how rivers, flood protection measures and emergency operations behave during realistic but severe flood situations. Instead of focusing on single design events, we tested a range of plausible rainfall patterns, ground wetness conditions and infrastructure disruptions.


Total:	0
HTML:	0
PDF:	0
XML:	0