A Factorized Fourier Neural Operator Surrogate for Basin-Scale Tsunami Propagation
Abstract. Tsunami models have been developed for several decades, and many have shown good agreement with observations from real world events. The model solves wave equations, but simulation is usually computationally expensive in a large-scale basin. To assess potential tsunami impacts, ensemble analysis is standard for sampling source uncertainties, but computational costs constrain the number of scenarios that can be evaluated. Machine‑learning approaches have been developed to reduce the computational burden and accelerate typical tsunami‑ensemble analyses. However, these surrogate models are usually task-specific; they emulate buoy signals, sensor inputs, and maximum water level maps. Recent advances in machine learning techniques, such as neural operators, allow learning full wave evolution from physics-based simulations. Here, we introduce a data-driven tsunami surrogate model based on a Factorized Fourier Neural Operator (F-FNO). Memory-efficient F-FNO supports higher Fourier mode capacity, enabling the tsunami surrogate model to learn scenario-based COMCOT simulations and generalize to unseen epicenter locations/extrapolated magnitudes. We designed logic tree-based COMCOT simulations for the East Sea (Sea of Japan) to construct a surrogate operator. The F-FNO learns tsunami propagation through a short sequence of wavefield states and creates a general operator function that generates future wave and velocity fields. From the logic tree, we hold out the largest magnitude (8.0) and one specific source location for model evaluation and to test the scalability of the neural operator. As a result, the surrogate predicted tsunami waves with root mean square errors in surface elevation of 2–8 cm and first-arrival timing errors of approximately 8–12 min. Running the F-FNO surrogate requires approximately 8.5–12 s per scenario on a single GPU, compared to 87.9–95.7 s of COMCOT simulation time. The computational efficiency of the operator and its potential to scale to larger scenario ensembles support more timely tsunami scenario analysis and can complement physics-based solvers in offshore applications.
General comments:
The paper creates a new type of tsunami surrogate model. It is based on a Factorized Fourier Neural Operator (F-FNO) a good choice due for NO due to the ability of F-FNO to capture higher Fourier modes -- as shown in the comparison with regular FNOs. The application to the sea of Japan tsunami sources shows excellent timing but the amplitudes are underestimated by some margin.
The method is underperforming against more established emulation approaches cited in the context of peaks for hazard assessments. However the F-FNO benefits are more about its ability to emulate wave propagation between function spaces: inputs and outputs can be fields. The impulsive source forcing is interesting as well as physics-guided learning strategies. Nice implementation and extension of FNO, with lots of care in the accumulation of errors over hundreds of steps, and loss tailored to the problem, with efficient implementation. The design of experiments is unfortunately substandard.
I suggest major revisions (or reject and resubmit if more time is needed) as there is lots of potential but the current version suffers from a few issues that need to be addressed:
Specific comments:
The main attraction of NOs as explained, is the ability to take as inputs fields v GPs and other approaches. What can we learn about the dynamics and patterns with an F-FNO emulator v GPs?
The logic tree lines 202-211 is not a good design to train the F-FNO, unable to explore enough across 3-4 values only, compared to say a Latin Hybercube or a sequential design. It is also very large with 486 values despite for only 6 parameters (for each scaling law type). A GP would need less than 100 samples to be trained on 6 parameters. The training needs to be redone and discussed. I expect a big improvement in the quality of the emulation.
For validation, the authors hold out the largest magnitude (8.0) and one specific source location for model evaluation and to test the scalability of the neural operator.
This is too small for validation, I suggest a Leave-one-out strategy.
There is no comparison with other techniques on the example case. In the discussion some mention of the possibly higher accuracy of other methods due to focus on specific outputs, but some illustrations of shortcomings/strength of the method would be good.
The introduction is good, the motivation explicit with the aim to go beyond "predicting a fixed set of outputs such as maximum water level and inundation maps": explain the benefits obtained from that? Dynamics are mentioned and nice motivation but what can potentially be learned from this approach?
Fig 8 there is an obvious issue of extrapolation as outside the range of training.
Fig 10 and 11 very interesting. Often underestimated of peak wave height discussed in section 3.3
Speed-up is interesting 10x but COMCOT is fast model, PCOMCOT is discussed as more relevant extension 50-75x but also higher resolution near the coastline (as discussed but not performed) would be more expensive and more accurate?
Usually surrogates allow gains of multiple orders of magnitude. Ideally a higher resolution model than a fast coarse resoution COMCOT should be the simulator that will then be emulated.
The whole paragraph in the Conclusions "The surrogate functions as a rapid offshore boundary condition generator, facilitating the creation of nested grids that enhance the resolution of site-specific hazard studies. By accelerating the outer grid propagation.." is not fit for purpose. Indeed the main simulation costs are local not propagation in seconds but local high resolution coastal modelling. I suggest to remove this paragraph and ideally run the model at higher resolution -- including near the coastline -- to train te F-FNO. It should actually boost the quality of the F-FNO (using the right design) and provide a stronger statement of success in terms of impact on hazard assessements and warning and in terms of computational efficiency.