Evaluating Surface Mass Balance Variability from Climate Models using GPS Bedrock Vertical Time Series data

Rajavarathan, Jenan; King, Matt; Watson, Christopher; Hansen, Nicolaj

doi:10.5194/egusphere-2026-1146

Preprints

https://doi.org/10.5194/egusphere-2026-1146

09 Apr 2026

Status: this preprint is open for discussion and under review for The Cryosphere (TC).

Abstract. Accurate estimates of Antarctic Surface Mass Balance (SMB) are essential for quantifying ice-sheet mass changes and their contributions to global sea level rise. Regional Climate Models (RCMs) and atmospheric reanalyses provide SMB products that are widely used in glaciology and climatology studies, yet substantial discrepancies between models persist. This study evaluates interannual to decadal variability in seven SMB models by comparing elastic vertical bedrock displacements computed from SMB loading with GPS vertical time series from across Antarctica. The models vary in spatial and temporal resolution: RACMO2.3p2 (27 km), RACMO2.4p1 (11 km), statistically downscaled RACMO2.3p2 (2 km), MAR (35 km), GEMB (10 km), HIRHAM5 (12.5 km) and MERRA2 (12.5 km). Model performance is assessed by quantifying the low-frequency variance reduction in GPS residuals after SMB loading correction and by computing scale factors between the observed and modelled time series. Results indicate that all considered SMB models reduce long-period (>1.5 yr) GPS variance on average, but performance varies across Antarctic regions and GPS sites. The RACMO variants, especially the higher-resolution ones (2 and 11 km), perform best overall, typically achieving the largest variance reductions and yielding scale factors closest to unity, particularly in the Antarctic Peninsula and along the coastal margin of Antarctica; MERRA2 and HIRHAM5 have the weakest overall performance. Our findings suggest that GPS observations, with some limitations, provide a useful new constraint for SMB model evaluation, yielding insights into spatial and temporal variability that traditional SMB model evaluations are unable to fully resolve.
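The two performance metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the moving-average low-pass filter, the synthetic series, the function names, and the direction of the scale factor (GPS regressed on the model) are all assumptions made here for illustration.

```python
import numpy as np

def lowpass(x, window):
    """Moving average; an assumed stand-in for the study's low-pass filter."""
    kernel = np.ones(window) / window
    return np.convolve(x, kernel, mode="valid")

def variance_reduction(gps, model):
    """Percentage of GPS variance removed by subtracting the modelled displacement."""
    return 100.0 * (1.0 - np.var(gps - model) / np.var(gps))

def scale_factor(gps, model):
    """Least-squares s minimising ||gps - s * model||^2 after removing the means."""
    g = gps - gps.mean()
    m = model - model.mean()
    return float(m @ g / (m @ m))

# Synthetic monthly series (hypothetical values, not data from the study).
rng = np.random.default_rng(0)
t = np.arange(240) / 12.0                       # time in years
loading = 3.0 * np.sin(2.0 * np.pi * t / 5.0)   # mm, 5-year loading signal
gps = loading + rng.normal(0.0, 1.0, t.size)    # "observed" displacement with noise
model = 0.8 * loading                           # model with 20 % too little amplitude

window = 19                                     # months, roughly 1.5 yr
gps_lp = lowpass(gps, window)
model_lp = lowpass(model, window)

vr = variance_reduction(gps_lp, model_lp)
s = scale_factor(gps_lp, model_lp)
print(f"long-period variance reduction: {vr:.1f} %, scale factor: {s:.2f}")
```

With this convention, a scale factor above one means the model underestimates the observed long-period amplitude, and a negative variance reduction means that subtracting the model makes the GPS residual noisier.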

Received: 03 Mar 2026 – Discussion started: 09 Apr 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 2274 KB)

Supplement (3382 KB)

Status: open (until 21 May 2026)

RC1: 'Comment on egusphere-2026-1146', Anonymous Referee #1, 30 Apr 2026

The paper "Evaluating Surface Mass Balance Variability from Climate Models using GPS Bedrock Vertical Time Series Data", by Rajavarathan et al., presents a new approach for assessing the performance of Surface Mass Balance (SMB) models using vertical land motion derived from GPS observations. In this paper, the authors use seven SMB model products to calculate the corresponding loading displacements, which are then compared to GPS as an independent observational reference. The paper suggests that GPS provides a useful constraint on SMB model evaluation, with varying performance between models depending on their resolution and forcing. Interestingly, all SMB-corrected GPS time series consistently show reduced long-period (>1.5 yr) variance on average, but performance varies across Antarctic regions and GPS sites.
The data processing is rigorous, and the discussion of the influence of SMBL on GPS time series, as well as the spectral analysis of residual time series, is adequate. Overall, the study is well executed and contributes valuable insights into estimates of ice-sheet mass variability and its varying contribution to sea-level change. However, a few minor aspects of the analysis require further justification:
- The authors indicate in Section 2.1 that they compute the elastic loading displacements in a centre-of-mass-of-the-solid-Earth (CE) reference frame at each GPS site location. However, GPS displacement time series are in the centre-of-figure (CF) frame. Although the CE frame closely approximates the CF frame, it is important to acknowledge this potential mismatch in the manuscript.
- The authors also mention that SMB anomalies are bilinearly interpolated onto a common regular grid of 2 km resolution. The discussion would benefit from addressing the effect of this interpolation, specifically the smoothing of the signal compared with using the native grid resolution.
- In line 170, it is also worth mentioning that different SMB models adopt different topography grids, which may contribute to variability in spatial coherence.
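The interpolation point raised above can be illustrated with a small one-dimensional sketch. It assumes a synthetic, narrow SMB anomaly along a hypothetical transect and a hypothetical 30 km coarse grid; linear interpolation along one axis stands in for bilinear interpolation, and none of the numbers come from the manuscript.

```python
import numpy as np

# Hypothetical transect sampled every 2 km (all values are synthetic).
x_fine = np.arange(0.0, 540.0, 2.0)                 # km, 270 points
truth = np.exp(-((x_fine - 270.0) / 15.0) ** 2)     # narrow anomaly, arbitrary units

# Assumed coarse model: average the fine field into 30 km cells (15 points each).
points_per_cell = 15
coarse = truth.reshape(-1, points_per_cell).mean(axis=1)
x_coarse = x_fine.reshape(-1, points_per_cell).mean(axis=1)

# Linear interpolation of the coarse field back onto the 2 km grid.
back_on_fine = np.interp(x_fine, x_coarse, coarse)

print("peak on native 2 km grid:       ", truth.max())
print("peak of coarse cell averages:   ", coarse.max())
print("peak after interpolation to 2 km:", back_on_fine.max())
```

In this sketch the interpolated field has 2 km spacing but its peak is no higher than the largest coarse cell average, so the regridding restores none of the amplitude lost at coarse resolution.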

Citation: https://doi.org/10.5194/egusphere-2026-1146-RC1

CC1: 'Comment on egusphere-2026-1146', Nicole-Jeanne Schlegel, 01 May 2026

Hi Authors,
Thank you for your work on this. Just a note about the spatial resolution of GEMB. It looks like you are using the Zenodo version forced with ERA5. Unlike the version in the GEMB paper, which was forced with high-spatial-resolution RACMO output, the ERA5 version of GEMB is forced directly with low-resolution reanalysis (~0.25 degrees). This means the input is lower resolution than 10 km. Even though the model is run, and reported, at a higher spatial resolution, in this version of GEMB there is no true "downscaling" involved. The ERA5 inputs are simply interpolated to the GEMB grid before the energy and mass balance is calculated. This is probably the reason that the results look smoother and more like a lower-resolution output. While the current GEMB release simply runs the column snow model on the input it receives, downscaling routines will be part of a release in the near future, and we should expect to see a sharpening of gradients in areas like the margins. Happy to discuss more if you need more information.
Best wishes and good luck with the paper, Nicole

Citation: https://doi.org/10.5194/egusphere-2026-1146-CC1

RC2: 'Comment on egusphere-2026-1146', Brooke Medley, 12 May 2026
Summary
Here, the authors use novel GPS vertical time series from sites across Antarctica to evaluate the variability in SMB from several atmospheric models, focusing on the low-frequency variance reduction in the GPS signals after SMB loading correction. They find that all models reduce the variance, but by different amounts, and that performance also varies by location. The paper adds a new element to SMB model evaluation: the ability to assess variations in SMB, whereas models are often evaluated against annual to centennial averages of net SMB from in situ measurements or ground-based and airborne radar studies.
Evaluation
Overall, the manuscript is well written and contains clear and concise descriptions of the methodology and interpretations (although there are a few minor suggestions for some additional detail). Given the difficulty in evaluating SMB across Antarctica, it is a timely and very important submission, especially as more SMB models come online at increasingly finer spatial resolution. While the solid earth modeling is beyond my expertise and I have few comments regarding the methodology, I do believe that the paper would benefit from additional exploration or discussion of various interpretations of the findings, which are detailed below. A few minor comments follow these more substantive comments.
The authors are clearly aware of this issue, as it is discussed briefly in the text, but it is not really clear (perhaps because it genuinely is not) which dimension of variability is being evaluated here; I assume the answer is likely variability in both space and time. The title states, "Evaluating Surface Mass Balance Variability from Climate Models…", but it is obviously a convolution of space and time, which makes it somewhat more difficult for modelers to assess why their model performs as it does. This conundrum is central to the paper, and I believe it deserves more exploration. A few additional thoughts to consider:
Can one evaluate from each model the cells where the spatial signal exceeds the local signal? As in, where does the far field signal exceed the near field? Perhaps this would help us better understand the strengths/weaknesses of each model.

Related to the above, the GPS measurements are typically restricted to areas of high spatial SMB variability in coastal regions (although a few interior sites do exist). Because of this potential bias, the GPS sites that you use might be focused on regions where the far field signal exceeds the local scale, suggesting the GPS are more evaluating the spatial signal in the variability as opposed to the actual local variability through time, which would then bias performance metrics towards those models with finer spatial resolution.

And related to the above, was there consideration of weighting of the variance reductions across all sites for each model? One might attribute the best performance to a specific model but perhaps that is solely because the sites happened to be concentrated in their region of highest performance. Some additional discussion of the sparsity of sampling is worthwhile.

It would be interesting to explore the site-by-site variance reductions more. Which sites are all negative or positive contributors to variance reduction? Which has the largest range of performances? Across all models there are a substantial number of sites where the variance increases (red dots); what does this imply? There is SMB signal that the model is not capturing? It’s also important to note that the model with the highest variance explained had a median reduction in variance of 23%, which is still very small, suggesting there is a lot of unexplained SMB variability even in the highest performer.

Finally, is there any impact of the choice of the time span of the reference climate interval or the long-term mean SMB on your results? After generating the cumulative anomalies, the authors detrend the time series, so that likely accounts for much of the differences that would result from a variable choice of reference time interval. There could still be minor differences, however. Is it worthwhile to explore various time reference intervals to see if it impacts the ability of the SMB models to reduce the variance in the residuals? Does the solid earth response have a memory of the historical mass loadings? The authors state that (while not perfect) there does appear to be some correlation between higher accumulation and higher accumulation variability in time with more variance explained. It could be that regions with higher mass loadings have a higher mass flux where the total mass signal has a “shorter” memory, resulting in a ~40 year record that can approximate most of the fluctuations whereas at lower accumulation sites there is a much longer memory that models cannot explain.
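The reference-interval question above can be explored with a minimal sketch under simplifying assumptions: the reference climate is a single constant mean, the detrending is a linear fit over the whole series, and the monthly SMB values are synthetic. The function name and the chosen intervals are hypothetical and do not come from the manuscript.

```python
import numpy as np

rng = np.random.default_rng(1)
n_months = 516                                   # e.g. 43 years of monthly values
smb = 20.0 + rng.normal(0.0, 5.0, n_months)      # synthetic SMB, arbitrary units

def detrended_cumulative_anomaly(series, ref):
    """Cumulate anomalies about the mean over `ref`, then remove a linear fit."""
    anomaly = series - series[ref].mean()
    cumulative = np.cumsum(anomaly)
    k = np.arange(series.size)
    slope, intercept = np.polyfit(k, cumulative, 1)
    return cumulative - (slope * k + intercept)

# Two hypothetical reference intervals: the full record and the first 20 years.
full_ref = detrended_cumulative_anomaly(smb, slice(None))
early_ref = detrended_cumulative_anomaly(smb, slice(0, 240))

max_diff = np.max(np.abs(full_ref - early_ref))
print("largest difference after detrending:", max_diff)
```

Under these assumptions, changing the reference mean only adds a term that is linear in time to the cumulative anomaly, so the linear detrending removes it up to rounding error. Differences could still arise if the reference is a seasonal climatology rather than a single mean, or if the detrending window differs from the cumulation window; the sketch also says nothing about the solid earth memory question.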

Minor Comments
Line 65 states that the SMB anomalies are determined using a reference climate computed over 1980-2022, but Table S1 shows different start and end dates. I'm assuming that they are indeed all calculated over their common interval and that the start and end dates are simply the temporal extent of each model. Also, I assume that the detrending is based on that same interval (1980-2022)?

For clarification, the MERRA-2 SMB product is provided at 12.5 km resolution, but the temporal variability comes from the MERRA-2 reanalysis, which is very coarse in space (0.5 degrees latitude by 0.625 degrees longitude). The precipitation magnitudes come from the 12.5 km high-resolution M2R12K data product. Therefore, it is more of a blend between 12.5 km and several tens of kilometres. Given that this paper focuses on variability, the product's representative resolution is likely much coarser than 12.5 km, because the variability is driven by MERRA-2 at coarse resolution (see Medley et al., 2022).

References
Medley, B., Neumann, T. A., Zwally, H. J., Smith, B. E., & Stevens, C. M. (2022). Simulations of firn processes over the Greenland and Antarctic ice sheets: 1980–2021. The Cryosphere, 16(10), 3971-4011.

Citation: https://doi.org/10.5194/egusphere-2026-1146-RC2

Supplement

https://doi.org/10.5194/egusphere-2026-1146-supplement

Latest update: 19 May 2026

Short summary

We use long-term GPS bedrock measurements across Antarctica to assess modelled Surface Mass Balance (SMB) variability. Seven models of SMB loading displacement are evaluated on how well they match the GPS time series, including their ability to reduce long-period variations in the GPS. All models reduce long-period variations, but performance varies by site and model. RACMO SMB model variants perform best overall, suggesting they provide more realistic estimates of Antarctic mass variability.

