Intra-city Scale Graph Neural Networks Enhance Short-term Air Temperature Forecasting
Abstract. Air temperature (Ta) has critical implications for various socioeconomic sectors, yet its dynamics are particularly complex in urban areas due to heterogeneous built environments, landscapes, and diverse anthropogenic activities. Physics-based models struggle with intra-city Ta forecasts due to inadequate urban representation and limited spatial resolution. While weather observation networks offer promising alternatives for direct local Ta modeling, an effective framework to leverage these intra-city data remains lacking. Here, we demonstrate that graph neural networks (GNNs) can harness observation network information to refine Ta prediction at individual locations and elucidate underlying mechanisms. Our novel Mix-n-Scale framework with GNNs achieves over 12 % improvement in short-term Ta forecasts compared to conventional time-series approaches. Further model evaluation disentangles performance variations with local Ta variability in diverse spatiotemporal contexts, indicating distinct patterns of intra-city heterogeneity across seasonal and diurnal scales. Our findings establish graph-based approaches for leveraging proliferating urban sensor data and advancing understanding of Ta spatiotemporal dynamics in complex urban environments.
Competing interests: The authors declare no conflicts of interest relevant to this study.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
This study introduces a novel Mix-n-Scale framework with GNNs for short-term air temperature (Ta) forecasting at the intra-city scale. The authors convincingly demonstrate that incorporating spatial interactions improves forecast skill, achieving more than a 12 % reduction in RMSE compared with a conventional LSTM-only baseline. They provide a thorough evaluation of model performance and present well-structured analyses. Overall, this is a solid paper and is a meaningful contribution with clear potential to advance short-term forecasting in urban areas. However, there are several areas where additional experiment and discussion would strengthen the manuscript. I would support publication after minor revisions.
Major comments:
Minor comments: