Preprints
https://doi.org/10.5194/egusphere-2024-3183
https://doi.org/10.5194/egusphere-2024-3183
22 Oct 2024
 | 22 Oct 2024
Status: this preprint has been withdrawn by the authors.

Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model

Qian Liu, Bing Gong, Xiaoran Zhuang, Xiaohui Zhong, Zhiming Kang, and Hao Li

Abstract. The rapid advancement of artificial intelligence (AI) in weather research has been driven by the ability to learn from large, high-dimensional datasets. However, this progress also poses significant challenges, particularly regarding the substantial costs associated with processing extensive data and the limitations of computational resources. Inspired by the Neural Image Compression (NIC) task in computer vision, this study seeks to compress weather data to address these challenges and enhance the efficiency of downstream applications. Specifically, we propose a variational autoencoder (VAE) framework tailored for compressing high-resolution datasets, specifically the High Resolution China Meteorological Administration Land Data Assimilation System (HRCLDAS) with a spatial resolution of 1 km. Our framework successfully reduced the storage size of 3 years of HRCLDAS data from 8.61 TB to just 204 GB, while preserving essential information. In addition, we demonstrated the utility of the compressed data through a downscaling task, where the model trained on the compressed dataset achieved accuracy comparable to that of the model trained on the original data. These results highlight the effectiveness and potential of the compressed data for future weather research.

This preprint has been withdrawn.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Qian Liu, Bing Gong, Xiaoran Zhuang, Xiaohui Zhong, Zhiming Kang, and Hao Li

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-3183', Anonymous Referee #1, 15 Nov 2024
  • CEC1: 'Comment on egusphere-2024-3183 - No compliance with the policy of the journal', Juan Antonio Añel, 02 Dec 2024
    • AC1: 'Reply on CEC1', Bing Gong, 16 Dec 2024
    • AC2: 'Reply on CEC1', Bing Gong, 20 Dec 2024
      • CEC2: 'Reply on AC2', Juan Antonio Añel, 22 Dec 2024
        • AC3: 'Reply on CEC2', Bing Gong, 28 Dec 2024
          • CEC3: 'Reply on AC3', Juan Antonio Añel, 28 Dec 2024
            • AC4: 'Reply on CEC3', Bing Gong, 28 Dec 2024

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-3183', Anonymous Referee #1, 15 Nov 2024
  • CEC1: 'Comment on egusphere-2024-3183 - No compliance with the policy of the journal', Juan Antonio Añel, 02 Dec 2024
    • AC1: 'Reply on CEC1', Bing Gong, 16 Dec 2024
    • AC2: 'Reply on CEC1', Bing Gong, 20 Dec 2024
      • CEC2: 'Reply on AC2', Juan Antonio Añel, 22 Dec 2024
        • AC3: 'Reply on CEC2', Bing Gong, 28 Dec 2024
          • CEC3: 'Reply on AC3', Juan Antonio Añel, 28 Dec 2024
            • AC4: 'Reply on CEC3', Bing Gong, 28 Dec 2024
Qian Liu, Bing Gong, Xiaoran Zhuang, Xiaohui Zhong, Zhiming Kang, and Hao Li
Qian Liu, Bing Gong, Xiaoran Zhuang, Xiaohui Zhong, Zhiming Kang, and Hao Li

Viewed

Total article views: 437 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
321 95 21 437 6 5
  • HTML: 321
  • PDF: 95
  • XML: 21
  • Total: 437
  • BibTeX: 6
  • EndNote: 5
Views and downloads (calculated since 22 Oct 2024)
Cumulative views and downloads (calculated since 22 Oct 2024)

Viewed (geographical distribution)

Total article views: 417 (including HTML, PDF, and XML) Thereof 417 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 21 Jan 2025
Download

This preprint has been withdrawn.

Short summary
This paper presents a machine learning-based data compression method for weather and climate research, addressing the computational challenges posed by large datasets in weather applications. We propose a novel VAE framework that compresses three years of 1 km resolution data from 8.61 TB to 204 GB. This reduction significantly lowers computational resource requirements. We also demonstrate the compressed data's effectiveness by downscaling the Fuxi AI forecast model.