Preprints
https://doi.org/10.5194/egusphere-2024-2650
https://doi.org/10.5194/egusphere-2024-2650
21 Oct 2024
 | 21 Oct 2024
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Quality Control of Historical Temperature Data for Pure Rotational Raman Lidar Using Density-Based Clustering

Rongzheng Cao, Siying Chen, Wangshu Tan, Yixuan Xie, He Chen, Pan Guo, Rui Hu, Yinghong Yu, Jie Yu, and Shusen Yao

Abstract. This paper is the first to use two density-based clustering algorithms, Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Ordering Points To Identify the Clustering Structure (OPTICS), to screen the historical detection data of pure rotational Raman (PRR) temperature measurement lidar. To address the issues of threshold radius in DBSCAN and output value processing in OPTICS, three automated processing methods suitable for PRR temperature lidar detection data characteristics are proposed. These methods are the k-distance Fast Change Region (k-FCR) Method based on the DBSCAN, the Reachability Distance (RD) Method based on the OPTICS, and the Predecessor Divergence (PD) Method based on the OPTICS. Using these three methods, quality control was conducted on the historical data detected by a PRR temperature lidar from March 2021 to May 2024, demonstrating the effectiveness of these methods in automated quality control of historical data and the complementary nature of their quality control effects. Under the reliable threshold set in this paper, compared with the traditional Signal-to-Noise Ratio (SNR) method, the RD method increased the True Positive Rate (TPR) by 23.7 %, the PD method increased the True Negative Rate (TNR) by 6.0 %, and the k-FCR method increased the TPR by 72.1 % at the cost of some TNR loss. The influence of the SNR of data points and the number of continuous observation profiles on the quality control results is also explored, providing further references for the selection and application of different quality control methods. The methods provided in this paper will allow relevant researchers to filter PRR lidar data of atmospheric temperature according to their own needs, and these methods can also be applied to the automated processing of future atmospheric temperature data from detection networks.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Rongzheng Cao, Siying Chen, Wangshu Tan, Yixuan Xie, He Chen, Pan Guo, Rui Hu, Yinghong Yu, Jie Yu, and Shusen Yao

Status: open (until 26 Nov 2024)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Rongzheng Cao, Siying Chen, Wangshu Tan, Yixuan Xie, He Chen, Pan Guo, Rui Hu, Yinghong Yu, Jie Yu, and Shusen Yao
Rongzheng Cao, Siying Chen, Wangshu Tan, Yixuan Xie, He Chen, Pan Guo, Rui Hu, Yinghong Yu, Jie Yu, and Shusen Yao

Viewed

Total article views: 93 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
60 21 12 93 25 3 2
  • HTML: 60
  • PDF: 21
  • XML: 12
  • Total: 93
  • Supplement: 25
  • BibTeX: 3
  • EndNote: 2
Views and downloads (calculated since 21 Oct 2024)
Cumulative views and downloads (calculated since 21 Oct 2024)

Viewed (geographical distribution)

Total article views: 125 (including HTML, PDF, and XML) Thereof 125 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 20 Nov 2024
Download
Short summary
This study applied two density-based clustering algorithms to the quality control of temperature data from Raman lidar. Three automated methods were proposed, achieving automation in data quality control. The effectiveness of these three methods was verified using Raman temperature lidar data and ERA5 data from the past three years. Compared with the previous method, they have great improvements. Additionally, factors affecting the quality control results were further analyzed.