Preprints
https://doi.org/10.5194/egusphere-2024-3364
https://doi.org/10.5194/egusphere-2024-3364
19 May 2025
 | 19 May 2025
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Cloud Fraction estimation using Random Forest classifier on Sky Images

Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Abstract. Cloud fraction (CF) is an integral aspect of weather and radiation forecasting, but real time monitoring of CF is still inaccurate, expensive and exclusive to commercial sky imagers. Traditional cloud segmentation methods, which often rely on empirically determined threshold values, struggle under complex atmospheric and cloud conditions. This study investigates the use of a Random Forest (RF) classifier for pixel-wise cloud segmentation using a dataset of semantically annotated images from five geographically diverse locations. The RF model was trained on diverse sky conditions and atmospheric loads, ensuring robust performance across varied environments. The accuracy score was always above 85 % for all the locations along with similarly high F1 score and Receiver Operating Characteristic – Area Under the Curve (ROC-AUC) score establishing the efficacy of the model. Validation experiments conducted at three Atmospheric Radiation Measurement (ARM) sites and two Indian locations, including Gadanki and Merak, demonstrated that the RF classifier outperformed conventional Total Sky Imager (TSI) methods, particularly in high-pollution areas. The model effectively captured long-term weather and cloud patterns, exhibiting strong location-agnostic performance. However, challenges in distinguishing sun glares and cirrus clouds due to annotation limitations were noted. Despite these minor issues, the RF classifier shows significant promise for accurate and adaptable cloud cover estimation, making it a valuable tool in climate studies.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Status: open (until 07 Jul 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-3364', Anonymous Referee #2, 06 Jun 2025 reply
  • RC2: 'Comment on egusphere-2024-3364', Anonymous Referee #1, 16 Jun 2025 reply
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Viewed

Total article views: 143 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
113 23 7 143 3 3
  • HTML: 113
  • PDF: 23
  • XML: 7
  • Total: 143
  • BibTeX: 3
  • EndNote: 3
Views and downloads (calculated since 19 May 2025)
Cumulative views and downloads (calculated since 19 May 2025)

Viewed (geographical distribution)

Total article views: 142 (including HTML, PDF, and XML) Thereof 142 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 16 Jun 2025
Download
Short summary
This study introduces a new approach to measure cloud cover from image data taken by ground based sky observations. Our method used diverse sky images taken from various locations across the globe to train our machine learning model. We achieved a very high accuracy in detecting cloud cover, even in polluted areas. Our model surpasses traditional methods by running efficiently with minimal computational needs.
Share