Preprints
https://doi.org/10.5194/egusphere-2024-3364
https://doi.org/10.5194/egusphere-2024-3364
19 May 2025
 | 19 May 2025

Cloud Fraction estimation using Random Forest classifier on Sky Images

Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Abstract. Cloud fraction (CF) is an integral aspect of weather and radiation forecasting, but real time monitoring of CF is still inaccurate, expensive and exclusive to commercial sky imagers. Traditional cloud segmentation methods, which often rely on empirically determined threshold values, struggle under complex atmospheric and cloud conditions. This study investigates the use of a Random Forest (RF) classifier for pixel-wise cloud segmentation using a dataset of semantically annotated images from five geographically diverse locations. The RF model was trained on diverse sky conditions and atmospheric loads, ensuring robust performance across varied environments. The accuracy score was always above 85 % for all the locations along with similarly high F1 score and Receiver Operating Characteristic – Area Under the Curve (ROC-AUC) score establishing the efficacy of the model. Validation experiments conducted at three Atmospheric Radiation Measurement (ARM) sites and two Indian locations, including Gadanki and Merak, demonstrated that the RF classifier outperformed conventional Total Sky Imager (TSI) methods, particularly in high-pollution areas. The model effectively captured long-term weather and cloud patterns, exhibiting strong location-agnostic performance. However, challenges in distinguishing sun glares and cirrus clouds due to annotation limitations were noted. Despite these minor issues, the RF classifier shows significant promise for accurate and adaptable cloud cover estimation, making it a valuable tool in climate studies.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-3364', Anonymous Referee #2, 06 Jun 2025
    • AC1: 'Reply on RC1', Sougat Kumar Sarangi, 31 Jul 2025
  • RC2: 'Comment on egusphere-2024-3364', Anonymous Referee #1, 16 Jun 2025
    • AC2: 'Reply on RC2', Sougat Kumar Sarangi, 31 Jul 2025
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam
Sougat Kumar Sarangi, Chandan Sarangi, Niravkumar Patel, Bomidi Lakshmi Madhavan, Shantikumar Singh Ningombam, Belur Ravindra, and Madineni Venkat Ratnam

Viewed

Total article views: 367 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
306 40 21 367 8 24
  • HTML: 306
  • PDF: 40
  • XML: 21
  • Total: 367
  • BibTeX: 8
  • EndNote: 24
Views and downloads (calculated since 19 May 2025)
Cumulative views and downloads (calculated since 19 May 2025)

Viewed (geographical distribution)

Total article views: 363 (including HTML, PDF, and XML) Thereof 363 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 30 Aug 2025
Download
Short summary
This study introduces a new approach to measure cloud cover from image data taken by ground based sky observations. Our method used diverse sky images taken from various locations across the globe to train our machine learning model. We achieved a very high accuracy in detecting cloud cover, even in polluted areas. Our model surpasses traditional methods by running efficiently with minimal computational needs.
Share