Emulating Aerosol Optics with Randomly Generated Neural Networks
Abstract. Atmospheric aerosols have a substantial impact on climate and remain one of the largest sources of uncertainty in climate forecasts. Accurate representation of their direct radiative effects is a crucial component of modern climate models. Direct computation of the radiative properties of aerosols is far too computationally expensive to perform within a climate model, however, so optical properties are typically approximated using a parameterization. This work develops artificial neural networks (ANNs) capable of replacing the current aerosol optics parameterization used in the Energy Exascale Earth System Model (E3SM). A large training dataset is generated by using Mie code to directly compute the optical properties of a range of atmospheric aerosol populations given a large variety of particle sizes, wavelengths, and refractive indices. Optimal neural architectures for shortwave and longwave bands are identified by evaluating ANNs with randomly generated wirings. Randomly generated deep ANNs are able to outperform conventional multi-layer perceptron style architectures with comparable parameter counts. Finally, the ANN-based parameterization is found to dramatically outperform the current parameterization. The success of this approach makes possible the future inclusion of much more sophisticated representations of aerosol optics in climate models that cannot be captured through simple expansion of the existing parameterization scheme.
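To illustrate the "randomly generated wirings" idea described in the abstract, the sketch below builds a small network whose dense layers are connected according to a random directed acyclic graph and evaluates it with NumPy. This is a hypothetical minimal example, not the authors' code; all function names (`random_dag`, `build_random_net`, `forward`) and the specific sampling scheme are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_dag(n_nodes, p=0.5):
    """Random DAG adjacency: node j may receive input from any earlier node i."""
    adj = np.triu(rng.random((n_nodes, n_nodes)) < p, k=1)
    # guarantee every non-input node has at least one incoming edge
    for j in range(1, n_nodes):
        if not adj[:j, j].any():
            adj[rng.integers(0, j), j] = True
    return adj

def build_random_net(n_in, n_out, n_hidden=4, width=8):
    """Sample a randomly wired net: one input node, n_hidden dense nodes, one output node."""
    n = n_hidden + 2
    adj = random_dag(n)
    sizes = [n_in] + [width] * n_hidden + [n_out]
    weights = {}
    for j in range(1, n):
        # each node concatenates the outputs of all of its predecessors
        fan_in = sum(sizes[i] for i in range(j) if adj[i, j])
        weights[j] = (rng.standard_normal((fan_in, sizes[j])) / np.sqrt(fan_in),
                      np.zeros(sizes[j]))
    return adj, weights, sizes

def forward(x, adj, weights):
    """Evaluate the random wiring: ReLU on hidden nodes, linear output node."""
    acts = [x]
    n = adj.shape[0]
    for j in range(1, n):
        inp = np.concatenate([acts[i] for i in range(j) if adj[i, j]], axis=-1)
        W, b = weights[j]
        h = inp @ W + b
        acts.append(np.maximum(h, 0.0) if j < n - 1 else h)
    return acts[-1]

# demo: a 5-sample batch through one randomly wired network
adj, weights, sizes = build_random_net(n_in=3, n_out=2)
y = forward(rng.standard_normal((5, 3)), adj, weights)
print(y.shape)  # (5, 2)
```

Searching over many such random wirings, and keeping the best performer, is one simple way to explore architectures beyond a plain stack of layers.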
Andrew Geiss et al.
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2022-559', Anonymous Referee #1, 15 Jul 2022
- RC2: 'Comment on egusphere-2022-559', Anonymous Referee #2, 10 Aug 2022
- CEC1: 'Comment on egusphere-2022-559', Juan Antonio Añel, 23 Aug 2022
- AC1: 'Reply on CEC1', Andrew Geiss, 06 Oct 2022
Data sets
Aerosol Optics ML Datasets https://doi.org/10.5281/zenodo.6762700
Model code and software
Aerosol Optics ML Code https://doi.org/10.5281/zenodo.6767169
This manuscript describes the development of neural networks to replace the aerosol optics parameterization in a climate model with a more detailed treatment, based on running the same type of code at higher detail to develop a parameterization that better represents the detailed modeling. In general the manuscript is well written and clear, and there is a nice discussion of how the neural network is developed. However, my main critique is that the evaluation section (Section 5) is pretty minimal: just some error curves. What does it look like in the full model? You have demonstrated that the new parameterization represents the detailed code better than the existing parameterization, but does it change the answers in the climate model it is designed for in any meaningful way, and does it cost any more to run? It would also be good to note in the conclusions what lessons you learned from this experience about building neural networks for parameterization replacement. This is probably suitable for publication with minor revisions, but ideally with at least an attempt to run it in a climate model.
Detailed comments below.
Page 1, L10: Would be good to have more detail on what ‘outperform’ means specifically in another sentence or two.
Page 1, L15: Disingenuous. The direct effects of aerosols are not the largest uncertainty; only the indirect effects on clouds are.
Page 2, L29: Example of climate models generating training data (for replacing part of a parameterization): Gettelman et al. (2021).
Gettelman, A., D. J. Gagne, C.-C. Chen, M. W. Christensen, Z. J. Lebo, H. Morrison, and G. Gantos. “Machine Learning the Warm Rain Process.” Journal of Advances in Modeling Earth Systems 13, no. 2 (2021): e2020MS002268. https://doi.org/10.1029/2020MS002268.
Page 3, L63: clarify ‘these optical properties (absorption…etc’
Page 3, L77: what is the size range here? Please be explicit.
Page 3, L90: The CESM reference should probably be Danabasoglu et al. (2020).
Danabasoglu, G., J.-F. Lamarque, J. Bacmeister, D. A. Bailey, A. K. DuVivier, J. Edwards, L. K. Emmons, et al. “The Community Earth System Model Version 2 (CESM2).” Journal of Advances in Modeling Earth Systems 12, no. 2 (2020): e2019MS001916. https://doi.org/10.1029/2019MS001916.
Page 5, L148: how much error is there in the approximations? Can you quantify it?
Page 9, L251: why would a random network do better? Is there an explanation? Isn’t that a form of overfitting?
Page 10, L289: Please describe these terms a bit: ReLU, ELU, Leaky ReLU, and Parametric ReLU.
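For reference, the four activation functions the reviewer asks about are standard and differ only in how they treat negative inputs; the NumPy definitions below are generic textbook forms, not code from the manuscript.

```python
import numpy as np

def relu(x):
    # ReLU: zero for negative inputs, identity for positive inputs
    return np.maximum(x, 0.0)

def leaky_relu(x, alpha=0.01):
    # Leaky ReLU: small fixed slope alpha on the negative side
    return np.where(x > 0, x, alpha * x)

def prelu(x, alpha):
    # Parametric ReLU: same form as Leaky ReLU, but alpha is learned during training
    return np.where(x > 0, x, alpha * x)

def elu(x, alpha=1.0):
    # ELU: smooth exponential curve that saturates at -alpha for very negative inputs
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))
```

All four are identity for positive inputs; the variants exist mainly to avoid the "dead unit" problem of plain ReLU, which passes no gradient for negative inputs.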
Page 10, L291: what is a transfer function? Above you call them activation functions. Please clarify.
Page 11, Figure 2: is the red dot the ‘optimum’ network?
Page 12, L350: can you make the evaluation a bit more quantitative in spots? It seems a bit ‘weak’ right now, especially compared to the rest of the paper.
Page 13, L360: What does the first column (labeled "Table:") of Table 1 mean? Should it say something?
Page 13, L364: Can you explain the patterns in Figure 5? What do they arise from?
Page 16, L402: Other lessons learned? It would be great to share in the paper.