the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Emulating Aerosol Optics with Randomly Generated Neural Networks
Abstract. Atmospheric aerosols have a substantial impact on climate and remain one of the largest sources of uncertainty in climate forecasts. Accurate representation of their direct radiative effects is a crucial component of modern climate models. Direct computation of the radiative properties of aerosols is far too computationally expensive to perform in a climate model however, so optical properties are typically approximated using a parameterization. This work develops artificial neural networks (ANNs) capable of replacing the current aerosol optics parameterization used in the Energy Exascale Earth System Model (E3SM). A large training dataset is generated by using Mie code to directly compute the optical properties of a range of atmospheric aerosol populations given a large variety of particle sizes, wavelengths, and refractive indices. Optimal neural architectures for shortwave and longwave bands are identified by evaluating ANNs with randomly generated wirings. Randomly generated deep ANNs are able to outperform conventional multi-layer perceptron style architectures with comparable parameter counts. Finally, the ANN-based parameterization is found to dramatically outperform the current parameterization. The success of this approach makes possible the future inclusion of much more sophisticated representations of aerosol optics in climate models that cannot be captured through simple expansion of the existing parameterization scheme.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(2695 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(2695 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-559', Anonymous Referee #1, 15 Jul 2022
This manuscript describes the development of neural networks to replace the aerosol optics in a climate model with a more detailed treatment, which is based on running the same types of codes with more detail to develop a parameterization which better represents more detailed modeling. In general the manuscript is well written and clear. There is a nice discussion of how the neural network is developed. However, my main critique is that the evaluation section (section 5) is pretty minimal. Just some error curves. What does it look like in the full model? You have demonstrated that the new parameterization represents the more detailed code better than the existing parameterization. Does it change the answers in the climate model it is designed for in any meaningful way, and does it cost anything more to run it. Also good to note in the conclusions what lessons you learn from this experience about building neural networks for parameterization replacement. This is probably suitable for publication with minor revisions, but with at least trying it in a climate model perhaps.Detailed comments below.
Page 1, L10: Would be good to have more detail on what ‘outperform’ means specifically in another sentence or two.
Page 1, L15: Disingenuous. The direct effects of aerosols are not the largest uncertainty: only indirect effects on clouds.
Page 2, L29: Example of climate models generating training data (for replacing part of a parameterization: Gettelman et al 2021.
Gettelman, A., D. J. Gagne, C.-C. Chen, M. W. Christensen, Z. J. Lebo, H. Morrison, and G. Gantos. “Machine Learning the Warm Rain Process.” Journal of Advances in Modeling Earth Systems 13, no. 2 (2021): e2020MS002268. https://doi.org/10.1029/2020MS002268.
Page 3, L63: clarify ‘these optical properties (absorption…etc’Page 3, L77: what is the size range here? Please be explicit.
Page 3, L90: The CESM reference should probably be Danabasoglu, et al 2020.
Danabasoglu, G., J.-F. Lamarque, J. Bacmeister, D. A. Bailey, A. K. DuVivier, J. Edwards, L. K. Emmons, et al. “The Community Earth System Model Version 2 (CESM2).” Journal of Advances in Modeling Earth Systems 12, no. 2 (2020): e2019MS001916. https://doi.org/10.1029/2019MS001916.
Page 5, L148: how much error is there in the approximations? Can you quantify it?
Page 9, L251: why would a random network do better? Is there an explanation? Isn’t that a form of overfitting?
Page 10, L289: Please describe these terms a bit. ReLU, ELU, Leaky ReLU and Parametric ReLU
Page 10, L291: what is a transfer function? Above you call them activation functions. Please clarify.
Page 11, Figure 2: is the red dot the ‘optimum’ network?
Page 12, L350: can you make the evaluation a bit more quantitative in spots? It seems a bit ‘weak’ right now, especially compared to the rest of the paper.
Page 13, L360: What does the first column (Table:) of table 1 mean? Should it say something?
Page 13, L364: Can you explain the patters in Figure 5? What do they arise from?
Page 16, L402: Other lessons learned? It would be great to share in the paper.
Citation: https://doi.org/10.5194/egusphere-2022-559-RC1 -
RC2: 'Comment on egusphere-2022-559', Anonymous Referee #2, 10 Aug 2022
Review of the manuscript egusphere-2022-559 by Geiss et al.
The manuscript (MS) presents a new method (based on Artificial Neural Networks (ANNs)) for online calculation of the optical properties of the internally mixed aerosols. Current parametrizations and look-up tables are either computationally unaffordable or fail to capture the large variabilities in aerosol properties. The training dataset is based on the Mie code that directly computes the optical properties of aerosols by considering the variability of the particle sizes, wavelengths, and refractive indices. This approach is similar to previous parameterizations but uses a higher resolution for different parameters. By evaluating ANNs with randomly generated wirings, the optimal network architectures are identified for SW and LW. The results show that randomly generated deep ANNs lead to lower error compared to the conventional multi-layer perceptron. Besides, the ANN-based parameterization outperforms the current parameterization.
The paper is very well structured and written. I really enjoyed the detailed explanations of the assumptions and methods that makes it easy to follow the results. The methods and results are robust with major benefits for the aerosol modeling community. Thus, I recommend publication after addressing the minor points/questions listed below.
- With respect to the I/O, it is not clear why nine variables are chosen. Any pre-processing or input selection procedure? Especially two parameters “surface mode radius over wavelength” and “surface mode radius” are obviously correlated. This should not happen.
- Why do you need one-hot encoding? What additional information does it contain for the modes?
- It can be expected that 2-3 hidden layers can capture the nonlinearities of the system very well and more hidden layers often lead to over-fitting (shown in Fig 2). But it is not clear if the rather minor MAE reduction by random wired networks (outperform is too strong here) is justified by its computational costs/complexity.
- I would like to see ANN vs. Mie similar to figure 5 but for all parameters: extinction coefficient, single scattering albedo and asymmetry parameter in SW and WL.
Citation: https://doi.org/10.5194/egusphere-2022-559-RC2 -
CEC1: 'Comment on egusphere-2022-559', Juan Antonio Añel, 23 Aug 2022
Dear authors,
Many thanks for your effort to comply with the "Code and Data Policy" of the journal.
I would like to point out some issues and ask you to fix them. First, PyMieScatt is currently stored in GitHub, and GitHub is not a suitable repository according to our policy. However, PyMieScatt is released under the MIT license, allowing you to store it in another repository. Therefore, please, add it to the Zenodo already set up for this manuscript or create a different Zenodo repository. A similar thing applies to the miev0 subroutine.
Also, note that several of the hyperlinks in the "Code and Data Availability" section (in the PDF file) are broken. Please, fix it.
Regards,
Juan A. AñelGeosci. Model Dev. Exec. EditorCitation: https://doi.org/10.5194/egusphere-2022-559-CEC1 -
AC1: 'Reply on CEC1', Andrew Geiss, 06 Oct 2022
I have added copies of the Mie scattering codes and links to their sources to the Github and Zenodo repositories.
The hyperlinks all work for me. I did notice that one of them picks up a page number if you copy/paste from the pdf, so maybe this was the issue. In any case, I will double check everything in the updated manuscript before re-submitting.
Thanks,
Andrew
Citation: https://doi.org/10.5194/egusphere-2022-559-AC1
-
AC1: 'Reply on CEC1', Andrew Geiss, 06 Oct 2022
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-559', Anonymous Referee #1, 15 Jul 2022
This manuscript describes the development of neural networks to replace the aerosol optics in a climate model with a more detailed treatment, which is based on running the same types of codes with more detail to develop a parameterization which better represents more detailed modeling. In general the manuscript is well written and clear. There is a nice discussion of how the neural network is developed. However, my main critique is that the evaluation section (section 5) is pretty minimal. Just some error curves. What does it look like in the full model? You have demonstrated that the new parameterization represents the more detailed code better than the existing parameterization. Does it change the answers in the climate model it is designed for in any meaningful way, and does it cost anything more to run it. Also good to note in the conclusions what lessons you learn from this experience about building neural networks for parameterization replacement. This is probably suitable for publication with minor revisions, but with at least trying it in a climate model perhaps.Detailed comments below.
Page 1, L10: Would be good to have more detail on what ‘outperform’ means specifically in another sentence or two.
Page 1, L15: Disingenuous. The direct effects of aerosols are not the largest uncertainty: only indirect effects on clouds.
Page 2, L29: Example of climate models generating training data (for replacing part of a parameterization: Gettelman et al 2021.
Gettelman, A., D. J. Gagne, C.-C. Chen, M. W. Christensen, Z. J. Lebo, H. Morrison, and G. Gantos. “Machine Learning the Warm Rain Process.” Journal of Advances in Modeling Earth Systems 13, no. 2 (2021): e2020MS002268. https://doi.org/10.1029/2020MS002268.
Page 3, L63: clarify ‘these optical properties (absorption…etc’Page 3, L77: what is the size range here? Please be explicit.
Page 3, L90: The CESM reference should probably be Danabasoglu, et al 2020.
Danabasoglu, G., J.-F. Lamarque, J. Bacmeister, D. A. Bailey, A. K. DuVivier, J. Edwards, L. K. Emmons, et al. “The Community Earth System Model Version 2 (CESM2).” Journal of Advances in Modeling Earth Systems 12, no. 2 (2020): e2019MS001916. https://doi.org/10.1029/2019MS001916.
Page 5, L148: how much error is there in the approximations? Can you quantify it?
Page 9, L251: why would a random network do better? Is there an explanation? Isn’t that a form of overfitting?
Page 10, L289: Please describe these terms a bit. ReLU, ELU, Leaky ReLU and Parametric ReLU
Page 10, L291: what is a transfer function? Above you call them activation functions. Please clarify.
Page 11, Figure 2: is the red dot the ‘optimum’ network?
Page 12, L350: can you make the evaluation a bit more quantitative in spots? It seems a bit ‘weak’ right now, especially compared to the rest of the paper.
Page 13, L360: What does the first column (Table:) of table 1 mean? Should it say something?
Page 13, L364: Can you explain the patters in Figure 5? What do they arise from?
Page 16, L402: Other lessons learned? It would be great to share in the paper.
Citation: https://doi.org/10.5194/egusphere-2022-559-RC1 -
RC2: 'Comment on egusphere-2022-559', Anonymous Referee #2, 10 Aug 2022
Review of the manuscript egusphere-2022-559 by Geiss et al.
The manuscript (MS) presents a new method (based on Artificial Neural Networks (ANNs)) for online calculation of the optical properties of the internally mixed aerosols. Current parametrizations and look-up tables are either computationally unaffordable or fail to capture the large variabilities in aerosol properties. The training dataset is based on the Mie code that directly computes the optical properties of aerosols by considering the variability of the particle sizes, wavelengths, and refractive indices. This approach is similar to previous parameterizations but uses a higher resolution for different parameters. By evaluating ANNs with randomly generated wirings, the optimal network architectures are identified for SW and LW. The results show that randomly generated deep ANNs lead to lower error compared to the conventional multi-layer perceptron. Besides, the ANN-based parameterization outperforms the current parameterization.
The paper is very well structured and written. I really enjoyed the detailed explanations of the assumptions and methods that makes it easy to follow the results. The methods and results are robust with major benefits for the aerosol modeling community. Thus, I recommend publication after addressing the minor points/questions listed below.
- With respect to the I/O, it is not clear why nine variables are chosen. Any pre-processing or input selection procedure? Especially two parameters “surface mode radius over wavelength” and “surface mode radius” are obviously correlated. This should not happen.
- Why do you need one-hot encoding? What additional information does it contain for the modes?
- It can be expected that 2-3 hidden layers can capture the nonlinearities of the system very well and more hidden layers often lead to over-fitting (shown in Fig 2). But it is not clear if the rather minor MAE reduction by random wired networks (outperform is too strong here) is justified by its computational costs/complexity.
- I would like to see ANN vs. Mie similar to figure 5 but for all parameters: extinction coefficient, single scattering albedo and asymmetry parameter in SW and WL.
Citation: https://doi.org/10.5194/egusphere-2022-559-RC2 -
CEC1: 'Comment on egusphere-2022-559', Juan Antonio Añel, 23 Aug 2022
Dear authors,
Many thanks for your effort to comply with the "Code and Data Policy" of the journal.
I would like to point out some issues and ask you to fix them. First, PyMieScatt is currently stored in GitHub, and GitHub is not a suitable repository according to our policy. However, PyMieScatt is released under the MIT license, allowing you to store it in another repository. Therefore, please, add it to the Zenodo already set up for this manuscript or create a different Zenodo repository. A similar thing applies to the miev0 subroutine.
Also, note that several of the hyperlinks in the "Code and Data Availability" section (in the PDF file) are broken. Please, fix it.
Regards,
Juan A. AñelGeosci. Model Dev. Exec. EditorCitation: https://doi.org/10.5194/egusphere-2022-559-CEC1 -
AC1: 'Reply on CEC1', Andrew Geiss, 06 Oct 2022
I have added copies of the Mie scattering codes and links to their sources to the Github and Zenodo repositories.
The hyperlinks all work for me. I did notice that one of them picks up a page number if you copy/paste from the pdf, so maybe this was the issue. In any case, I will double check everything in the updated manuscript before re-submitting.
Thanks,
Andrew
Citation: https://doi.org/10.5194/egusphere-2022-559-AC1
-
AC1: 'Reply on CEC1', Andrew Geiss, 06 Oct 2022
Peer review completion
Journal article(s) based on this preprint
Data sets
Aerosol Optics ML Datasets Andrew Geiss https://doi.org/10.5281/zenodo.6762700
Model code and software
Aerosol Optics ML Code Andrew Geiss https://doi.org/10.5281/zenodo.6767169
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
673 | 162 | 15 | 850 | 7 | 6 |
- HTML: 673
- PDF: 162
- XML: 15
- Total: 850
- BibTeX: 7
- EndNote: 6
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Po-Lun Ma
Balwinder Singh
Joseph C. Hardin
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(2695 KB) - Metadata XML