the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Correction of temperature and relative humidity biases in ERA5 by bivariate quantile mapping: Implications for contrail classification
Nicolas Bellouin
Olivier Boucher
Susanne Rohs
Abstract. The skill of the atmospheric reanalysis ERA5 from the European Centre for Medium-Range Weather Forecasts (ECMWF) at simulating upper atmospheric temperature and relative humidity is assessed by using five years of In-service Aircraft for a Global Observing System (IAGOS) observations. IAGOS flight trajectories are used to extract co-located meteorological conditions – temperature, relative humidity, and wind speed – and are compared with the IAGOS measurements. This assessment is particularly relevant to the study of contrail formation, so focuses on the highly frequented air space that spans the Eastern United States over the North Atlantic and to central Europe. The comparison is performed in terms of mean, median, probability density functions, and a confusion matrix. For temperature a good agreement is identified with a maximum bias of −0.4 K at the 200 hPa level. Larger biases are found for relative humidity with up to −5.5 % at the 250 hPa level. To remove the systematic biases, which mostly tend towards too dry and cold, a bias correction method, based on a multivariate quantile technique, is proposed and applied. After the correction the bias in temperature is reduced to below 0.1 K and in relative humidity to below −1.5 %. To estimate the representation of contrail occurrence in ERA5, data points from IAGOS as well as corrected and uncorrected data points from ERA5 of temperature and relative humidity are flagged for contrail formation using the Schmidt-Appleman–criterion. In the IAGOS data set 39.2 and 16.9 % of the samples represent conditions for non-persistent contrails and persistent contrails, respectively. The corresponding numbers for original ERA5 analyses are 40.8 and 17.5 %, respectively, indicating good agreement overall. Applying a proposed quantile mapping correction method and removing the biases in temperature and relative humidity has only a small effect on the distributions but leads to an overestimation of non-persistent contrail occurrence (44.0 %) and underestimation of persistent contrails (16.8 %). Differences in contrail occurrence that remain after the bias correction are traced back to the underling biases in temperature and relative humidity, indicating that ERA5 is either too dry and warm or cold and moist with largest differences at 250 hPa and decreasing with increasing altitude.
- Preprint
(3093 KB) - Metadata XML
- BibTeX
- EndNote
Kevin Wolf et al.
Status: open (until 25 Jan 2024)
-
RC1: 'Comment on egusphere-2023-2356', Anonymous Referee #1, 28 Nov 2023
reply
Review of
Correction of temperature and relative humidity biases in ERA5 by bivariate quantile mapping: Implications for contrail classification
by K. Wolf et al. (egusphere-2023-2356)
The paper evaluates the ability of ERA5 to correctly represent temperature and relative humidity in the upper troposphere using data from IAGOS as reference. Biases in both fields are detected and characterised and a correction method is applied that corrects both T and then RHi using the so-called quantile mapping method. It is found that this method indeed is able to reduce the biases. Unfortunately it turns out that in spite of this improvement the prediction of contrail formation or contrail persistence is not improved. But at least, for other quantities that are relevant for the assessment of aviation climate impacts, e.g. the optical thickness of contrails, improved estimation seems possible (not individually, but in a statistical sense).While the method leads certainly to a significant improvement of the RHi-statistics of ERA5, the investigations and analyses in the paper are somtimes a bit lengthy and, at least to my view, not always necessary. The differences between the original ERA5, and after the application of the QM method of the correction of Teoh et al are often minor and it is not sure whether they are always real or simply caused by statistical noise. But even if they are real in certain cases, it is not always clear to me why the reader needs to know the potential causes of these differences. I have the feeling that the reader can easily get lost in these details and that the straight way from the analysis to the results and conclusions becomes unclear. So, I think, there is potential to make the paper more concise and to clearly convey a message.
This paper needs at some places more elaborateness in its formulations. For instance, the sentence (Page 1) "IAGOS flight trajectories are used to extract co-located meteorological conditions - temperature, relative humidity, and wind speed - and are compared with the IAGOS measurements" is obviously a faulty. In short, it says that IAGOS trajectories are compared with IAGOS measurements, which is evidently nonsense.
Also "representation of contrail occurrence in ERA5" (P 1) is misleading, since contrails are not represented in the ECMWF model and reanalysis.Therefore I recommend a major revision of this paper to make its messages stronger and clearer.
Major issues:
Reading the abstract it seems that the evaluation merely uses bulk statistics, which I deem insufficient. An good indication of this is figure 5, where, apart from MD, all other measures are very insensitive to the applied corrections. The r² hardly change (and these tiny changes may be insignificant on a p=0.05 level). The r², like the other insensitive statistics, are bulk measures, that say nothing to a point-by-point comparison. The authors acknowledge this in the first sentence of section 3.4.
Section 3.4: The selection of the score values is an unlucky choice to my view, since the TN cases dominate in this data set. Thus, if one would ignore everything and always predict "no ISS", one would already get an impressive accuracy and FA rate. The statement "which indicates that the overall performance of ERA5, even in the uncorrected form, is at least similar or even has improved" is thus misleading. It may be that the author made their choice to compare with Tompkins et al. (2007). If the goal of the paper is to provide better data on the occurrence of ISS, this might be ok, although I still think, that scores that downweight the default TN would be better. If the ultimate goal is, however, forecast of contrail persistence, I think, that the authors should better compare with Gierens et al. (2020) and also use the ETS score described there.
Minor issues:
Lines 34-38: I suggest that you state that RF is a global (or at least regional) quantity, averaged over a long time period. On first reading, it was not so clear to me whether you refer to single contrails or contrails in general. Furthermore, aren't the quoted values ERF values (in Lee et al.) rather than RF values?
L 48 and following: This text mixes up different things which is not good. In order to avoid contrails, one needs a precise PREDICTION of where they WILL occur. Knowledge of their occurrence is insufficient, it can only result in a kind of climatology. Schumann's use of the roof-camera is a bad example for predictive purposes, as it shows contrails that already exist and perhaps exist already quite some time. The same comment can be made for satellite or other observations of already present contrails. Then, contrail simulations in a climate model are not intended for contrail avoidance or prediction. CoCiP is the only example here where contrail prediction is the intention, but of course only, if it is fed with actual weather forecasts, not with ERA5.
L 75: It is a bit surprising that a "fourth approach" is now mentioned. I believe that the mixed-up list from above (L 48 ff) is now continued, but that is not sufficiently clear. Again, this is not an approach to prediction and should thus not be mentioned as something that has to do with contrail avoidance. I find also, that this section interrupts the logic of the argumentation. It would be better if the paragraph that introduces various correction attempts would directly by followed with tha paragraph explaining the goal of the present paper.
Fig. 2 and corresponding text: it seems that either the information in the figure is useless or something is wrong. For instance, why are there so many 175hPa data for EU, where I would expect a lot of landing and departures (i.e. low altitudes)? How do I have to read the figures? It seems, the interpratation is: on 300 hPa most flights are over EU etc..... and on 175 hPa again most flights are over EU. Is this, because most flights are over EU anyway? Then the information is useless. Should't it rather be the following: In EU most flights are on lower altitudes because of a lower fraction of cruise, similar in US but perhaps not as strong, and over the NA all flights are in cruise, therefore a predominance of high altitudes (or low pressures). It seems, the data should be organised the other way, i.e three panels "EU", "NA", "US" and then showing the fraction of pressure levels in the threee panels.
Section 2.1: Please indicate whether all data are used along a flight (it seems so) or a subset to avoid autocorrelation which might spoil the statistics. If autocorrelation has not been avoided, a check should be made whether this affects the results or at least good arguments for this should be given.
L 153/154: I suggest to delete this statement since the procedure is better explained below from L 173 on. Anyway, I think the argument is weak (I don't remember Schumann's reasons for this) and refers rather to interpolation of specific humidity than relative humidity.
L 210/211: The statement "The CDFs describe the probability that a certain quantity, for example temperature or relative humidity, exists in the underlying data set" is wrong. I think, we know which quatities are in the data sets, therefore the probability is either zero or one. Please consult a textbook on probability and correct the sentence or leave it out.
L 239/240: You should delete the second part of the sentence. Contrail formation takes place a few tenths of a second after exhaust. It has nothing at all to do with the vortex phase which starts at, perhaps 20 seconds. I also dislike the next sentence. The SAC has been tested on many flights long ago, and it works excellently. There is a figure in the 1999 IPCC report that shows this (I think, the figure has been taken from a paper by Kärcher).
Section 2.4: I miss information on your choice of an overall propulsion efficiency.
L 272: is there really a general decrease of r_ice with p? Why then occurs ISS mostly directly below the tropopause?
L 297/298: it is not clear to me why the T22 correction cannot modify the shape of the pdf. Please explain.
L 370 ff: To my view the discussion of the differences between the QM and T22 is not convincing. For instance, the quoted thresholds from CoCiP are very very low, so they are not really a constraint. To me, the first question is, whether these differences are statistically robust. If a subset of four out of the five years is used, how large is the change of the quoted values? If it is much smaller than the difference between QM and T22, then a real difference seems more plausible. Second, it might be that you compare apples with oranges, i.e. for instance contrail distance with relative frequency of occurrence.
L 408 ff: It is not clear to me how you interpret the SAc. Why does an rcrit occur? As far as I understand it, the SAc gives a threshold temperature which is the maximum T where a contrail could in principle form (at 100% RH). Is rcrit the RHi values along the tangential mixing line between the T threshold (rcrit=100% RH) and the lower temperature below which contrails are always formed (rcrit=0%). This is not completely clear to me.
Figure 7 and corresponding text: This text contains a lot of details and numbers that I find not necessary. What is the take-away message that the reader learns from these details? Again, how robust are these values, if you leave one your out, then another one, etc.?
L 482: "Applying ... more correctly DETECTED...", please reformulate. The QM method does not change any detection.
L 493-506: This is an interesting consideration. But it comes too late. Perhaps a lot of not-so-important numbers could be saved if this consideration would be presented earlier.
Section 3.5 and Figure 9: please state whether you use for the analysis original or QM corrected ERA5 data.
L 536-539: If that happens only on 225 hPa, is it then really likely that small scale variations cause these differences? Don't such variations occur on the neighbouring pressure levels as well? In which sense are T and rice "more homogeneous" on 200 hPa? What does that mean and from where do you derive such a conclusion?
L 539/540: Isn't this a trivial statement? Did you expect something else? Perhaps there are more interesting results in this section that could be more elaborated. For instance, which error usually dominates, is it the error in T or the error in rice. It would also be interesting to see how the differences in absolute humidity contribute to the misclassifications. What is here the conclusion? What needs to be fixed in the ECMWF model more urgently, is it T or the water vapour field? What is the effect of the QM correction in this analysis? I suggest you add a second set of points (squares or empty points), perhaps with arrows, to show how the QM correction shrinks the errors. Please think also on getting rid of the "default" class in the figure. It seems as if a couple of dots are hidden behind the big black dot in the middle.
L 575: "So from a statistical perspective, that the original ERA5 model output is able to adequately represent contrail formation." First, the sentence is gramatically incorrect. Second, please explain what means that contrail formation is adequately represented FROM A STATISTICAL PERSPECTIVE? Assume, the model would only predict ISS and PC over Antarctica and nowhere else, but with the correct frequency of occurrence averaged globally, then one could also state that the formation is represented adequately from a statistical perspective. So my question is, whether your statement makes sense.
L 590/591: Isn't it the other way, that is, the special bias combination leads to the corresponding entry in the contingency table?
Section 4: should not be entitled "Summary and discussion", since it does not contain any discussion.
Miscellaneous
Line 3: upper tropospheric
L 19: underlying
L 30: delete "bonds"
L 31: What is WC?
L 41-42: The word "defined" is too strong. In fact, this was never defined. There is only a vague understanding that contrails that survive the vortex phase are somehow persistent.
L 67: "due to the high temporal and spatial distribution of WV" ???
L 73: What is "contrail estimation"?
L 126: Replave "multiple" by "many". I believe "multiple years" is nonsense.
L 199: The expression "minimizing the C... test" is confusing. How can a test be minimized. I think this sentence can be dropped since it does not provide essential information.
L 241: criterion (singular).
L 244: slow
L 335: r_ice IS
L 433: remove one instance of correction
Table 6: the caption says "TO07" while the table headline says "T07"
L 513: delete comma after ERA5
L 658: require
L 671: showS and remainS
Table A1: eta should be labelled "overall propulsion efficiency". It is not necessary (and not good) to introduce new notions.
Figure A1(e): The y-label contains [*2e6]. Please check.Citation: https://doi.org/10.5194/egusphere-2023-2356-RC1
Kevin Wolf et al.
Kevin Wolf et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
85 | 29 | 3 | 117 | 3 | 3 |
- HTML: 85
- PDF: 29
- XML: 3
- Total: 117
- BibTeX: 3
- EndNote: 3
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1