Sensitivity of tunable infrared laser spectroscopic measurements of ∆’17O in CO2 to analytical conditions
Abstract. Triple oxygen isotope (∆’17O) measurements of CO2 are increasingly used in paleoenvironmental and atmospheric sciences, in part due to the emergence of tunable infrared laser direct absorption spectroscopy (TILDAS) as a cost- and time-effective method for quantifying rare isotopologues in CO2. This study aims to provide users with a clear understanding of how the stability of analytical conditions (such as optical cell temperature, pressure, and CO2 concentration) affects measurement quality. Using data from two laboratories equipped with TILDAS instruments (University of Göttingen and University of Cape Town), both operating in high-precision dual-inlet mode, we demonstrate how variations in these parameters influence measurement repeatability and long-term stability. The most significant factor affecting short-term repeatability of ∆’17O is a mismatch in CO2 concentration between sample and working standard. The resulting scale-offset effect can amount to several ppm per 1 µmol mol⁻¹ mismatch, depending on instrumental parameters. We show that empirical corrections for such offsets, arising from variable pCO2 of the analyte across measurements, significantly improve reproducibility. In contrast, the dominant influence on long-term stability is drift in optical cell temperature and pressure. In air monitoring studies, unrecognized instrumental drift due to variations in optical cell temperature, pressure, and CO2 concentrations can be misinterpreted as genuine seasonal variations in ∆’17O. We conclude with practical recommendations for achieving the highest possible precision with TILDAS, emphasizing that continuous monitoring and reporting of analytical conditions is essential.
Status: final response (author comments only)
RC1: 'Comment on egusphere-2025-3040', David Nelson, 01 Sep 2025
This is an excellent paper and the content is appropriate for this journal. It should certainly be published with a few minor revisions as described below.
Comments
The meaning of short term and long term in the abstract (and throughout the paper) are not well defined. The authors should define the time scale that they mean by long term drift. Also it seems to me that pressure variation is also a short term drift. Since the pressure in the optical cell changes each time the cell is filled, it generally varies more rapidly than sample concentration. Hence, it would seem that pressure variation should also be categorized as a short term effect just as the authors do with variation in sample concentration. Both concentration mismatch and pressure mismatch are drivers of instrumental measurement error. However, both are precisely quantified scalars whose effects can be quantified and corrected. The effects of temperature drift are much more complex beginning with the observation that there is not just one temperature. Many relevant temperatures are drifting simultaneously and continuously: cell temperature, laser temperature, the temperatures of various key electronic components, etc...
At line 113, I would suggest rephrasing to something like: “mixtures … are used to create optimal spectral line broadening due to collisional broadening at pressures between 30 and 40 Torr.”
In Section 3.2 the authors seem to give the impression that concentration dependence arises from the inability to measure all isotopologues of CO2. I don’t think this is correct. The root causes of concentration dependence are subtle and still being studied. But major factors seem to include systematic errors in the non-linear spectral retrievals and non-linearity in the infrared detector response function. Whatever the cause of concentration dependence, the empirical first order correction adopted by the authors (x=a*x’ + b) is appropriate.
At line 255 perhaps it should be explicitly stated that the reference gas matrix must be selected to match the sample gas matrix.
At line 269 the authors state “Long-term drifts in analytical conditions — such as a gradual temperature change of 0.5 K over the course of a year”. In some laboratories, temperature drifts of 0.5 K can occur within a few hours. There is a need to differentiate time drift versus temperature drift. Do the authors have data that isolate the effect of temperature on scale compression? That would be interesting to see. It seems that a key question is: how frequently does a two point calibration need to be performed? Or, perhaps, how much do such calibrations depend on ambient temperature? The answer may depend on the range of 18O isotopic composition in the samples and standards being measured.
AC1: 'Reply on RC1', David Bajnai, 23 Oct 2025
We thank the reviewer for his constructive comments.
The meaning of short term and long term in the abstract (and throughout the paper) are not well defined. The authors should define the time scale that they mean by long term drift.
In the revised version, we explicitly state that by “long-term” we refer to a time span of several weeks. We also consistently use the phrase “long-term drift” throughout the manuscript. To avoid confusion, we omitted the phrase “short-term” and instead specify precisely what we mean, i.e., variations from one measurement cycle to the next within a replicate.
Also it seems to me that pressure variation is also a short term drift. Since the pressure in the optical cell changes each time the cell is filled, it generally varies more rapidly than sample concentration. Hence, it would seem that pressure variation should also be categorized as a short term effect just as the authors do with variation in sample concentration.
You are correct that pressure variations may contribute to cycle-to-cycle variations in the measured mixing ratios. In practice, however, the cell pressure is generally easy to maintain at a constant level, such that it does not measurably contribute to the cycle-to-cycle repeatability of the ∆’17O values (Fig. 6c-d). In contrast, changes in analyte pressure over the course of several weeks, as shown in Figs 7, 8, and 9, do contribute to the long-term drift.
Both concentration mismatch and pressure mismatch are drivers of instrumental measurement error. However, both are precisely quantified scalars whose effects can be quantified and corrected. The effects of temperature drift are much more complex beginning with the observation that there is not just one temperature. Many relevant temperatures are drifting simultaneously and continuously: cell temperature, laser temperature, the temperatures of various key electronic components, etc...
Thank you very much for this comment! In response to your suggestion that there is not just one relevant temperature, we have also considered variations in the electronics temperature in the revised manuscript. Interestingly, we found that the variability in electronics temperature has a more significant effect on the drift than the cell temperature. We added a corresponding discussion to the text and included electronics temperature in Figures 5–9.
At line 113, I would suggest rephrasing to something like: “mixtures … are used to create optimal spectral line broadening due to collisional broadening at pressures between 30 and 40 Torr.”
Thank you for the comment! The revised sentence now reads “In TILDAS, mixtures of CO2 and a collision gas (e.g., CO2-free air or pure N2) are used to create optimal spectral line broadening due to collisional broadening at pressures between around 30 Torr and 40 Torr.“
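For readers outside laser spectroscopy, the pressure dependence behind this choice can be summarized with the standard textbook parameterization of collisional broadening (general background, not taken from the manuscript), in which the Lorentzian half-width of an absorption line scales linearly with total pressure:

```latex
% Collisional (Lorentzian) half-width at half-maximum of an absorption line:
% gamma_0 is the broadening coefficient of the chosen collision gas at reference
% pressure p_0 and temperature T_0, and n is an empirical temperature exponent.
\gamma_L(p, T) \approx \gamma_0 \,\frac{p}{p_0}\left(\frac{T_0}{T}\right)^{n}
```

At much lower pressures, Doppler broadening dominates and the collision gas has little effect on the line shape; at substantially higher pressures, lines broaden further and begin to overlap with neighbors, which is why an intermediate working pressure is chosen.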
In Section 3.2 the authors seem to give the impression that concentration dependence arises from the inability to measure all isotopologues of CO2. I don’t think this is correct. The root causes of concentration dependence are subtle and still being studied. But major factors seem to include systematic errors in the non-linear spectral retrievals and non-linearity in the infrared detector response function. Whatever the cause of concentration dependence, the empirical first order correction adopted by the authors (x=a*x’ + b) is appropriate.
You are correct; we rephrased the corresponding text and now relate the root cause of concentration dependence to “systematic errors in the measurement of mole fractions and likely include nonlinearities related to spectral retrievals and the infrared detector response” (see beginning of Chapter 4.1).
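For illustration, a minimal sketch of the empirical first-order correction x = a·x' + b mentioned in the comment above could look as follows (in Python; the mole fractions and fitted values are hypothetical and this is not the authors' processing code):

```python
# Minimal sketch of a first-order concentration-dependence correction of the
# form x = a * x' + b (illustrative only; all numbers are hypothetical).
import numpy as np

# Measured (x') and accepted (x) CO2 mole fractions of a reference gas
# analysed at several dilution levels, in µmol/mol.
x_measured = np.array([348.2, 398.9, 449.7, 500.6])   # hypothetical
x_accepted = np.array([350.0, 400.0, 450.0, 500.0])   # hypothetical

# Fit the empirical first-order model x = a * x' + b for this session.
a, b = np.polyfit(x_measured, x_accepted, deg=1)

def correct_mole_fraction(x_prime):
    """Apply the session-specific linear correction to a measured value."""
    return a * x_prime + b

print(f"a = {a:.4f}, b = {b:.2f} µmol/mol")
print("corrected:", correct_mole_fraction(412.3))
```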
At line 255 perhaps it should be explicitly stated that the reference gas matrix must be selected to match the sample gas matrix.
Very good point, thank you. We not only added this caveat but also expanded the corresponding section with two more paragraphs on additional discussion of gas-matrix–related effects (end of Chapter 4.3).
At line 269 the authors state “Long-term drifts in analytical conditions — such as a gradual temperature change of 0.5 K over the course of a year”. In some laboratories, temperature drifts of 0.5 K can occur within a few hours. There is a need to differentiate time drift versus temperature drift. Do the authors have data that isolate the effect of temperature on scale compression? That would be interesting to see. It seems that a key question is: how frequently does a two point calibration need to be performed? Or, perhaps, how much do such calibrations depend on ambient temperature? The answer may depend on the range of 18O isotopic composition in the samples and standards being measured.
A simple quantification is given in Chapter 4.2, where we note that in a setup that is not thermally stabilized, a 2 K variation in the electronics temperature resulted in a ca. 1500 ppm drift in ∆’17O. With the introduction of bracketing this is mitigated, but the remaining temperature dependence depends on the speed of the temperature drift relative to the changeover time.
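As a rough back-of-the-envelope illustration of how the residual error scales with drift rate and changeover time (only the ~1500 ppm per 2 K sensitivity comes from the reply above; the drift rate and changeover time below are hypothetical):

```python
# Order-of-magnitude sketch of the temperature-driven error that bracketing may
# not remove (illustrative; a drift that is linear over the bracketing interval
# is largely cancelled by interpolation, so this is an upper bound).
sensitivity_ppm_per_K = 1500 / 2.0      # ~750 ppm per K, non-stabilized setup (from the reply)
drift_rate_K_per_h    = 0.05            # hypothetical electronics-temperature drift rate
changeover_time_min   = 5.0             # hypothetical spacing of working-gas cycles

# Temperature change between a sample cycle and its bracketing working-gas cycles.
delta_T = drift_rate_K_per_h * changeover_time_min / 60.0
residual_ppm = sensitivity_ppm_per_K * delta_T
print(f"worst-case residual ≈ {residual_ppm:.1f} ppm in ∆'17O per cycle")
```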
Citation: https://doi.org/10.5194/egusphere-2025-3040-AC1
RC2: 'Comment on egusphere-2025-3040', Mathieu Daëron, 29 Sep 2025
This kind of study, although perhaps non-glamorous, is extremely useful in the early days of a novel analytical technique. Virtually all groups already using optical methods to target cap-deltas in CO2 (or about to do so) will benefit from the observations reported here. Below, I list a number of comments/suggestions intending to improve and clarify the text.
* terms like "repeatability" "reproducibility" and "long-term stability" should be defined explicitly, early in the text. When using notions such as "standard deviation", it must be clear what "one observation" means (e.g., a single injection of an aliquot of CO2, the average of several injections, a one-second-long average optical measurement, etc). This is usually obvious to the authors but not to the readers, who may follow different conventions.
* l. 43: Petersen et al. (2019) does not seem to be directly relevant here.
* l. 48: Technically correct but somewhat misleading: the "precise" measurements reported by Adnew et al. (2019) come at the cost of prohibitively long integration times (20 h integration for 14 ppm internal SE).
* l. 60 "most apply to the technique in general": you may want to specify if "the technique" refers here to TILDAS of infrared spectroscopy in general. I am of course biased, but VCOF-CRDS as implemented by Chaillot et al. (2025) is not sensitive to analyte pressure, for example.
* l. 153-154 "This indicates that the correction is independent of the isotopic composition of the sample analyte": what is the difference in δ13C, δ18O values between IAEA-603-derived CO2 and the working reference gas?
* l. 156-157 "This suggests that the correction slope varies slightly between sessions": in practice, this means that one must determine lab-and-session-specific correction parameter(s) based on repeated analyses of some known CO2. These estimates for correction parameter(s) will have uncertainties, and it would seem useful to investigate/quantify the final contribution to analytical uncertainties from this source of error.
* l. 169 "eliminating the need for such a correction": to the authors' discretion, it might be relevant to note that this was tested and verified experimentally.
* l. 176-182: Regarding the data shown in figure 5, it is not entirely clear if these measurements were corrected by repeated bracketing by a working standard, as reported in section 3.1. The next paragraph is quite confusing IMHO because of this: if the figure 5 data is reference-bracket-corrected, the δ18O and Δ17O variability seems enormous; if not, then what do these data look like after reference-bracket-correction? And if the corresponding reference gas measurements were not performed, how could the authors compute their "mismatch parameter"?
* figure 6: How is "internal Δ'17O error" defined?
* l. 187-188 "The internal error of a single replicate analysis, i.e., the repeatability of approximately 10 sample cycles within a bracketing measurement": This is a good example of ambiguity introduced by using undefined terms. Is the "internal error" not the standard error but the standard deviation of ~10 non-independent but separate bracket-corrected measurements? The answers to this question may be obvious for the authors, but not for most readers.
* l. 211-213 "This suggests that a temperature stability of ±1 mK, a pressure stability of ±10 Pa, and a χ’626 stability of ±1 µmol/mol are sufficient during a replicate measurements to prevent any systematic impact on internal repeatability.": I suggest quantifying this statement by adding "[any systematic impact on internal repeatability] beyond +/- X ppm on Δ’17O."
* l. 220-221 "In this context, drift in compression directly affects the accuracy of the final ∆’17O values.": This is surprising. One would expect that drift in compression would be effectively corrected by a two-anchor standardization approach. Is this not the case here?
* l. 243-249: Here the authors make an important point, too often unacknowledged. Kudos for stating it succinctly and clearly.
* l. 250-257: I would suggest adding recommendations about what to do in such cases. Is it feasible to inspect the fit residuals to look for a signature of such matrix effects? Would other approaches work better? Would labs using the exact same methods but different collision gases still get consistent results after two-point standardization?
* Regarding the monitoring of long-term drifts in analytical conditions, does this imply that things like temperature sensors should be periodically recalibrated to avoid drifts in true T (despite logged T being constant)?
Citation: https://doi.org/10.5194/egusphere-2025-3040-RC2
AC2: 'Reply on RC2', David Bajnai, 23 Oct 2025
This kind of study, although perhaps non-glamorous, is extremely useful in the early days of a novel analytical technique. Virtually all groups already using optical methods to target cap-deltas in CO2 (or about to do so) will benefit from the observations reported here. Below, I list a number of comments/suggestions intending to improve and clarify the text.
We thank the reviewer for his constructive comments.
* terms like "repeatability" "reproducibility" and "long-term stability" should be defined explicitly, early in the text. When using notions such as "standard deviation", it must be clear what "one observation" means (e.g., a single injection of an aliquot of CO2, the average of several injections, a one-second-long average optical measurement, etc). This is usually obvious to the authors but not to the readers, who may follow different conventions.
In the revised version, we explicitly state what we mean by repeatability (i.e., whether it refers to 1 SE or 1 SD) and clearly indicate which subset of data each value pertains to.
* l. 43: Petersen et al. (2019) does not seem to be directly relevant here.
Agreed! We removed the Petersen et al. (2019) reference.
* l. 48: Technically correct but somewhat misleading: the "precise" measurements reported by Adnew et al. (2019) come at the cost of prohibitively long integration times (20 h integration for 14 ppm internal SE).
We added the caveat: “However, the above methods are generally too labor-intensive to be practical for routine monitoring of atmospheric CO2.”
* l. 60 "most apply to the technique in general": you may want to specify if "the technique" refers here to TILDAS or infrared spectroscopy in general. I am of course biased, but VCOF-CRDS as implemented by Chaillot et al. (2025) is not sensitive to analyte pressure, for example.
We changed the word “technique” to “tunable laser absorption spectroscopy” to be clear.
* l. 153-154 "This indicates that the correction is independent of the isotopic composition of the sample analyte": what is the difference in δ13C, δ18O values between IAEA-603-derived CO2 and the working reference gas?
In Göttingen, the difference would be ∆δ18O(CO2) = 10‰, whereas in Cape Town it is 25‰; however, in Göttingen the light and heavy CO2 differ by up to 48‰ from the δ18O of the working gas. We do not discuss δ13C in the paper. In the revised version we write: “The observation that gases with different isotope compositions (∆δ18O relative to the working gas ranging from −28‰ to +48‰) yield identical correction slopes m within the same analytical sessions suggests that m is largely independent of the isotopic composition of the sample analyte.”
* l. 156-157 "This suggests that the correction slope varies slightly between sessions": in practice, this means that one must determine lab-and-session-specific correction parameter(s) based on repeated analyses of some known CO2. These estimates for correction parameter(s) will have uncertainties, and it would seem useful to investigate/quantify the final contribution to analytical uncertainties from this source of error.
Thank you for this comment. In the revised version, we now discuss the additional uncertainty introduced by the correction model (Eq. 9) to the data (see the end of Chapter 4.1 and Appendix A).
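To make the idea concrete, a generic Monte Carlo propagation of the correction-slope uncertainty could look like the sketch below (this is not Eq. 9 of the manuscript; the assumed linear correction form and all numerical values are hypothetical):

```python
# Generic Monte Carlo sketch of how the uncertainty of a session-specific
# mismatch-correction slope m propagates into a corrected ∆'17O value.
import numpy as np

rng = np.random.default_rng(42)

m, m_sigma = 2.0, 0.3                      # ppm per (µmol/mol) mismatch, hypothetical
delta_chi  = 5.0                           # µmol/mol mismatch sample vs. working gas, hypothetical
d17O_meas, d17O_meas_sigma = -15.0, 3.0    # ppm, hypothetical raw value and internal error

n = 100_000
m_draws    = rng.normal(m, m_sigma, n)
meas_draws = rng.normal(d17O_meas, d17O_meas_sigma, n)
corrected  = meas_draws - m_draws * delta_chi    # assumed linear correction form

print(f"corrected ∆'17O = {corrected.mean():.1f} ± {corrected.std():.1f} ppm")
print(f"contribution from slope uncertainty alone: {m_sigma * delta_chi:.1f} ppm")
```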
* l. 169 "eliminating the need for such a correction": to the authors' discretion, it might be relevant to note that this was tested and verified experimentally.
Thank you for the remark!
* l. 176-182: Regarding the data shown in figure 5, it is not entirely clear if these measurements were corrected by repeated bracketing by a working standard, as reported in section 3.1. The next paragraph is quite confusing IMHO because of this: if the figure 5 data is reference-bracket-corrected, the δ18O and Δ17O variability seems enormous; if not, then what do these data look like after reference-bracket-correction? And if the corresponding reference gas measurements were not performed, how could the authors compute their "mismatch parameter"?
In the revised version, we use consistent notation throughout the text and figures to clarify what each dataset refers to. Specifically, we use the subscript “smp/wg” for bracketed measurements and “meas” for raw values. In the case of Fig. 5, these are non-bracketed values. There was no changeover measurement performed during this test.
* figure 6: How is "internal Δ'17O error" defined?
In line 511, we write: “and the internal error of individual replicate measurements (68% confidence interval of the approximately 10 sample cycles bracketed by working gas analyses)”. We also repeat this information in the caption of Fig. 6.
* l. 187-188 "The internal error of a single replicate analysis, i.e., the repeatability of approximately 10 sample cycles within a bracketing measurement": This is a good example of ambiguity introduced by using undefined terms. Is the "internal error" not the standard error but the standard deviation of ~10 non-independent but separate bracket-corrected measurements? The answers to this question may be obvious for the authors, but not for most readers.
See our comment above.
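To illustrate the smp/wg notation and the 68% confidence interval discussed above, here is a schematic of sample-standard bracketing for one replicate (hypothetical ratios; whether the reported internal error is a standard deviation or, as computed here, a standard error of the mean is a choice clarified in the revised manuscript, not asserted here):

```python
# Schematic of sample-standard bracketing: each sample cycle is referenced to
# the working gas interpolated from the adjacent working-gas cycles, and the
# internal error is taken as the 68% confidence interval of the cycle mean.
import numpy as np

# Alternating measurement sequence: wg, smp, wg, smp, ..., wg (hypothetical values)
wg_ratios  = np.array([1.00000, 1.00002, 1.00001, 1.00003, 1.00002, 1.00004])
smp_ratios = np.array([1.01251, 1.01253, 1.01250, 1.01255, 1.01252])

# Delta of each sample cycle relative to the interpolated working gas (in ‰)
wg_interp = 0.5 * (wg_ratios[:-1] + wg_ratios[1:])
delta_smp_wg = (smp_ratios / wg_interp - 1.0) * 1000.0

mean_delta = delta_smp_wg.mean()
# 68% confidence interval of the cycle mean (standard error of the mean)
internal_error = delta_smp_wg.std(ddof=1) / np.sqrt(delta_smp_wg.size)

print(f"δ_smp/wg = {mean_delta:.4f} ± {internal_error:.4f} ‰ (1 SE, n={delta_smp_wg.size})")
```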
* l. 211-213 "This suggests that a temperature stability of ±1 mK, a pressure stability of ±10 Pa, and a χ’626 stability of ±1 µmol/mol are sufficient during a replicate measurements to prevent any systematic impact on internal repeatability.": I suggest quantifying this statement by adding "[any systematic impact on internal repeatability] beyond +/- X ppm on Δ’17O."
Done!
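A minimal quality-control sketch for the thresholds quoted in the comment above (interpreting the ± values as peak-to-peak limits over one replicate; variable names and logged values are hypothetical, not the instrument's log format):

```python
# Flag replicates whose logged cell temperature, cell pressure, or χ'626 vary
# by more than the stated thresholds (±1 mK, ±10 Pa, ±1 µmol/mol).
import numpy as np

def replicate_is_stable(cell_T_K, cell_p_Pa, chi626_umol_mol,
                        dT_max=1e-3, dp_max=10.0, dchi_max=1.0):
    """Return True if the peak-to-peak variation of each logged quantity
    stays within its threshold over the replicate."""
    return (np.ptp(cell_T_K) <= dT_max and
            np.ptp(cell_p_Pa) <= dp_max and
            np.ptp(chi626_umol_mol) <= dchi_max)

# Hypothetical logged values over one replicate
T   = np.array([296.1500, 296.1504, 296.1502])
p   = np.array([5332.0, 5335.0, 5333.0])
chi = np.array([400.2, 400.6, 400.4])
print("replicate within thresholds:", replicate_is_stable(T, p, chi))
```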
* l. 220-221 "In this context, drift in compression directly affects the accuracy of the final ∆’17O values.": This is surprising. One would expect that drift in compression would be effectively corrected by a two-anchor standardization approach. Is this not the case here?
Thank you for this question as it prompted us to revisit this issue. On one hand, we rephrased the cited sentence to avoid ambiguity: “In this context, drift in the scale compression within a measurement period directly affects the accuracy of the final ∆’17O values.” On the other hand, we now note that “Figure 7a illustrates that the magnitude and direction of the drift in the ∆’17O values of the two standards used in the Göttingen laboratory are not identical. The δ18O values of these two standards differ by ca. 80‰. As discussed above and shown in Fig. 5, the mixing ratios of the three CO2 isotopologues respond in an uncorrelated fashion to variations in analytical conditions related to systematic errors in the measurement of mole fractions. It follows that the magnitude of the drift in the measured δ and ∆’17O values in response to changing analytical conditions, particularly temperature, depends on the isotopic composition of the analyte.”
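For readers unfamiliar with the two-anchor approach discussed here, a generic two-point standardization sketch is given below (standard names and all values are hypothetical; the point is that the transfer function, and hence the scale compression, must be re-determined often enough to track the drift described above):

```python
# Generic two-anchor (two-point) standardization: measured values of two
# standards with accepted ∆'17O define a linear transfer function that is then
# applied to samples measured in the same window. Illustrative only.
import numpy as np

accepted = {"STD-heavy": -100.0, "STD-light": -250.0}   # ppm, hypothetical
measured = {"STD-heavy":  -95.0, "STD-light": -230.0}   # ppm, hypothetical (compressed scale)

# Linear transfer: accepted = slope * measured + intercept
x = np.array([measured["STD-heavy"], measured["STD-light"]])
y = np.array([accepted["STD-heavy"], accepted["STD-light"]])
slope = (y[1] - y[0]) / (x[1] - x[0])
intercept = y[0] - slope * x[0]

def standardize(d17O_measured_ppm):
    """Map a measured ∆'17O onto the anchor-defined scale."""
    return slope * d17O_measured_ppm + intercept

print(f"scale compression factor ≈ {1/slope:.3f}")
print("standardized sample:", round(standardize(-150.0), 1), "ppm")
```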
* l. 243-249: Here the authors make an important point, too often unacknowledged. Kudos for stating it succinctly and clearly.
Thank you!
* l. 250-257: I would suggest adding recommendations about what to do in such cases. Is it feasible to inspect the fit residuals to look for a signature of such matrix effects? Would other approaches work better? Would labs using the exact same methods but different collision gases still get consistent results after two-point standardization?
We expanded the paragraph on gas purity effects to provide additional details on the potential pitfalls of not matrix-matching the sample and working gas analytes. We now also state: “Matching the matrix of the working gas to that of the sample analyte as closely as possible helps prevent detrimental effects arising from variable scale-offsets.” We note, however, that this may not be practical for air monitoring studies, where the analyte composition can vary (e.g., due to changing argon concentrations in air), and should therefore be further evaluated. One possible solution would be to extract CO2 and dilute it again with the same collision gas used for the working gas.
* Regarding the monitoring of long-term drifts in analytical conditions, does this imply that things like temperature sensors should be periodically recalibrated to avoid drifts in true T (despite logged T being constant)?
Calibrating sensors should be standard practice. In TDLWintel, the control software of Aerodyne Research Inc’s TILDAS instruments, this is easily done under Edit->PTL…->Set P T offset. We added the following to our recommendations: ”Continuous monitoring and reporting of the analytical conditions, along with the periodical recalibration of temperature and pressure sensors, are therefore essential to ensure data integrity over extended timescales.”
Citation: https://doi.org/10.5194/egusphere-2025-3040-AC2