the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Retrieval of Bulk Hygroscopicity From PurpleAir PM2.5 Sensor Measurements
Abstract. PurpleAir sensors offer a unique opportunity for a large-scale and densely populated array of sensors to study surface air quality. While the PurpleAir sensors are inexpensive and abundant, they require calibration against a validated coincident measurement to ensure the quality of the measurement. Traditionally, this calibration is performed using statistical (empirical) methods. We propose a method to extend the aerosol properties determined by PurpleAir sensors to include estimates of the hygroscopic growth of aerosols using a novel calibration method based on the optimal estimation method (OEM). The hygroscopic growth can be estimated during calibration due to the calibration’s sensitivity to relative humidity, which influences the measured size distribution of the aerosols. Our OEM-based retrieval for calibration includes an estimation of the bulk hygroscopicity. By employing the physically-based calibration with the London’s Ministry of the Environment, Conservation and Parks site calibrated measurements, the average daily Mean Absolute Error (MAE) of the PurpleAir PM2.5 measurements decreased from 5.58 μg/m3 to 1.68 μg/m3, and the average daily bias from 4.75 μg/m3 to -0.23 μg/m3. This improvement is comparable to the improvement seen using conventional statistical methodologies. In addition to calibration, using our OEM retrieved allowed us to estimate seasonal bulk hygroscopicity values ranging from 0.33 to 0.40. These values are consistent with the accepted ranges of bulk hygroscopicity values determined in previous studies using sophisticated air quality measurement instruments.
- Preprint
(568 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 11 Feb 2025)
-
RC1: 'Comment on egusphere-2024-3618', Anonymous Referee #1, 17 Dec 2024
reply
This paper presents an interesting approach for extracting aerosol properties from low-cost air quality sensor data. While the method is novel and the results are promising, I would suggest a few additional analyses and clarifications in the work.
First, and most significantly, it seems that the same data are used for both calibrating the OEM method (i.e., extracting the aerosol properties) and for evaluating the performance of this method. The same holds true for the MLR method against which the OEM is being compared. I don’t think that this is a fair way to do these comparisons. If possible, I would suggest separating the data from each season into distinct sets for calibration and evaluation. This separation could be done randomly or (my preference) using e.g. one month of data to run the OEM (and to calibrate the MLR) and the other months to evaluate the calibration performance. I prefer this latter approach since it is more realistic to how these sensors might be used in practice, i.e., calibrated near a reference for some period of time and then deployed to another site. Overall, I think this will be a more realistic evaluation of the method and its strengths/limitations compared to the “traditional” MLR.
Second, there are some comments that the method is quite sensitive to the relative humidity, and that the PurpleAir relative humidity data are of insufficient quality. Can more be said about this? I would suggest, for example, testing the method with the relative humidity as measured by PurpleAir, but using a higher relative uncertainty for these measurements in the covariance matrices within OEM. Is the OEM method still able to resolve the aerosol properties in such cases? While accurate humidity data from a nearby whether station may be available, it would be useful to also see how well the calibration can perform using only the information from the PurpleAir itself (while understanding the relatively lower quality of these data).
Third, the OEM method requires appropriate prior terms, and especially appropriate uncertainties (parameterized in covariance matrices) for these terms. While there is some discussion of the sensitivity of the method to various parameters, there is little quantitative information here which would help other researchers understand the applicability of these findings to their work. I would suggest describing how the measurement error covariance matrix and the state vector covariance matrix were defined in more detail. Furthermore, I would suggest presenting the results of any sensitivity analyses conducted for these terms, possibly in a supplement or appendix.
I also have several specific comments and suggested corrections, listed here:
Line 20: Note that these data are still publicly available, but not freely available.
Line 25: Please provide a source or reference for the description of the operating principles.
Line 30: “was” should be “were”.
Line 43: The calibration is not only applicable to “their” sensor, but also any other sensor, within the range of conditions under which the calibration was created and validated.
Lines 71-72: The meaning of this sentence is unclear. I think you might intend to say that the physics-based correction model which was just described was also compared with a purely statistical correction model incorporating multiple linear regression terms. Please consider rephrasing if this is indeed what you meant.
Line 87: What is meant by “clearly erroneous readings”? Is this the result of the same quality control procedure just described, or are these data erroneous for different reasons?
Section 2.2.3: Not a necessary change, but I would suggest moving most of this information to the introduction, as it is background detail and motivation for the study. The specific model of reference instrument used can be mentioned in Section 2.2.2 instead.
Section 2.3: Values for the a-priori state vector and the error covariance matrices should be provided; this can be done in a supplement or appendix. Some argument should be provided about how these values were selected as well, so that others can make appropriate decisions when replicating this.
Section 3.1: Are there any quantitative results which can be presented based on this analysis to support the conclusions that particle diameter and temperature are of lesser importance? For example, relative magnitudes of terms in the error covariance matrix?
Line 176: remove “from”.
Section 3.2.1: Are the same results observed for the empirical calibration, or is this over-correction unique to the physical calibration? There is a comment addressing this in lines 229-230; maybe this could be moved up and expanded on. The comment seems to suggest that, based on the data, the calibration dependence on humidity should in fact be linear, as opposed to nonlinear as in the physical calibration. This could potentially mean that humidity is impacting the sensor performance in more complex ways than just the hygroscopicity of the particles.
Line 196: I would suggest using the measured humidity from the PurpleAir as your input and calibrating the uncertainty for this term using your comparison to a nearby weather station. This could give a sense of how robust the measurements are to the data quality of the internal humidity sensor, which you have noted is not high.
Lines 218-219: This argument seems to contradict the previous statements that the method is not sensitive to particle diameter.
Line 221: “2.5” should be subscripted.
Table 2: Why is the R-squared of the raw data not reported? Reporting the biases across the different methods could also be useful.
Line 254: The bias of the linear correction being zero indicates that it is being assessed on the same data on which it is calibrated. I’d suggest defining separate calibration and evaluation datasets for each month; see my general comment on this from above.
Citation: https://doi.org/10.5194/egusphere-2024-3618-RC1
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
117 | 27 | 6 | 150 | 3 | 4 |
- HTML: 117
- PDF: 27
- XML: 6
- Total: 150
- BibTeX: 3
- EndNote: 4
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1