the Creative Commons Attribution 4.0 License.
Performance and longevity of compact all-in-one weather stations – the good, the bad and the ugly
Abstract. We provide a long-term evaluation of compact, all-in-one automatic weather stations (AiOWSs) compared to professional-grade automatic weather stations (AWSs). We examine the performance, longevity, and degradation of six AiOWS models over several years of unserviced use. The objective was to determine how closely these low-cost stations meet World Meteorological Organization (WMO) performance standards for temperature, humidity, wind, and precipitation, and to identify their weaknesses and maintenance needs.
Previous studies show the potential value of AiOWSs when data are properly quality-controlled, yet long-term reliability remains uncertain. To address this, we deployed six AiOWS units (Davis VVue, Davis VP2, METER ATMOS41, Lufft WS601, and Vaisala WXT520) alongside two collocated reference AWSs meeting WMO standards. Before field installation, each unit was tested in KNMI's calibration laboratory for baseline validation. The stations were then operated in open terrain for multiple years without any servicing, simulating typical end-user neglect.
Initially, all AiOWSs met manufacturer specifications. After long-term exposure, however, sensors displayed varied durability. The Vaisala unit operated continuously for over 13 years, while others failed between four and seven years due to corrosion, component wear, and sensor drift. The METER and Davis VVue units remained mostly functional but with degraded performance, whereas both Davis VP2 rain gauges failed early due to reed switch damage.
Temperature measurements were the most robust. In climate chamber tests, new and aged sensors maintained accuracy within ±0.3 °C across -15 °C to 30 °C, drifting slightly (underestimating by 0.5–0.7 °C) above 30 °C. Field data confirmed these results, though strong solar radiation caused overestimations during summer. The Vaisala and Davis VVue units remained within WMO Class B limits after a decade. Relative humidity showed consistent deterioration. Most sensors overestimated low humidity and underestimated above 90 %, particularly the METER unit, whose bias grew markedly after five years. Wind speed accuracy degraded due to mechanical wear. Cup anemometers underreported low winds and failed completely in some cases. Sonic sensors (Vaisala, METER) produced erratic readings after several years, highlighting their fragility outdoors. Precipitation performance was weakest across all models. Tipping bucket designs suffered from clogging, internal corrosion, and undercatch errors, while haptic or drip-based sensors became inaccurate as components aged or fouled.
We conclude that compact AiOWSs can provide scientifically useful temperature data if properly managed but fall short for humidity, wind, and particularly precipitation unless regularly serviced. Long-term unattended operation severely limits reliability, yet moderate maintenance can potentially restore performance close to WMO Class A/B standards, extending their utility for dense observation networks.
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2025-5194', Anonymous Referee #1, 19 Dec 2025
- AC1: 'Reply on RC1', Christopher Brown, 09 Feb 2026
The authors have performed a very interesting, long-term study on the quality of "all-in-one" weather stations, or Personal Weather Stations, looking at the decay of quality over time when these stations are operated without maintenance. It's a unique study, and thereby worthy of publishing in my opinion, since it addresses a big unknown in the use of non-WMO data (quality drift over time). However, the presentation of the results, the structure of the text and the meager application discussions leave the overall quality of the manuscript something to be desired. I recognize that additional experiments for such a long-term study are fully impossible, but there are some scientific improvements to be made nevertheless before the article is fully suitable for publication. Hence my recommendation for Major Revisions, with the sidenote that this mainly concerns the framing, structure and presentation of the work and not so much its experimental core.
The authors would like to thank reviewer 1 for their time and effort in reading and commenting on our manuscript. We agree with many of the points raised and believe that the below suggested modifications will strengthen the quality and clarity of the manuscript.
A major point I felt while reading through the manuscript is that, while it's a really interesting work, the presentation feels shallow. The authors do not go much beyond presenting some statistics on performance, and the only comparisons made are to the standard WMO table of station siting (as well as their reference data at the weather field). I would have liked to see some comparisons to similar studies, or studies using PWS data: for instance in section 3.2.1, a lot of the somewhat cheaper brands of PWS (e.g. Netatmo) suffer from moisture retention at high RH values - hence what you see in those stations is that moisture gets inside the sensor and oversaturates it (RH reported at nearly 100) for a long time. This problem of moisture pooling inside the sensor is also an issue for e.g. the Netatmo sonic anemometer which understandably deteriorates its usage - it would have been interesting to draw those comparisons and look a little further than just the findings in the field: what do they mean? Similarly, the authors could dive a little deeper into the data: I get that for wind observations a direct comparison to a different height is tricky, but it would benefit the manuscript if that was at least given a go. Now the wind results, as well as the rain results because of the equipment failure, feel fairly underwhelming and inconclusive. On the application side of things, I feel like the focus is too much on direct comparison to WMO guidelines and equipment, which will always be an unfair comparison. Rather, the power and interesting use cases of PWS data is in those locations where WMO siting will always say it's imperfect: heterogeneous terrain and especially cities. So rather than focus on the poor performance, I would like to see the authors' thoughts on when these data CAN help: where and how should we as scientists, or citizen scientists, deploy these stations, in order to have them both running well for a longer time, and provide good data? 
There are quite a few other studies using PWS data (the authors already mention a few) that are pretty positive on their usage, but a thorough discussion of the link between this work and those studies is now missing from this paper. Creating that connection, between this well-studied field experiment and those opportunistic sensing studies would strengthen the field as a whole.
The reviewer makes a number of useful points here that we agree with; in turn:
- We have added more references to other studies using AiOWS data.
- We have added the method of failure/bias in the RH measurements to the paper. The discussion on the Netatmo failure point is certainly useful, and we have included that within the introduction (also due to its importance in calculating derived fields, such as Wet Bulb Globe Temperature in tropical environments).
- We appreciate the question regarding going into more detail for the wind comparison. Upon initial writing of the paper, we discussed whether there was a scientifically useful option to use an algorithmic method to transform 10 m wind speed to 1.5 m wind speed, but decided against it due to unresolvable uncertainties (due to varying land use at and surrounding the test field, and because theoretical vertical profiles are designed to work in highly homogeneous surface roughness conditions). Thus, we added a note on this in the text:
“Surface roughness is not consistent on the testfield (due to other instruments, fences and buildings within the surrounding 500 m), and theoretical vertical wind profiles such as the power law or the logarithmic wind profile are only valid under idealized conditions (e.g. neutral stability and horizontal homogeneity) and are therefore not generally applicable across all meteorological conditions (Stull, 1988). Hence an algorithmic method of calculating true 1.5 m wind velocity from the 10 m AWS observations will introduce additional uncertainty.”
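The sensitivity the authors describe can be made concrete with a small sketch (not from the manuscript): extrapolating a 10 m wind speed down to 1.5 m with the neutral-stability logarithmic profile, the result depends strongly on the assumed roughness length, which is precisely why the authors declined to apply such a correction. The roughness lengths below are illustrative values only.

```python
import math

def scale_log_profile(u_ref, z_ref, z_target, z0):
    """Neutral-stability logarithmic wind profile, u(z) proportional to
    ln(z / z0). Returns the wind speed at z_target given the observed
    speed u_ref at reference height z_ref (all heights in metres)."""
    return u_ref * math.log(z_target / z0) / math.log(z_ref / z0)

# A 5 m/s observation at 10 m, extrapolated to 1.5 m under three
# illustrative roughness lengths (short grass vs. nearby obstacles):
for z0 in (0.01, 0.03, 0.1):
    u15 = scale_log_profile(5.0, 10.0, 1.5, z0)
    print(f"z0 = {z0:>5} m  ->  u(1.5 m) = {u15:.2f} m/s")
```

The spread across plausible roughness values is roughly 0.7 m/s at a 5 m/s reference speed, i.e. of the same order as the biases being evaluated, which supports the authors' decision not to height-correct the AWS wind data.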
- We have added some text in the discussion section on the use of wind measurements from AiOWS and their general siting on complex terrain or in cities.
- We would like to emphasize that evaluating the performance of the AiOWS devices under test against the WMO measurement quality classification as defined in Annex 1.G of WMO-No. 8 represents a relatively novel approach (this is not the "standard table" of station siting mentioned by the reviewer). This type of assessment is increasingly recognized as essential for the design and implementation of so-called tiered networks within the WMO Integrated Global Observing System (WIGOS) framework. To the best of our knowledge, no other studies worldwide have applied this specific metric for the classification of these types of weather stations (World Meteorological Organization, Guide to Instruments and Methods of Observation, WMO-No. 8, Annex 1.G).
The figures don't really help with that feeling of shallow presentation: figures 4 and 5 especially are giant tables, without proper captions, that I cannot read very well in the printed version of the manuscript. The presentation idea is very nice, showing the bias in time, but providing a giant table without context is fairly overwhelming. Also the RH colorbar is counterintuitive: positive biases would mean higher RH for the observations, which tends to be colored blue (minor detail). Figure 2 is of quite low resolution. Figure 6 is quite nice as an example of the level of filth that can accumulate in rain gauges, though a small explanation of the scale bar on the bottom would be nice (I imagine it's a ruler in cm?). Table 1 can be referred to a bit more often when WMO siting classes are referenced in the text, e.g. in the conclusions. In that table, an overview of the measurement equipment beyond their accuracy would be helpful: e.g. the type of wind sensor, do they have a radiation shield, single/double tipping bucket etc., for easier comparison between the brands of PWS.
We have reassessed and reproduced all figures to improve clarity, resolution, and captions. We further decided to split our table into two individual tables.
Changes:
- Figure 1: Replaced photograph of AiOWSs on test field
- Figures 2 & 3: re-ordered
- Figure 4: Improved for clarity, using inspiration from Figure 3.5 from https://publications.aston.ac.uk/id/eprint/26693/1/Bell_Simon_J._2015.pdf
- Figure 5: Extensively revised
- Figure 6: Improved labelling
Modified captions:
- Figure 1: (a) Satellite image of the KNMI testfield. The location of the AiOWSs and AWS 06261 is indicated with a blue square; WIGOS 0-20000-0-06260 is indicated with a red circle (approx. 200 m apart). Inset shows a map of the Netherlands indicating the location of the testfield (52.099ºN, 5.176ºE). (b) Photograph of the aged AiOWSs at the testfield.
- Figure 2: Photographs of AiOWSs at the end of their deployment, showing failed or damaged parts. (a) METER’s drop counter system, using two gold-plated electrodes. (b) Corrosion of the sealed ball bearing supporting the cup anemometer on the Davis VP2 instrument. (c) Failed reed switch on the Davis VP2. (d) Corrosion on circuit boards and tipping bucket rain gauge in the Lufft.
- Figure 3: Measurement difference of each AiOWS from the laboratory reference. (a) Temperature (°C), (b) relative humidity (%), (c) wind speed (m/s).
- Figure 4: (a) Temperature and (b) relative humidity bias from the AiOWSs relative to the collocated AWS. Biases are calculated for 5 °C/10 % bins of temperature/humidity and for each year of the field test separately to show potential bias changes over time.
- Figure 5: As Figure 4, but here for (a) precipitation intensity and (b) wind speed bias.
- Figure 6: Detritus found in Davis VP2 tipping bucket rain gauge after 7 years of operation. For scale a ruler (cm) is shown along the bottom.
- Table 1: WMO classification of measurement uncertainty for AiOWSs, adapted from the Guide to Instruments and Methods of Observation (WMO-No. 8), Annex 1.G (World Meteorological Organization).
- Table 2: AiOWS measurement accuracy, as specified by the manufacturers and confirmed at initial calibration laboratory testing.
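The year-by-year binned-bias computation described in the Figure 4 caption could be sketched as follows. This is a hypothetical reconstruction, not the authors' code: `aio` and `ref` are assumed to be time-indexed pandas Series on matched timestamps, and the 5 °C bin width matches the caption.

```python
import numpy as np
import pandas as pd

def binned_bias(aio, ref, bin_width=5.0):
    """Mean bias of an AiOWS series vs. a collocated reference AWS,
    grouped by reference-value bin and by calendar year (as described
    in the Figure 4 caption). Both inputs are time-indexed Series;
    timestamps where either reports no data are dropped."""
    df = pd.DataFrame({"aio": aio, "ref": ref}).dropna()
    df["bias"] = df["aio"] - df["ref"]
    # Bin by the reference value, e.g. 5 degC bins for temperature
    edges = np.arange(np.floor(df["ref"].min() / bin_width) * bin_width,
                      df["ref"].max() + bin_width, bin_width)
    df["bin"] = pd.cut(df["ref"], edges)
    # One mean bias per (year, bin) cell, matching the figure layout
    return df.groupby([df.index.year, "bin"], observed=True)["bias"].mean()
```

Presenting each year as a separate row of these binned means is what reveals whether the bias grows over the deployment, which is the point of the figure.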
Some smaller comments, issues and points below:
In the literature, the stations that the authors call "All-in-one weather stations" are usually called Personal Weather Stations (PWS) or sometimes Citizen Weather Stations (CWS). Given that the authors actually use the term PWS in figure 2, I would recommend that they use this term throughout the manuscript instead of AiOWS, to keep it consistent with earlier work on these stations.
This was a point of discussion with the co-authors whilst writing the paper. We acknowledge that historically the term PWS is more frequently used. However, PWS gives a notion of the resulting observations being ‘consumer grade’ and hence maybe not (scientifically) useful. It also leads to confusion with some of the more expensive equipment (e.g. the Vaisala WXT series) which can be many thousands of euros to purchase. From that point of view we chose ‘all-in-one’, in order to maintain consistency with WMO SC-MINT terminology.
L.109: Automatic Weather Stations doesn't need to be written out fully here
Modified
A question I had was: what is the typical longevity of a PWS? In other words: is the level of deterioration realistic? Especially the station that ran for 14 years without any maintenance, seems quite unlikely that the owner wouldn't look at it at all. Would be interesting to get an idea of that to place the results in the application context.
At KNMI we are in contact with many citizen scientists, and are continually surprised at the longevity of some consumer-grade weather stations still in use. Through this network we know of a Davis Wizard AiOWS that has run continuously from 1997 to the present day. Whether maintenance was performed according to the manufacturer guidelines is difficult to know, given both the citizen-scientist owner and the fact that we can’t see this in incoming data/metadata. For this reason, we know (as from this research project) that a basic weather station can survive (and send data) for many years, but we have no way of knowing whether that station is serviced, or effectively a ‘zombie’ weather station that has been forgotten about by its owner but still has an internet connection.
L227: "both used failed almost immediately in the field": how come?
The failure was not investigated directly when it happened, as the Davis VP2s were working correctly for other observation types (e.g. temperature, wind, humidity) and thus remained in the field as part of the life-span experiment. Reed switches are delicate items, and are commonly damaged by dropping or hitting the housing (and thus misaligning the reed from its magnetic actuator). However, this was not the case in our deployment, as the Davis VP2s were deployed to the test field and then not touched for multiple years (thus, they would not have suffered collision/dropping damage). A secondary cause of failure in reed switches is if water/humidity enters the switch and corrodes the reed. Again, upon inspection there was no internal corrosion of the switch evident. However, corrosion of the Davis VP2’s wiring appears to be present. Thus, we have added this information to the discussion:
“Both Davis VP2 had poor performance in tests; although the reed-switch mechanism worked moderately well when assessed initially (–5% error), both units failed almost immediately in the test field. At the end of the deployment, it was found that no signal was able to be generated from either reed switch, potentially indicating failure or corrosion of the Davis VP2’s wiring/soldered connectors (Figure 2c).”
We note that Davis recently modified their VP2 rain gauge design (after our instruments were made) to use a different switch, which is potentially more robust in damp climates such as the Netherlands.
Do you have any idea of the initial quality of the observations? Of course there's the manufacturers list and the in-field data, but did you also perform the lab tests at the start of the experiment? Understandable if you didn't, but just curious!
At the start of the experiment, all AiO WSs were tested in the laboratory. At the time the aim was to confirm the manufacturers’ specifications of each instrument (as listed in Table 2). We will note this in the manuscript:
“Prior to deployment, each AiO WS underwent calibration laboratory testing to quantify baseline accuracy, operational range, and to verify consistency with manufacturer specifications. All AiO WSs tested initially met manufacturer specifications (Table 2).” Ideally, any future experiment on AiOWSs would see multiple identical AiOWSs tested in the calibration laboratory prior to deployment, and the difference in initial observations quantified.
L301: is it possible to explicitly test the radiation bias (following the study by Simon Bell, 2015)?
We thank the reviewer for pointing us to the PhD thesis of Dr Bell (we are using this document: https://publications.aston.ac.uk/id/eprint/26693/1/Bell_Simon_J._2015.pdf). We assume the reviewer refers to Bell’s Figure 3.5, plotting temperature biases in radiation bins. Indeed, this is a logical and neat method of highlighting the effect of radiation on temperature bias. We agree that it is logical to perform a similar analysis for our data, and have replaced the aforementioned Figure 4 with this new plot.
Section 4.2: rapid degradation is mentioned: any idea why specifically this rapid degradation occurs? Is there a specific kind of equipment damage that causes rapid versus gradual deterioration?
We have expanded this in the discussion, but we were surprised at the manufacturers’ use of steel (vs stainless steel) within tipping bucket mechanisms, or unprotected cup and vane anemometer shafts (which allows moisture access to bearing races). Ideally, an outdoor weather station would be manufactured from inert materials, and have weather resistance to IP66 equivalence in order to negate these corrosion-related failures.
Ch 4 in general reads a bit messily, with a lot of short paragraphs, sometimes starting with just "Precipitation." (L. 328). I would recommend restructuring so it all flows a bit better
We agree, and have done a thorough job on extending and reordering the section in a more logical order.
In figure 1: it would also be nice to clearly showcase all the instruments together, the rightmost figure shows most (?) of them but it's not clear which is which.
We have replaced the photograph with an annotated photo of the AiO WSs on the KNMI testfield.
Citation: https://doi.org/10.5194/egusphere-2025-5194-AC1
- RC2: 'Comment on egusphere-2025-5194', Anonymous Referee #2, 19 Dec 2025
Review of “Performance and longevity of compact all-in-one weather stations – the good, the bad and the ugly”
The authors address the performance and durability of six all-in-one weather stations (AiOWSs) over multiple years, by evaluating them against a reference station both in the lab and in the field. The study identifies the weaknesses and maintenance needs of these AiOWSs. Given that over the past decades there is a growing interest in using AiOWSs as an additional data source, this study is a relevant addition to the literature and is suited for AMT.
Major comments:
- Overall, the level of English is good, however, the manuscript would benefit from improvements in readability and the structure. There are several typos or awkwardly formulated sentences, I recommend careful proofreading and revision to improve the clarity and flow. I addressed some of these issues as minor comments, but please go through the whole manuscript.
- Please include in the data section more information about the different AiOWSs and AWSs used in this study. Consider putting the exact locations of all the different AiOWSs and AWSs in the left map in Figure 1 and showing the different AiOWSs. Also, include some additional information/specifications about all these weather stations, e.g. are they solar-powered or battery-powered, what is the temporal resolution, how do they measure rainfall (using a tipping bucket system, a drip count system, or otherwise)? Maybe some of this information can be presented in a table.
- For clarity I suggest revising 3.1, making it clearer which instruments were functional at the end and which ones not, so when Figure 2 is presented, it is more obvious why not all instruments are shown. For example, first write:
“Prior to deployment, each AiO WS underwent calibration laboratory testing to quantify baseline accuracy, operational range, and to verify consistency with manufacturer specifications. All AiO WS tested initially met manufacturer specifications (Table 1). After being deployed in the field for a minimum of 5 years, the AiO WS were removed and reassessed in the calibration laboratory.”
Next, discuss which instruments worked for how long, e.g.:
“Both Davis VVue units (TX7 and TX8), and the METER ATMOS41 remained functional at the end of deployment (TX7 and TX8 for 10 years, the METER for 5 years). The Vaisala remained active for more than 13 years in the field, eventually failing in July 2024. Both Davis VP2 units (TX1 after 7 years and 4 months, and TX2 after 6 years and 8 months) and the Lufft WS601 ceased transmitting data.”
Then discuss individual sensors, which (temporarily) failed. Please make it clear which sensor worked, and which did not, etc. E.g. in L166-168 it is not clear which part is referred to in “partially functional again”, same for L168 “partially recovered”.
- Captions of figures and tables are sometimes missing or do not provide enough information. E.g. the caption of Table 2 is missing. In Figures 4 and 5 it is not clear how the bias is calculated.
- The bias and MAE give insight into the systematic and average error. For a more complete analysis I would recommend also using the Pearson correlation coefficient. Did you check what the correlation between the different sensors from TX1 and TX2 is, and between TX7 and TX8? This also gives information about the accuracy of these stations.
- Several studies demonstrate the potential of AiOWSs (see for example the references listed below), whereas the present study finds for example that precipitation measurements are unreliable. Can you please discuss these different findings?
Minor comments:
- The literature provided in the introduction is limited. Please consider adding some additional literature in the introduction (L73-88), you may include following literature if you find them fitting:
- https://doi.org/10.5194/nhess-20-299-2020 investigates how these AiOWSs can contribute observing deep-convection processes.
- https://doi.org/10.1175/JAMC-D-11-0135.1 uses observations from AiOWSs to quantify urban heat islands.
- https://doi.org/10.2166/nh.2023.136 how rainfall observations can fill the gap from official monitoring networks.
- https://doi.org/10.5194/hess-29-4585-2025 evaluates (heavy) rainfall observations from AiOWSs against reference gauges.
- https://doi.org/10.1002/qj.3811 investigates the potential of wind data from AiOWSs.
- https://doi.org/10.1088/1748-9326/ac5c0f investigates the potential of citizen weather stations in capturing complex dynamical and physical processes in urban environments.
- https://doi.org/10.5194/nhess-24-907-2024 evaluates what the benefit of assimilating pressure data from AiOWSs is in numerical weather predictions.
- L33: AiOWS is singular, AiOWSs is plural. Please adjust throughout the text.
- L34: AWS is singular, AWSs is plural. Please adjust throughout the text.
- L43: Add “Royal Netherlands Meteorological Institute”
- L73: Remove “indeed”
- L75: not only nowcasting also for numerical weather predictions: https://doi.org/10.5194/nhess-24-907-2024
- L100: Remove: “systems”.
- L104: Do you have a source that says they are poorly maintained? Otherwise, it is better to state that these are likely not maintained according to WMO guidelines.
- L188: Which ones are new?
- L202-203 & L204: Try to avoid one or two sentenced paragraphs.
- L224-L251: Try to avoid words like “excellent”, this is subjective.
- L241: This sentence is not clear: “If the drip is too small or large a volume, ....”
- L248: What are low temperatures, please quantify.
- L270: Over all the years of deployment, or which period?
- L280-281: please quantify
- L285: Consider changing it into: “Degradation for temperature, wind and rain sensors is seemingly governed less by….”
- L288-291: Suggest to revise and not use two times “whilst” in one sentence.
- L303-307: Please improve clarity
- L322-L323: Please improve the clarity, e.g.: “All anemometers were underestimating windspeed compared to the reference AWS at 10m height. This underestimation was primarily due to the different height at which the AiOWSs were installed, namely 1.5m, and thus influenced by surface roughness at the ground.”
- L324: Not clear what is meant by “our binned AiOWS”.
- Figure 4 and 5: How is this bias determined? Is this averaged over each 5 min?
- L328: Remove “Precipitation.”
- L339: Remove “Temperature.”
- L339: Not clear what is meant by ‘new units’
- L341: platforms --> do you mean instruments?
- L465: Now you use PWS instead of AiOWS, please be consistent.
Citation: https://doi.org/10.5194/egusphere-2025-5194-RC2
- AC2: 'Reply on RC2', Christopher Brown, 09 Feb 2026
The authors address the performance and durability of six all-in-one weather stations (AiOWSs) over multiple years, by evaluating them against a reference station both in the lab and in the field. The study identifies the weaknesses and maintenance needs of these AiOWSs. Given that over the past decades there is a growing interest in using AiOWSs as an additional data source, this study is a relevant addition to the literature and is suited for AMT.
We thank the reviewer for their careful reading and for the useful comments on the manuscript. We are happy to read the reviewer finds the analysis and results of relevance to the community.
Major comments:
Overall, the level of English is good, however, the manuscript would benefit from improvements in readability and the structure. There are several typos or awkwardly formulated sentences, I recommend careful proofreading and revision to improve the clarity and flow. I addressed some of these issues as minor comments, but please go through the whole manuscript.
We thank the reviewer for the suggestions in the list of minor comments, we have followed most/all of those (see our replies below). Furthermore we have worked on improving flow and clarity throughout.
Please include in the data section more information about the different AiOWSs and AWSs used in this study. Consider putting the exact locations of all the different AiOWSs and AWSs in the left map in Figure 1 and showing the different AiOWSs. Also, include some additional information/specifications about all these weather stations, e.g. are they solar-powered or battery-powered, what is the temporal resolution, how do they measure rainfall (using a tipping bucket system, a drip count system, or otherwise)? Maybe some of this information can be presented in a table.
The different AiO units were placed very closely together (within 5 metres of each other) on our test field. As such, on the satellite image they would be indistinguishable. We have added more details on the specific AiOWS to clarify this, agreeing with the point raised by reviewer 2.
For clarity I suggest revising 3.1, making it clearer which instruments were functional at the end and which ones not, so when Figure 2 is presented, it is more obvious why not all instruments are shown. For example, first write:
“Prior to deployment, each AiO WS underwent calibration laboratory testing to quantify baseline accuracy, operational range, and to verify consistency with manufacturer specifications. All AiO WS tested initially met manufacturer specifications (Table 1). After being deployed in the field for a minimum of 5 years, the AiO WS were removed and reassessed in the calibration laboratory.”
Next, discuss which instruments worked for how long, e.g.:
“Both Davis VVue units (TX7 and TX8), and the METER ATMOS41 remained functional at the end of deployment (TX7 and TX8 for 10 years, the METER for 5 years). The Vaisala remained active for more than 13 years in the field, eventually failing in July 2024. Both Davis VP2 units (TX1 after 7 years and 4 months, and TX2 after 6 years and 8 months) and the Lufft WS601 ceased transmitting data.”
Then discuss individual sensors, which (temporarily) failed. Please make it clear which sensor worked, and which did not, etc. E.g. in L166-168 it is not clear which part is referred to in “partially functional again”, same for L168 “partially recovered”.

We thank the reviewer for this suggestion; indeed a re-ordering of the Results section has improved clarity. We have reorganized to form the following subsections:
3.1 AiO WS Life Span
3.2 Laboratory Calibration Post-Deployment
3.3 Field Performance
Regarding the Davis VP2s, we have clarified the temporary regaining of some functionality in the wind speed sensors. In general, we have modified the paragraphs in (new) Section 3.1 to improve the clarity of the life span of AiOWS units and individual sensors.
Captions of figures and tables are sometimes missing or do not provide enough information. E.g. the caption of Table 2 is missing. In Figures 4 and 5 it is not clear how to bias is calculated.
We have improved all captions, as described in the response from reviewer 1.
The bias and MAE give insight into the systematic and average error. For a more complete analysis I would recommend also using the Pearson correlation coefficient. Did you check what the correlation between the different sensors from TX1 and TX2 is, and between TX7 and TX8? This also gives information about the accuracy of these stations.
This is a good suggestion, we will try this, and see if it elucidates further information on the error.
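The check the reviewer suggests, correlating the paired duplicate units, can be sketched minimally as follows (a hypothetical sketch, not the authors' code; the series and variable names are assumed, with both inputs being pandas Series on matched timestamps):

```python
import pandas as pd

def pairwise_pearson(a, b):
    """Pearson correlation between two collocated sensor series,
    computed only on timestamps where both report valid data."""
    df = pd.DataFrame({"a": a, "b": b}).dropna()
    return df["a"].corr(df["b"])  # Series.corr is Pearson by default

# Hypothetical usage with the paired duplicate units:
# r_vp2  = pairwise_pearson(tx1_temperature, tx2_temperature)
# r_vvue = pairwise_pearson(tx7_temperature, tx8_temperature)
```

Because the paired units are identical models at the same site, a correlation well below 1 between them points at unit-to-unit variability or degradation rather than environmental differences, which is complementary to the bias/MAE view.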
Several studies demonstrate the potential of AiOWSs (see for example the references listed below), whereas the present study finds for example that precipitation measurements are unreliable. Can you please discuss these different findings?
This comment overlaps with similar requests from the first reviewer (e.g. on humidity), so we agree that this will enhance the paper and this will be added.
Minor comments:
The literature provided in the introduction is limited. Please consider adding some additional literature in the introduction (L73-88), you may include following literature if you find them fitting:
- https://doi.org/10.5194/nhess-20-299-2020 investigates how these AiOWSs can contribute observing deep-convection processes.
- https://doi.org/10.1175/JAMC-D-11-0135.1 uses observations from AiOWSs to quantify urban heat islands.
- https://doi.org/10.2166/nh.2023.136 how rainfall observations can fill the gap from official monitoring networks.
- https://doi.org/10.5194/hess-29-4585-2025 evaluates (heavy) rainfall observations from AiOWSs against reference gauges.
- https://doi.org/10.1002/qj.3811 investigates the potential of wind data from AiOWSs.
- https://doi.org/10.1088/1748-9326/ac5c0f investigates the potential of citizen weather stations in capturing complex dynamical and physical processes in urban environments.
- https://doi.org/10.5194/nhess-24-907-2024 evaluates what the benefit of assimilating pressure data from AiOWSs is in numerical weather predictions.
We thank the reviewer for these suggestions; as with the above, we have added more information.
L33: AiOWS is singular, AiOWSs is plural. Please adjust throughout the text.
We have changed this throughout and will ensure that the correct singular/plural usage is maintained.
L34: AWS is singular, AWSs is plural. Please adjust throughout the text.
We have changed this throughout.
L43: Add “Royal Netherlands Meteorological Institute”
Done
L73: Remove “indeed”
Done
L75: not only for nowcasting but also for numerical weather prediction: https://doi.org/10.5194/nhess-24-907-2024
We thank the reviewer for the reference, we have discussed this in the text.
L100: Remove: “systems”.
Done
L104: Do you have a source that says they are poorly maintained? Otherwise, it is better to state that these are likely not maintained according to WMO guidelines.
We have reworded this sentence for clarity
L188: Which ones are new?
We meant all AiOWSs; we have removed the word ‘new’ from the sentence.
L202-203 & L204: Try to avoid one- or two-sentence paragraphs.
We have joined the sentences into a single paragraph on the relative humidity tests, and have similarly merged overly short paragraphs elsewhere in the manuscript.
L224-L251: Try to avoid words like “excellent”, this is subjective.
Replaced with “performed within specifications”.
L241: This sentence is not clear: “If the drip is too small or large a volume, ....”
We have reworded this sentence for clarity
L248: What are low temperatures, please quantify.
We have clarified this; indeed, as the Netherlands has a mild climate, a temperature of –10 °C is not universally viewed as low.
L270: Over all the years of deployment, or which period?
Over all the years of deployment
L280-281: please quantify
Amended
L285: Consider changing it into: “Degradation for temperature, wind and rain sensors is seemingly governed less by….”
Agreed, this improves the sentence; we thank the reviewer for the suggestion.
L288-291: Suggest revising so that “whilst” is not used twice in one sentence.
The sentence is reworded
L303-307: Please improve clarity
We have rewritten this paragraph
L322-L323: Please improve the clarity, e.g.: “All anemometers were underestimating windspeed compared to the reference AWS at 10m height. This underestimation was primarily due to the different height at which the AiOWSs were installed, namely 1.5m, and thus influenced by surface roughness at the ground.”
Done, statement now reads:
“In the comparison with the 10 m AWS wind data, the sonic-anemometer-equipped METER showed values closest to the AWS, while the cup-and-vane-equipped Davis VUE deviated most (Fig. 5); all AiOWSs underestimated wind speed relative to the 10 m reference, primarily because they were installed at a lower height (1.5 m) and were therefore more strongly influenced by surface roughness near the ground.”
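As a rough illustration of the height effect described in the revised statement, a neutral-stability logarithmic wind profile is one common way to relate speeds measured at 1.5 m and 10 m. The roughness length below is an assumed open-terrain value for illustration, not one derived from the study:

```python
import math

def log_profile_factor(z_from, z_to, z0):
    """Ratio u(z_to)/u(z_from) under a neutral logarithmic wind profile."""
    return math.log(z_to / z0) / math.log(z_from / z0)

# scale a 1.5 m AiOWS wind reading to an equivalent 10 m value,
# assuming an (illustrative) open-terrain roughness length of 0.03 m
u_15 = 3.0                                    # m/s measured at 1.5 m
factor = log_profile_factor(1.5, 10.0, 0.03)  # ~1.48 for these heights
u_10 = u_15 * factor                          # equivalent 10 m wind speed
```

With these assumptions, a 1.5 m reading of 3 m/s corresponds to roughly 4.5 m/s at 10 m, consistent with the systematic underestimation the response describes; stability effects and local obstacles would of course complicate a real correction.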
L324: Not clear what is meant by “our binned AiOWS”.
This is indeed unclear: the data are binned, rather than the physical AiOWS(!). We have adjusted the wording.
Figure 4 and 5: How is this bias determined? Is this averaged over each 5 min?
We are going to change this figure, on the advice of the first reviewer, to mirror the technique used by Simon Bell. We will be sure to detail the full calculation associated with the new method.
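As a generic illustration only (not the Bell-style method the revision will adopt), a per-bin bias can be obtained by averaging paired AiOWS-minus-reference differences within bins of the reference value; all names and numbers here are illustrative:

```python
import numpy as np

def binned_bias(ref, obs, edges):
    """Mean (obs - ref) difference within each bin of the reference values."""
    ref, obs = np.asarray(ref, float), np.asarray(obs, float)
    idx = np.digitize(ref, edges)            # bin index for each reference value
    biases = []
    for i in range(1, len(edges)):
        in_bin = idx == i
        biases.append(np.mean(obs[in_bin] - ref[in_bin]) if in_bin.any() else np.nan)
    return np.array(biases)

# toy example: AiOWS reads 0.5 high below 5 and 0.2 low above 5
ref = np.array([1.0, 2.0, 6.0, 8.0])
obs = np.array([1.5, 2.5, 5.8, 7.8])
per_bin = binned_bias(ref, obs, edges=[0, 5, 10])  # per-bin biases of about [0.5, -0.2]
```

Whatever method is ultimately used, stating the averaging window (e.g. 5 min) and the binning variable explicitly in the caption, as the reviewer requests, removes the ambiguity.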
L328: Remove “Precipitation.”
Done
L339: Remove “Temperature.”
Done
L339: Not clear what is meant by ‘new units’
This was poorly phrased by us; the “new unit” refers to an AiOWS that has been newly manufactured and is being deployed for the first time.
L341: platforms --> do you mean instruments?
Indeed, we do. Thanks we have adjusted
L465: Now you use PWS instead of AiOWS, please be consistent.
Changed to AiOWS throughout.
Citation: https://doi.org/10.5194/egusphere-2025-5194-AC2
The authors have performed a very interesting, long-term study on the quality of "all-in-one" weather stations, or Personal Weather Stations, looking at the decay of quality over time when these stations are operated without maintenance. It is a unique study, and thereby worthy of publication in my opinion, since it addresses a big unknown in the use of non-WMO data (quality drift over time). However, the presentation of the results, the structure of the text and the meager application discussions leave the overall quality of the manuscript something to be desired. I recognize that additional experiments for such a long-term study are impossible, but there are some scientific improvements to be made nevertheless before the article is fully suitable for publication. Hence my recommendation for Major Revisions, with the side note that this mainly concerns the framing, structure and presentation of the work and not so much its experimental core.
A major point I felt while reading through the manuscript is that, while it is a really interesting work, the presentation feels shallow. The authors do not go much beyond presenting some statistics on performance, and the only comparisons made are to the standard WMO table of station siting (as well as their reference data at the weather field). I would have liked to see some comparisons to similar studies, or studies using PWS data: for instance in section 3.2.1, a lot of the somewhat cheaper brands of PWS (e.g. Netatmo) suffer from moisture retention at high RH values. What you see in those stations is that moisture gets inside the sensor and oversaturates it (RH reported at nearly 100) for a long time. This problem of moisture pooling inside the sensor also affects e.g. the Netatmo sonic anemometer, understandably deteriorating its performance. It would have been interesting to draw those comparisons and look a little further than just the findings in the field: what do they mean?

Similarly, the authors could dive a little deeper into the data: I understand that for wind observations a direct comparison to a different height is tricky, but it would benefit the manuscript if that were at least given a go. As it stands, the wind results, as well as the rain results because of the equipment failure, feel fairly underwhelming and inconclusive.

On the application side of things, I feel the focus is too much on direct comparison to WMO guidelines and equipment, which will always be an unfair comparison. Rather, the power and interesting use cases of PWS data lie in those locations where WMO siting will always be imperfect: heterogeneous terrain and especially cities. So rather than focus on the poor performance, I would like to see the authors' thoughts on when these data CAN help: where and how should we as scientists, or citizen scientists, deploy these stations in order to have them both running well for a longer time and providing good data?
There are quite a few other studies using PWS data (the authors already mention a few) that are pretty positive on their usage, but a thorough discussion of the link between this work and those studies is now missing from this paper. Creating that connection, between this well-studied field experiment and those opportunistic sensing studies would strengthen the field as a whole.
The figures don't really help with that feeling of shallow presentation: Figures 4 and 5 especially are giant tables, without proper captions, that I cannot read very well in the printed version of the manuscript. The presentation idea is very nice, showing the bias in time, but providing a giant table without context is fairly overwhelming. Also, the RH colorbar is counterintuitive: positive biases would mean higher RH for the observations, which tends to be colored blue (minor detail). Figure 2 is of quite low resolution. Figure 6 is quite nice as an example of the level of filth that can accumulate in rain gauges, though a small explanation of the scale bar at the bottom would be welcome (I imagine it's a ruler in cm?). Table 1 could be referred to a bit more often when WMO siting classes are referenced in the text, e.g. in the conclusions. In that table, an overview of the measurement equipment beyond their accuracy would be helpful: e.g. the type of wind sensor, whether they have a radiation shield, a single or double tipping bucket, etc., for easier comparison between the brands of PWS.
Some smaller comments, issues and points below: