the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in built-up regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0
Abstract. Aerodynamic roughness length (z0) is a key parameter determining near-surface wind profiles, significantly influencing wind-related studies and applications. In built-up areas, surface roughness has been substantially altered by land use changes such as urbanization. However, many numerical models assign z0 values based on vegetation cover types, neglecting urban effects. This has resulted in a lack of reliable z0 data in built-up regions. To address this issue, this study proposed a cost-effective method to estimate z0 values at weather stations by adjusting z0 values to minimize the wind speed differences between ERA5 reanalysis data and weather station observation data. Using this approach, z0 values were derived for 1,805 stations in the built-up areas across China. Based on these estimates, a high-resolution monthly gridded z0 dataset was then developed for built-up areas in China using Random Forest Regression algorithm. Simulations with Weather Research and Forecasting (WRF) model show that implementation of the new z0 dataset significantly improves the accuracy of 10-m wind speed over built-up areas, reducing mean wind speed errors by 89.9 % and 88.9 % compared to the default z0 in WRF and a latest gridded z0 dataset, respectively. Independent validations of 100-m wind speed against anemometer tower data further confirm the dataset’s reliability. Therefore, this approach is valuable for wind-dependent studies and applications, such as urban planning, air quality management, and wind energy utilization, by enabling more accurate simulations of wind speed in built-up areas.
- Preprint
(2614 KB) - Metadata XML
-
Supplement
(409 KB) - BibTeX
- EndNote
Status: open (until 18 Jun 2025)
-
CC1: 'Comment on egusphere-2025-1513', Cheng Shen, 04 May 2025
reply
The manuscript proposes a method for refined estimation of aerodynamic roughness length (z0) in urban built-up areas and applies the results to improve near-surface wind speed simulations in the WRF model. The authors utilized ERA5 reanalysis data and China Meteorological Administration (CMA) station observations to optimize z0 values, subsequently employing a Random Forest Regression algorithm to generate a high-resolution gridded z0 dataset. The simulations indicate significant improvements in the accuracy of 10 m and 100 m wind speeds in Chinese urban areas. However, there are significant limitations in the study. My comments below:
1: The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
2: The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
3: Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
4: The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
Given these substantial issues, I recommend rejecting this manuscript in its current form.
Citation: https://doi.org/10.5194/egusphere-2025-1513-CC1 -
AC1: 'Reply on CC1', Kun Yang, 05 May 2025
reply
Dear Dr. Shen,
Â
We sincerely appreciate your time and effort for your comments on our manuscript, which may help us improve our work. Nevertheless, we would like to point out that three (Q1, Q2, and Q4) out of your four comments have already been extensively addressed in our original manuscript. Please see our point-by-point responses below.
Â
Q1. The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
Response: In our manuscript, we have addressed this concern by presenting a two-fold justification for our assumption. First, we evaluated ERA5 100-m winds with measurements from 589 wind towers across China, each providing months to years of data spanning different periods between 2004 and 2022. The results show that ERA5 exhibits a smaller mean bias percentage in eastern regions compared to western areas, supporting its higher reliability in eastern regions. This finding led us to focus primarily on weather stations in eastern China and to derive z0 for 1,805 stations in these built-up regions. Please see this evaluation in Section 3.1. Second, we further validated our assumption through model experiments. The gridded z0 dataset was tested in WRF simulations and independently evaluated against both unseen station data (10-m winds) and additional tower measurements (100-m winds). Both validation tests confirmed significant improvements in wind speed simulations. Please see the model improvement in Section 3.3.
Regarding the applicability of our assumption in complex terrain areas with significant local environmental variations, we have explicitly addressed this limitation in our study (Lines 281-285). Our analysis shows that the gridded z0 produced based on station estimates provides only limited improvement for wind speed simulations in topographically complex regions. This suggests two possible explanations: (1) our fundamental assumption may not hold well in such areas, or more likely, (2) z0 is not the sole determinant of wind speed in these regions. As discussed in our manuscript, wind patterns in complex terrain are governed by multi-scale physical processes including microscale terrain features, turbulent orographic form drag, thermally-driven mountain-valley circulations, and mountain wave dynamics. These processes may make the simple z0-wind speed relationship invalid in flat terrain.
Â
Q2. The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
Response: We appreciate your reference to Zhang and Wang's study regarding wind speed inhomogeneity in CMA stations. Their work identified significant inhomogeneities with breakpoints concentrated in the late 1970s, mid-1990s, and early 2000s, but they do not affect our results. Our study exclusively uses CMA station data from 2015-2019, when the CMA network had already completed its transition to automated observations with standardized instruments. In addition, we conducted quality control procedures before use, including missing value screening, physical range validation, and temporal consistency checks.
Â
Q3. Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
Response: In our study, we have conducted comprehensive sensitivity tests at every step of the random forest (RF) methodology to ensure the robustness of our results in Section 2.3 and Figure 3. Specifically, for data partitioning, we evaluated the impact of random seed selection when splitting the dataset into training and test subsets (Figure 3a); for parameter tuning, we systematically adjusted multiple key parameters (e.g., max_depth, n_estimators, min_samples_split, min_samples_leaf and so on) and provided detailed sensitivity analysis on the most influential parameter--the number of decision trees (Figure 3b); for model validation: a five-fold cross-validation approach was used to further verify the stability of our model (Figure 3c); for feature importance, we conducted thorough feature importance analysis to identify the dominant predictors (Figure 3e). These rigorous sensitivity tests confirm the reliability of our RF model. Please refer to Section 2.3 for a complete description of the methodology.
Â
Q4. The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
Given these substantial issues, I recommend rejecting this manuscript in its current form.
Response: We appreciate the reviewer's suggestion regarding the simulation period selection. Our choice to focus on April was motivated by both physical and practical considerations. As shown in Figure S3, April consistently exhibits the highest mean wind speeds across our study domain, making simulated wind speeds particularly sensitive to z0 effects and thus ideal for evaluating our parameterization. To ensure robust results while managing computational constraints, we employed a carefully designed re-initialization approach where each 36-hour simulation (initialized daily at 12:00 LT (LT=UTC+8)) included a 12-hour spin-up period followed by 24 hours of analysis. This strategy produced 30 independent realizations, capturing diverse meteorological conditions throughout April. The consistent improvement in wind speed simulations across all cases (Section 3.3) strongly supports the reliability of our findings. While the current results are statistically robust, we may extend simulations to other months to further validate the general applicability of our z0 dataset under varying climatic conditions.
Â
We hope that we have addressed your concerns. We remain open to further feedback and are committed to improving the quality of our work.
Â
Thank you very much!
Â
Sincerely,
Jiamin Wang and Kun Yang
On behalf of all co-authors-
CC2: 'Reply on AC1', Cheng Shen, 06 May 2025
reply
Thank you very much for your detailed and comprehensive responses to my comments. Your clarifications significantly improve my understanding of your methodology and your results. I suggest explicitly mentioning this broader implication in your discussion to further strengthen the reliability and applicability of your findings.
Again, thank you for addressing my concerns thoroughly.
Citation: https://doi.org/10.5194/egusphere-2025-1513-CC2
-
CC2: 'Reply on AC1', Cheng Shen, 06 May 2025
reply
-
AC1: 'Reply on CC1', Kun Yang, 05 May 2025
reply
Model code and software
all codes Jiamin Wang and Kun Yang https://doi.org/10.5281/zenodo.15108200
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
163 | 22 | 8 | 193 | 16 | 4 | 5 |
- HTML: 163
- PDF: 22
- XML: 8
- Total: 193
- Supplement: 16
- BibTeX: 4
- EndNote: 5
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1