Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in built-up regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0

Wang, Jiamin; Yang, Kun; Liu, Jiarui; Zhou, Xu; Ma, Xiaogang; Tang, Wenjun; Yuan, Ling; Ren, Zuhuan

doi:10.5194/egusphere-2025-1513

23 Apr 2025

Abstract. Aerodynamic roughness length (z₀) is a key parameter determining near-surface wind profiles, significantly influencing wind-related studies and applications. In built-up areas, surface roughness has been substantially altered by land use changes such as urbanization. However, many numerical models assign z₀ values based on vegetation cover types, neglecting urban effects. This has resulted in a lack of reliable z₀ data in built-up regions. To address this issue, this study proposes a cost-effective method that estimates z₀ at weather stations by adjusting it to minimize the wind speed differences between ERA5 reanalysis data and weather station observations. Using this approach, z₀ values were derived for 1,805 stations in built-up areas across China. Based on these estimates, a high-resolution monthly gridded z₀ dataset was then developed for built-up areas in China using a Random Forest Regression algorithm. Simulations with the Weather Research and Forecasting (WRF) model show that implementing the new z₀ dataset significantly improves the accuracy of 10-m wind speed over built-up areas, reducing mean wind speed errors by 89.9 % and 88.9 % compared to the default z₀ in WRF and a recent gridded z₀ dataset, respectively. Independent validations of 100-m wind speed against anemometer tower data further confirm the dataset’s reliability. Therefore, this approach is valuable for wind-dependent studies and applications, such as urban planning, air quality management, and wind energy utilization, by enabling more accurate simulations of wind speed in built-up areas.

Received: 30 Mar 2025 – Discussion started: 23 Apr 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

16 Dec 2025

Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in high-roughness surface regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0

Jiamin Wang, Kun Yang, Jiarui Liu, Xu Zhou, Xiaogang Ma, Wenjun Tang, Ling Yuan, and Zuhuan Ren

Geosci. Model Dev., 18, 10077–10094, https://doi.org/10.5194/gmd-18-10077-2025, 2025

Interactive discussion

Status: closed

CC1:
'Comment on egusphere-2025-1513', Cheng Shen, 04 May 2025

The manuscript proposes a method for refined estimation of aerodynamic roughness length (z0) in urban built-up areas and applies the results to improve near-surface wind speed simulations in the WRF model. The authors utilized ERA5 reanalysis data and China Meteorological Administration (CMA) station observations to optimize z0 values, subsequently employing a Random Forest Regression algorithm to generate a high-resolution gridded z0 dataset. The simulations indicate significant improvements in the accuracy of 10 m and 100 m wind speeds in Chinese urban areas. However, there are significant limitations in the study. My comments below:
1: The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
2: The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
3: Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
4: The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
Given these substantial issues, I recommend rejecting this manuscript in its current form.

Citation: https://doi.org/10.5194/egusphere-2025-1513-CC1
- AC1:
  'Reply on CC1', Kun Yang, 05 May 2025
  
  Dear Dr. Shen,
  
  We sincerely appreciate your time and effort for your comments on our manuscript, which may help us improve our work. Nevertheless, we would like to point out that three (Q1, Q2, and Q4) out of your four comments have already been extensively addressed in our original manuscript. Please see our point-by-point responses below.
  
  Q1. The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
  Response: In our manuscript, we have addressed this concern by presenting a two-fold justification for our assumption. First, we evaluated ERA5 100-m winds with measurements from 589 wind towers across China, each providing months to years of data spanning different periods between 2004 and 2022. The results show that ERA5 exhibits a smaller mean bias percentage in eastern regions compared to western areas, supporting its higher reliability in eastern regions. This finding led us to focus primarily on weather stations in eastern China and to derive z₀ for 1,805 stations in these built-up regions. Please see this evaluation in Section 3.1. Second, we further validated our assumption through model experiments. The gridded z₀ dataset was tested in WRF simulations and independently evaluated against both unseen station data (10-m winds) and additional tower measurements (100-m winds). Both validation tests confirmed significant improvements in wind speed simulations. Please see the model improvement in Section 3.3.
  Regarding the applicability of our assumption in complex terrain areas with significant local environmental variations, we have explicitly addressed this limitation in our study (Lines 281-285). Our analysis shows that the gridded z₀ produced based on station estimates provides only limited improvement for wind speed simulations in topographically complex regions. This suggests two possible explanations: (1) our fundamental assumption may not hold well in such areas, or more likely, (2) z₀ is not the sole determinant of wind speed in these regions. As discussed in our manuscript, wind patterns in complex terrain are governed by multi-scale physical processes including microscale terrain features, turbulent orographic form drag, thermally-driven mountain-valley circulations, and mountain wave dynamics. These processes may invalidate, in complex terrain, the simple z₀-wind speed relationship that holds over flat terrain.
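For illustration only, the idea of adjusting z₀ so that ERA5-based winds match station observations can be sketched as follows. This is a minimal sketch that assumes a neutral logarithmic wind profile between 100 m and 10 m, which the reply does not state; the function names, the candidate grid, and the sample wind speeds are hypothetical and are not taken from the manuscript.

```python
import math

def wind_at_10m(u100, z0):
    """Scale a 100-m wind speed to 10 m with a neutral log law (an assumption)."""
    return u100 * math.log(10.0 / z0) / math.log(100.0 / z0)

def estimate_z0(u100_series, u10_obs_series, candidates):
    """Return the candidate z0 (m) that minimises the mean 10-m wind speed difference."""
    mean_obs = sum(u10_obs_series) / len(u10_obs_series)
    best_z0, best_err = None, float("inf")
    for z0 in candidates:
        mean_mod = sum(wind_at_10m(u, z0) for u in u100_series) / len(u100_series)
        err = abs(mean_mod - mean_obs)
        if err < best_err:
            best_z0, best_err = z0, err
    return best_z0

# Hypothetical candidate grid: log-spaced from 0.01 m to about 5 m (must stay below 10 m)
candidates = [10 ** (-2 + 0.01 * i) for i in range(271)]

# Synthetic check: build "observations" from a known z0 of 1.0 m and recover it
u100 = [6.0, 8.0, 10.0, 7.0]
u10_obs = [wind_at_10m(u, 1.0) for u in u100]
z0_hat = estimate_z0(u100, u10_obs, candidates)
print(f"estimated z0 = {z0_hat:.3f} m")
```

A grid search is used here because it is easy to follow; under the same log-law assumption the mean 10-m wind decreases monotonically with z₀, so a root finder would work equally well.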
  
  Q2. The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
  Response: We appreciate your reference to Zhang and Wang's study regarding wind speed inhomogeneity in CMA stations. Their work identified significant inhomogeneities with breakpoints concentrated in the late 1970s, mid-1990s, and early 2000s, but they do not affect our results. Our study exclusively uses CMA station data from 2015-2019, when the CMA network had already completed its transition to automated observations with standardized instruments. In addition, we conducted quality control procedures before use, including missing value screening, physical range validation, and temporal consistency checks.
  
  Q3. Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
  Response: In our study, we have conducted comprehensive sensitivity tests at every step of the random forest (RF) methodology to ensure the robustness of our results (Section 2.3 and Figure 3). Specifically, for data partitioning, we evaluated the impact of random seed selection when splitting the dataset into training and test subsets (Figure 3a); for parameter tuning, we systematically adjusted multiple key parameters (e.g., max_depth, n_estimators, min_samples_split, and min_samples_leaf) and provided a detailed sensitivity analysis of the most influential parameter, the number of decision trees (Figure 3b); for model validation, a five-fold cross-validation approach was used to further verify the stability of our model (Figure 3c); for feature importance, we conducted a thorough feature importance analysis to identify the dominant predictors (Figure 3e). These rigorous sensitivity tests confirm the reliability of our RF model. Please refer to Section 2.3 for a complete description of the methodology.
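The kinds of checks listed in this response (split-seed sensitivity, number of trees, five-fold cross-validation, feature importance) can be sketched as below. The parameter names quoted in the response match those of scikit-learn's RandomForestRegressor, so the sketch assumes that library; the manuscript's actual toolchain is not stated here. The data are synthetic stand-ins with three hypothetical predictors, and none of the scores correspond to results in the paper.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score, train_test_split

rng = np.random.default_rng(0)
# Synthetic stand-in data: three hypothetical predictors, only the first drives the target
X = rng.uniform(0.0, 1.0, size=(300, 3))
y = 2.0 * X[:, 0] + 0.1 * rng.normal(size=300)

# Sensitivity to the random seed used for the train/test split
seed_scores = []
for seed in range(3):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=seed)
    model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X_tr, y_tr)
    seed_scores.append(model.score(X_te, y_te))  # R^2 on the held-out subset

# Sensitivity to the number of decision trees (mean R^2 over five folds)
tree_scores = {}
for n in (10, 50, 100):
    model = RandomForestRegressor(n_estimators=n, random_state=0)
    tree_scores[n] = cross_val_score(model, X, y, cv=5, scoring="r2").mean()

# Five-fold cross-validation and feature importance for one configuration
final = RandomForestRegressor(
    n_estimators=100, max_depth=None, min_samples_split=2,
    min_samples_leaf=1, random_state=0,
)
cv_scores = cross_val_score(final, X, y, cv=5, scoring="r2")
importances = final.fit(X, y).feature_importances_

print("split-seed R^2:", np.round(seed_scores, 3))
print("R^2 by n_estimators:", {n: round(float(s), 3) for n, s in tree_scores.items()})
print("5-fold R^2:", np.round(cv_scores, 3))
print("feature importances:", np.round(importances, 3))
```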
  
  Q4. The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
  Given these substantial issues, I recommend rejecting this manuscript in its current form.
  Response: We appreciate the reviewer's suggestion regarding the simulation period selection. Our choice to focus on April was motivated by both physical and practical considerations. As shown in Figure S3, April consistently exhibits the highest mean wind speeds across our study domain, making simulated wind speeds particularly sensitive to z₀ effects and thus ideal for evaluating our parameterization. To ensure robust results while managing computational constraints, we employed a carefully designed re-initialization approach in which each 36-hour simulation (initialized daily at 12:00 LT, where LT = UTC+8) included a 12-hour spin-up period followed by 24 hours of analysis. This strategy produced 30 independent realizations, capturing diverse meteorological conditions throughout April. The consistent improvement in wind speed simulations across all cases (Section 3.3) strongly supports the reliability of our findings. While the current results are statistically robust, we may extend simulations to other months to further validate the general applicability of our z₀ dataset under varying climatic conditions.
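The re-initialization schedule described in this response can be laid out with a short script. This is a sketch under stated assumptions: the year is hypothetical (the response does not give one), and the first run is assumed to start at 12:00 LT on 31 March so that the thirty 24-hour analysis windows tile April exactly; the function name is likewise hypothetical.

```python
from datetime import datetime, timedelta, timezone

LT = timezone(timedelta(hours=8))  # local time as defined in the response: UTC+8

def reinit_schedule(first_init, n_runs, spinup_h=12, analysis_h=24):
    """Return (init, analysis_start, analysis_end) for daily re-initialized runs."""
    runs = []
    for i in range(n_runs):
        init = first_init + timedelta(days=i)
        start = init + timedelta(hours=spinup_h)   # end of spin-up
        end = start + timedelta(hours=analysis_h)  # end of the 36-hour run
        runs.append((init, start, end))
    return runs

# Hypothetical year and first initialization time (assumptions, see the text above)
runs = reinit_schedule(datetime(2019, 3, 31, 12, tzinfo=LT), n_runs=30)

for init, start, end in (runs[0], runs[-1]):
    print(f"init {init:%Y-%m-%d %H:%M} LT | analysis {start:%Y-%m-%d %H:%M} to {end:%Y-%m-%d %H:%M} LT")
```

With these assumptions each analysis window begins where the previous one ends, so the 30 runs cover the month without gaps or overlap in the analyzed hours.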
  
  We hope that we have addressed your concerns. We remain open to further feedback and are committed to improving the quality of our work.
  
  Thank you very much!
  
  Sincerely,
  
  Jiamin Wang and Kun Yang
  
  On behalf of all co-authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC1
  - CC2: 'Reply on AC1', Cheng Shen, 06 May 2025
    
    Thank you very much for your detailed and comprehensive responses to my comments. Your clarifications significantly improve my understanding of your methodology and your results. I suggest explicitly mentioning this broader implication in your discussion to further strengthen the reliability and applicability of your findings.
    Again, thank you for addressing my concerns thoroughly.
    
    Citation: https://doi.org/10.5194/egusphere-2025-1513-CC2
RC1:
'Comment on egusphere-2025-1513', Anonymous Referee #1, 27 May 2025

This study estimated the aerodynamic roughness length (Z0) values using ERA5 analyses and weather station observations to improve the near-surface wind speed modeling. Technically, the Random Forest Regression algorithm is suitable for the estimation of Z0, and the results are encouraging, significantly improving the wind speed simulation in the WRF model. However, the evaluation of the improved Z0 on the WRF near-surface wind simulation was only for one month, and a longer time evaluation is needed. Therefore, I recommend Major Revision in this round.
Major comments:
1. The new estimated Z0 values were only evaluated for 1 month. A longer time evaluation should be conducted for a thorough evaluation.
2. The grid-based Z0 statistics are only available in the inner domain. This indicates that the Z0 could only be improved where there are surface weather station observations. How can the Z0 estimation be improved in areas without good coverage of surface weather station observations? More discussion should be included.
Minor comments:
Line 39-41: It is a little bit confusing here. Please revise it to be clearer.
Line 47: ERA5 is the analysis from a DA system. In my opinion, it is the blend of observations and model forecasts. Therefore, it is not proper to use it as an example.
Line 54: What does it mean here 'low-type' and 'high-type'?
Line 87: Better to add surface weather station observations before CMA.
Line 192: This could be because of the altitude differences between observation sites and the model terrain.
Line 227: What is the temporal coverage of this monthly Z0 dataset?
Figure 5: Better to add a reference line of y = 0 in panel (c), indicating which has a smaller bias.
Line 317: The values are significantly large when verified against the mean values. However, a closer look at Fig. 7d shows that the improvements are not that large from the perspective of MAB and RMSE.
Figure 7: Better to add statistics of mean/rms/r in the panels of (a). For (d), the units of MAB are not m/s, likely %.
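The point raised for Line 317 can be illustrated with a small numerical sketch: a percentage reduction in mean bias can be large while MAB and RMSE change far less, because positive and negative errors cancel in the mean. The sketch assumes MAB denotes the mean absolute bias in m/s (the comment above suggests the manuscript may report it as a percentage), and all wind speeds are invented for illustration rather than taken from the paper.

```python
import math

def mean_bias(sim, obs):
    """Mean of (simulated - observed); errors of opposite sign cancel."""
    return sum(s - o for s, o in zip(sim, obs)) / len(obs)

def mab(sim, obs):
    """Mean absolute bias (assumed meaning of MAB)."""
    return sum(abs(s - o) for s, o in zip(sim, obs)) / len(obs)

def rmse(sim, obs):
    """Root-mean-square error."""
    return math.sqrt(sum((s - o) ** 2 for s, o in zip(sim, obs)) / len(obs))

def reduction_pct(before, after):
    """Percentage reduction in the magnitude of a metric."""
    return 100.0 * (abs(before) - abs(after)) / abs(before)

# Invented 10-m wind speeds (m/s)
obs = [3.0, 4.0, 5.0, 6.0]
sim_default = [5.0, 5.0, 7.0, 7.0]  # consistently too fast
sim_new = [4.0, 3.0, 6.5, 4.9]      # errors of mixed sign

bias_cut = reduction_pct(mean_bias(sim_default, obs), mean_bias(sim_new, obs))
mab_cut = reduction_pct(mab(sim_default, obs), mab(sim_new, obs))
rmse_cut = reduction_pct(rmse(sim_default, obs), rmse(sim_new, obs))

print(f"mean bias reduced by {bias_cut:.1f} %")
print(f"MAB reduced by {mab_cut:.1f} %")
print(f"RMSE reduced by {rmse_cut:.1f} %")
```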

Citation: https://doi.org/10.5194/egusphere-2025-1513-RC1
- AC2: 'Reply on RC1', Kun Yang, 06 Jun 2025
  
  We would like to express our sincere gratitude to you for your insightful comments and suggestions. Detailed responses are provided in the Supplement below.
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC2
RC2:
'Comment on egusphere-2025-1513', Anonymous Referee #2, 27 May 2025

General Comment:
This manuscript presents a novel and practical approach to improving the simulation of near-surface wind speed over built-up areas by refining the aerodynamic roughness length (𝑧₀) using a combination of ERA5 reanalysis and ground-based observations from the China Meteorological Administration (CMA). The authors developed a high-resolution monthly gridded 𝑧₀ dataset by applying a Random Forest Regression algorithm, and demonstrated its effectiveness through WRF simulations. The study is timely and potentially impactful for urban climate modeling and wind-related applications.
While the manuscript introduces a potentially useful methodology, the current version does not provide sufficient critical evaluation or methodological transparency. To be suitable for publication, the manuscript requires revision, including clarification of the observational setup, deeper theoretical consideration of the methodology's assumptions, and further analyses related to model resolution and 𝑧₀ scale dependency.
Major comments:
(1) Uncertainty about CMA Wind Observation Heights: The manuscript assumes that CMA stations provide 10-m wind speed observations. However, there is no clear documentation or justification of this assumption in the text. Are all CMA anemometers calibrated and installed precisely at 10 m above ground level? Given that the accuracy of 𝑧₀ estimation strongly depends on the reference height of the wind speed, this should be clarified and supported by official metadata or references. Otherwise, the credibility of the derived 𝑧₀ values may be significantly undermined.
(2) Circular Logic in Using ERA5 to Derive 𝑧₀ and Then Evaluating WRF Performance: The method uses ERA5 as the basis to derive optimal 𝑧₀ values, and then uses these 𝑧₀ values in WRF to simulate wind fields, which are subsequently compared to CMA observations. However, since the 𝑧₀ is essentially tuned to ERA5 wind characteristics, and WRF is driven by ERA5 data, it is not surprising that the WRF simulations become closer to observations. This circular logic reduces the strength of the validation. A deeper discussion is needed in the Discussion section to acknowledge this methodological dependency and to better clarify to what extent the improvements stem from 𝑧₀ refinement as opposed to alignment with the reanalysis base.
(3) Lack of Resolution-Dependent 𝑧₀ Consideration: The aerodynamic roughness length is known to be resolution-dependent due to varying representations of land cover and orography. However, the manuscript does not address why a single 𝑧₀ value (derived from coarser ERA5 resolution) is applied across finer-resolution WRF simulations. A justification is needed as to why scale-dependent roughness parameters were not considered, especially when moving from ERA5 (∼30 km) to WRF (3 km). Moreover, higher-resolution simulations are expected to better resolve local features influencing 𝑧₀. Has the relationship between horizontal resolution and 𝑧₀ been explored in this study? Such an analysis would greatly strengthen the work, and I recommend adding or expanding this aspect if possible.

Citation: https://doi.org/10.5194/egusphere-2025-1513-RC2
- AC3: 'Reply on RC2', Kun Yang, 06 Jun 2025
  
  We would like to express our sincere gratitude to you for your insightful comments and suggestions. Detailed responses are provided in the Supplement below.
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC3
AC4: 'Clarification on corrections to EGUSPHERE-2025-1513', Kun Yang, 11 Nov 2025

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1513/egusphere-2025-1513-AC4-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2025-1513-AC4

Interactive discussion

Status: closed

CC1:
'Comment on egusphere-2025-1513', Cheng Shen, 04 May 2025

The manuscript proposes a method for refined estimation of aerodynamic roughness length (z0) in urban built-up areas and applies the results to improve near-surface wind speed simulations in the WRF model. The authors utilized ERA5 reanalysis data and China Meteorological Administration (CMA) station observations to optimize z0 values, subsequently employing a Random Forest Regression algorithm to generate a high-resolution gridded z0 dataset. The simulations indicate significant improvements in the accuracy of 10 m and 100 m wind speeds in Chinese urban areas. However, there are significant limitations in the study. My comments below:
1: The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
2: The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
3: Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
4: The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
Given these substantial issues, I recommend rejecting this manuscript in its current form.

Citation: https://doi.org/10.5194/egusphere-2025-1513-CC1
- AC1:
  'Reply on CC1', Kun Yang, 05 May 2025
  
  Dear Dr. Shen,
  
  We sincerely appreciate your time and effort for your comments on our manuscript, which may help us improve our work. Nevertheless, we would like to point out that three (Q1, Q2, and Q4) out of your four comments have already been extensively addressed in our original manuscript. Please see our point-by-point responses below.
  
  Q1. The critical assumption that ERA5 100-m wind speed data closely aligns with observational data has not been sufficiently validated, especially for areas characterized by complex terrains or significant local environmental variations. The authors need to provide robust evidence supporting the applicability and limitations of this assumption.
  Response: In our manuscript, we have addressed this concern by presenting a two-fold justification for our assumption. First, we evaluated ERA5 100-m winds with measurements from 589 wind towers across China, each providing months to years of data spanning different periods between 2004 and 2022. The results show that ERA5 exhibits a smaller mean bias percentage in eastern regions compared to western areas, supporting its higher reliability in eastern regions. This finding led us to focus primarily on weather stations in eastern China and to derive z₀ for 1,805 stations in these built-up regions. Please see this evaluation in Section 3.1. Second, we further validated our assumption through model experiments. The gridded z₀ dataset was tested in WRF simulations and independently evaluated against both unseen station data (10-m winds) and additional tower measurements (100-m winds). Both validation tests confirmed significant improvements in wind speed simulations. Please see the model improvement in Section 3.3.
  Regarding the applicability of our assumption in complex terrain areas with significant local environmental variations, we have explicitly addressed this limitation in our study (Lines 281-285). Our analysis shows that the gridded z₀ produced based on station estimates provides only limited improvement for wind speed simulations in topographically complex regions. This suggests two possible explanations: (1) our fundamental assumption may not hold well in such areas, or more likely, (2) z₀ is not the sole determinant of wind speed in these regions. As discussed in our manuscript, wind patterns in complex terrain are governed by multi-scale physical processes including microscale terrain features, turbulent orographic form drag, thermally-driven mountain-valley circulations, and mountain wave dynamics. These processes may make the simple z₀-wind speed relationship invalid in flat terrain.
  
  Q2. The observation dataset without homogenization from CMA has shown large bias in https://journals.ametsoc.org/view/journals/clim/36/11/JCLI-D-22-0445.1.xml. This may significantly affect the generalizability and accuracy of z0 estimations across broader geographic contexts. Direct usage of the CMA wind data would absolutely reduce the robustness of the study. Thus, the homogenization on near-surface wind data is necessary.
  Response: We appreciate your reference to Zhang and Wang's study regarding wind speed inhomogeneity in CMA stations. Their work identified significant inhomogeneities with breakpoints concentrated in the late 1970s, mid-1990s, and early 2000s, but they do not affect our results. Our study exclusively uses CMA station data from 2015-2019, when the CMA network had already completed its transition to automated observations with standardized instruments. In addition, we conducted quality control procedures before use, including missing value screening, physical range validation, and temporal consistency checks.
  
  Q3. Although a Random Forest Regression model is employed, the sensitivity analysis of different feature variables lacks depth and clarity. The authors are encouraged to conduct comprehensive sensitivity analyses to clearly illustrate the theoretical rationale and practical implications of feature selection on model accuracy.
  Response: In our study, we have conducted comprehensive sensitivity tests at every step of the random forest (RF) methodology to ensure the robustness of our results in Section 2.3 and Figure 3. Specifically, for data partitioning, we evaluated the impact of random seed selection when splitting the dataset into training and test subsets (Figure 3a); for parameter tuning, we systematically adjusted multiple key parameters (e.g., max_depth, n_estimators, min_samples_split, min_samples_leaf and so on) and provided detailed sensitivity analysis on the most influential parameter--the number of decision trees (Figure 3b); for model validation: a five-fold cross-validation approach was used to further verify the stability of our model (Figure 3c); for feature importance, we conducted thorough feature importance analysis to identify the dominant predictors (Figure 3e). These rigorous sensitivity tests confirm the reliability of our RF model. Please refer to Section 2.3 for a complete description of the methodology.
  
  Q4. The validation of the model's performance is restricted to simulations for only one month, limiting the assessment of its robustness across different seasons or under varying long-term climatic conditions. The authors should include additional simulations covering multiple seasons or a full year to demonstrate the general applicability and reliability of their approach.
  Given these substantial issues, I recommend rejecting this manuscript in its current form.
  Response: We appreciate the reviewer's suggestion regarding the simulation period selection. Our choice to focus on April was motivated by both physical and practical considerations. As shown in Figure S3, April consistently exhibits the highest mean wind speeds across our study domain, making simulated wind speeds particularly sensitive to z₀ effects and thus ideal for evaluating our parameterization. To ensure robust results while managing computational constraints, we employed a carefully designed re-initialization approach where each 36-hour simulation (initialized daily at 12:00 LT (LT=UTC+8)) included a 12-hour spin-up period followed by 24 hours of analysis. This strategy produced 30 independent realizations, capturing diverse meteorological conditions throughout April. The consistent improvement in wind speed simulations across all cases (Section 3.3) strongly supports the reliability of our findings. While the current results are statistically robust, we may extend simulations to other months to further validate the general applicability of our z₀ dataset under varying climatic conditions.
  
  We hope that we have addressed your concerns. We remain open to further feedback and are committed to improving the quality of our work.
  
  Thank you very much!
  
  Sincerely,
  
  Jiamin Wang and Kun Yang
  
  On behalf of all co-authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC1
  - CC2: 'Reply on AC1', Cheng Shen, 06 May 2025
    
    Thank you very much for your detailed and comprehensive responses to my comments. Your clarifications significantly improve my understanding of your methodology and your results. I suggest explicitly mentioning this broader implication in your discussion to further strengthen the reliability and applicability of your findings.
    Again, thank you for addressing my concerns thoroughly.
    
    Citation: https://doi.org/10.5194/egusphere-2025-1513-CC2
RC1:
'Comment on egusphere-2025-1513', Anonymous Referee #1, 27 May 2025

This study estimated the aerodynamic roughness length (Z0) values using ERA5 analyses and weather station observations to improve the near-surface wind speed modeling. Technically, the Random Forest Regression algorithm is suitable for the estimation of Z0, and the results are encouraging, significantly improving the wind speed simulation in the WRF model. However, the evaluation of the improved Z0 on the WRF near-surface wind simulation was only for one month, and a longer time evaluation is needed. Therefore, I recommend Major Revision in this round.
Major comments:
1. The new estimated Z0 values were only evaluated for 1 month. A longer time evaluation should be conducted for a thorough evaluation.
2. The grid-based Z0 statistics are only available in the inner domain. This indicates that the Z0 could only be improved where there are surface weather station observations. How to improve the Z0 destination in areas where there is no good coverage of surface weather station observations? More discussions should be included.
Minor comments:
Line 39-41: It is a little bit causing here. Please revise it to be more clear.
Line 47: ERA5 is the analysis from a DA system. In my opinion, it is the blend of observations and model forecasts. Therefore, it is not proper to use it as an example.
Line 54: What does it mean here 'low-type' and 'high-type'?
Line 87: Better to add surface weather station observations before CMA.
Line 192: This could be because of the altitude differences between observation sites and the model terrain.
Line 227: What is the temporal coverage of this monthly Z0 dataset?
Figure 5: better to a reference line of y = 0 in panel (c) for reference, indicating which has a smaller bias.
Line 317: The values are significantly large when verified against the Mean values. However, if you take a deep look at Fig. 7d, the improvements are not that large from the perspective of MAB and RMSE.
Figure 7: Better to add staticts of mean/rms/r in the panels of (a). For (d), the units of MAB is not m/s, likely %.

Citation: https://doi.org/10.5194/egusphere-2025-1513-RC1
- AC2: 'Reply on RC1', Kun Yang, 06 Jun 2025
  
  We would like to express our sincere gratitude to you for your insightful comments and suggestions. Detailed responses are provided in the Supplement below.
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC2
RC2:
'Comment on egusphere-2025-1513', Anonymous Referee #2, 27 May 2025

General Comment:
This manuscript presents a novel and practical approach to improving the simulation of near-surface wind speed over built-up areas by refining the aerodynamic roughness length (𝑧₀) using a combination of ERA5 reanalysis and ground-based observations from the China Meteorological Administration (CMA). The authors developed a high-resolution monthly gridded 𝑧₀ dataset by applying a Random Forest Regression algorithm, and demonstrated its effectiveness through WRF simulations. The study is timely and potentially impactful for urban climate modeling and wind-related applications.
While the manuscript introduces a potentially useful methodology, the current version does not provide sufficient critical evaluation or methodological transparency. To be suitable for publication, the manuscript requires revision, including clarification of the observational setup, deeper theoretical consideration of the methodology's assumptions, and further analyses related to model resolution and 𝑧₀ scale dependency.
Major comments:
(1) Uncertainty about CMA Wind Observation Heights: The manuscript assumes that CMA stations provide 10-m wind speed observations. However, there is no clear documentation or justification of this assumption in the text. Are all CMA anemometers calibrated and installed precisely at 10 m above ground level? Given that the accuracy of 𝑧₀ estimation strongly depends on the reference height of the wind speed, this should be clarified and supported by official metadata or references. Otherwise, the credibility of the derived 𝑧₀ values may be significantly undermined.
(2) Circular Logic in Using ERA5 to Derive 𝑧₀ and Then Evaluating WRF Performance: The method uses ERA5 as the basis to derive optimal 𝑧₀ values, and then uses these 𝑧₀ values in WRF to simulate wind fields, which are subsequently compared to CMA observations. However, since the 𝑧₀ is essentially tuned to ERA5 wind characteristics, and WRF is driven by ERA5 data, it is not surprising that the WRF simulations become closer to observations. This circular logic reduces the strength of the validation. A deeper discussion is needed in the Discussion section to acknowledge this methodological dependency and to better clarify to what extent the improvements stem from 𝑧₀ refinement as opposed to alignment with the reanalysis base.
(3) Lack of Resolution-Dependent 𝑧₀ Consideration: The aerodynamic roughness length is known to be resolution-dependent due to varying representations of land cover and orography. However, the manuscript does not address why a single 𝑧₀ value (derived from coarser ERA5 resolution) is applied across finer-resolution WRF simulations. A justification is needed as to why scale-dependent roughness parameters were not considered, especially when moving from ERA5 (∼30 km) to WRF (3 km). Moreover, higher-resolution simulations are expected to better resolve local features influencing 𝑧₀. Has the relationship between horizontal resolution and 𝑧₀ been explored in this study? Such an analysis would greatly strengthen the work, and I recommend adding or expanding this aspect if possible.

Citation: https://doi.org/10.5194/egusphere-2025-1513-RC2
- AC3: 'Reply on RC2', Kun Yang, 06 Jun 2025
  
  We would like to express our sincere gratitude to you for your insightful comments and suggestions. Detailed responses are provided in the Supplement below.
  
  Citation: https://doi.org/10.5194/egusphere-2025-1513-AC3
AC4: 'Clarification on corrections to EGUSPHERE-2025-1513', Kun Yang, 11 Nov 2025

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1513/egusphere-2025-1513-AC4-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2025-1513-AC4

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

AR by Kun Yang on behalf of the Authors (03 Jul 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (07 Jul 2025) by Guoqing Ge

RR by Anonymous Referee #3 (03 Sep 2025)

RR by Anonymous Referee #4 (21 Sep 2025)

Suggestions for revision or reasons for rejection

“Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in built-up regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0” by Jiamin Wang, Kun Yang, Jiarui Liu, Xu Zhou, Xiaogang Ma, Wenjun Tang, Ling Yuan, and Zuhuan Ren

Recommendation: Minor revisions

General comments
This manuscript introduced an approach to estimate roughness length at CMA weather stations (z0_CMA) that minimizes differences in 100-m wind speed (u100) between ERA5 (u100_ERA5) and CMA (u100_CMA) stations (Equations 2 and 3), using the wind profile described by Monin-Obukhov similarity theory flux-profile relationship in neutral conditions (Equation 1). They assumed differences in near-surface wind speed between ERA5 and CMA stations are mainly due to z0 and the influence of z0 diminishes with height (i.e., u100_ERA5 ~ u100_CMA) (assumption 1), and the impact of atmospheric stability on wind speed is identical between ERA5 and CMA (assumption 2) (Lines 122–125). The estimated station-wise z0 (z0_CMA) was then used to derive a gridded z0 dataset using a random forest regression algorithm, i.e., z0_RFR, which improved near-surface wind simulations in the WRF model compared to those simulations with other static z0 values.
This manuscript is well written and organized. This manuscript not only provides a method to estimate roughness length at measurement stations but also suggests a potential way to provide a gridded dataset that can be applied to numerical simulations, which well fits the scope of this journal. I do not 100% agree with the authors on the two assumptions that they used to derive z0_CMA (assumptions 1 and 2 above). I think impacts of the validity of these two assumptions are topics to be discussed and further studied (it would be good if this issue is briefly discussed in the manuscript), but they don’t need to be addressed in the current manuscript. Below I have several minor comments and suggestions.
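The estimation idea summarized in the general comments can be sketched for a single pair of wind speeds. This is a minimal illustration under stated assumptions, not the manuscript's Equations 1–3 (which are not reproduced on this page): a neutral logarithmic profile, one observed near-surface wind, and one reference wind aloft that is taken as unaffected by local roughness. The function names and the example values are hypothetical.

```python
import math

def z0_from_wind_ratio(u_obs, u_ref, z_obs=10.0, z_ref=100.0):
    """Roughness length (m) for which a neutral log profile passing through
    u_obs at height z_obs reaches u_ref at height z_ref."""
    if u_obs <= 0 or u_ref <= u_obs or z_ref <= z_obs:
        raise ValueError("need 0 < u_obs < u_ref and z_obs < z_ref")
    r = u_ref / u_obs
    # r = ln(z_ref / z0) / ln(z_obs / z0)
    # => ln(z0) = (ln(z_ref) - r * ln(z_obs)) / (1 - r)
    log_z0 = (math.log(z_ref) - r * math.log(z_obs)) / (1.0 - r)
    return math.exp(log_z0)

def log_profile_wind(u_obs, z0, z_obs=10.0, z=100.0):
    """Extrapolate u_obs from z_obs to height z with a neutral log profile."""
    return u_obs * math.log(z / z0) / math.log(z_obs / z0)

# Hypothetical values: 4 m/s observed near the surface, 6 m/s at 100 m.
z0_nominal = z0_from_wind_ratio(u_obs=4.0, u_ref=6.0)             # sensor at 10 m
z0_if_8m = z0_from_wind_ratio(u_obs=4.0, u_ref=6.0, z_obs=8.0)    # sensor at 8 m
print(z0_nominal, z0_if_8m, log_profile_wind(4.0, z0_nominal))
```

With these illustrative inputs the derived value is about 0.1 m for a 10-m sensor and about 0.05 m if the same reading came from 8 m, which shows how sensitive such an estimate is to the assumed observation height and to the two assumptions questioned above.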

Minor comments
Lines 92–93, “variance of the slope”: Could you clarify what data you used to derive this variable with its spatial resolution (e.g., 3 arcsec SRTM)?
Line 100, “z0 dataset at a spatial resolution of 0.01°×0.01°”: Why did you select this spatial resolution?
Line 206, “all stations are situated in build-up areas”: Not all stations seem to be situated in built-up areas. Figure 2c shows that some stations fall in cropland categories. Am I missing something?
Lines 210–211, “the robust consistency in the relationship between z0 and wind speed confirmed the reasonableness of z0_optimal”: I think the robust consistency in the relationship between z0_optimal and wind speed is an expected outcome because you used equation 1, which relates z0 and wind speed, to derive z0_CMA, which was in turn used to derive z0_optimal. In that regard, I’m not sure how the robust relationship between z0 and wind speed can be related to the reasonableness of z0_optimal.
Line 215, Figure 1: It is hard to read Figures 1a–1c. I think using color scales that are more consistent with numerical values (e.g., bluish/reddish colors for negative/positive values) would help.
Line 230, “categores”: “categories”.
Lines 333–334, “the resulting gridded z0 dataset significantly reduces uncertainties ~ particularly over relatively flat built-up areas”: The WRF model has various urban canopy model (UCM) options, which parameterize the effects of urban topography by updating surface drag, etc. These include, for example, a single-layer UCM (WRF UCM option 1) and a building effect parameterization (BEP; WRF UCM option 2). Considering that the impact of the updated roughness length dataset is mainly over urban areas, could you explain the advantage of using the updated dataset instead of a UCM? Also, could you compare the impact of z0_optimal with the impact of using UCMs?

RR by Ye Liu (01 Oct 2025)

ED: Reconsider after major revisions (06 Oct 2025) by Guoqing Ge

AR by Kun Yang on behalf of the Authors (28 Oct 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (31 Oct 2025) by Guoqing Ge

RR by Anonymous Referee #3 (10 Nov 2025)

RR by Anonymous Referee #4 (12 Nov 2025)

ED: Publish subject to minor revisions (review by editor) (23 Nov 2025) by Guoqing Ge

AR by Kun Yang on behalf of the Authors (23 Nov 2025) Author's response Author's tracked changes Manuscript

ED: Publish as is (24 Nov 2025) by Guoqing Ge

AR by Kun Yang on behalf of the Authors (02 Dec 2025) Author's response Manuscript

Journal article(s) based on this preprint

16 Dec 2025

Jiamin Wang, Kun Yang, Jiarui Liu, Xu Zhou, Xiaogang Ma, Wenjun Tang, Ling Yuan, and Zuhuan Ren

Geosci. Model Dev., 18, 10077–10094, https://doi.org/10.5194/gmd-18-10077-2025, 2025

Supplement

https://doi.org/10.5194/egusphere-2025-1513-supplement

Model code and software

all codes Jiamin Wang and Kun Yang https://doi.org/10.5281/zenodo.15108200

Viewed

Total article views: 2,235 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
2,092	109	34	2,235	54	31	44

Views and downloads (calculated since 23 Apr 2025)

Month	HTML	PDF	XML	Total
Apr 2025	94	12	3	109
May 2025	107	18	7	132
Jun 2025	63	10	14	87
Jul 2025	47	6	1	54
Aug 2025	367	13	2	382
Sep 2025	1,253	10	0	1,263
Oct 2025	82	11	2	95
Nov 2025	46	19	4	69
Dec 2025	33	10	1	44

Viewed (geographical distribution)

Total article views: 2,153 (including HTML, PDF, and XML) Thereof 2,153 with geography defined and 0 with unknown origin.

Latest update: 16 Dec 2025

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary

Aerodynamic roughness length (z₀) is a key parameter determining wind profiles in models, but most models neglect urban effects. We proposed a low-cost method to estimate z₀ at weather stations in built-up areas across China, and then developed a z₀ dataset. Tests in the Weather Research and Forecasting model show that it significantly improves the simulation accuracy of wind speed at both 10-m and 100-m heights, supporting urban planning, air quality management, and wind energy projects.