A comparison of straightray and curvedray surface wave tomography approaches at nearsurface studies
 ^{1}Politecnico di Torino, Turin, 10129, Italy
 ^{2}Delft University of Technology, Delft, 2628 CN, Netherlands
 ^{1}Politecnico di Torino, Turin, 10129, Italy
 ^{2}Delft University of Technology, Delft, 2628 CN, Netherlands
Abstract. Surface waves are widely used to model shearwave velocity of the subsurface. Surface wave tomography (SWT) has recently gained popularity for nearsurface studies. Some researchers have used straightray SWT in which it is assumed that surface waves propagate along the straight line between receiver pairs. Alternatively, curvedray SWT can be employed by computing the exact paths between the receiver pairs. SWT is a wellestablished method in seismology and has been employed in numerous seismological studies. However, it is important to make a comparison between these two SWT approaches for nearsurface applications since the amount of information and the level of complexity in nearsurface are different from seismological studies. We apply straightray and curvedray SWT to four nearsurface examples and compare the results in terms of the quality of the final model and the computational cost.

Notice on discussion status
The requested preprint has a corresponding peerreviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint
(2598 KB)

The requested preprint has a corresponding peerreviewed final revised paper. You are encouraged to refer to the final revised version.
 Preprint
(2598 KB)  BibTeX
 EndNote
 Final revised paper
Journal article(s) based on this preprint
Mohammadkarim Karimpour et al.
Interactive discussion
Status: closed

RC1: 'Comment on egusphere2022279', Anonymous Referee #1, 09 Jun 2022
The submitted manuscript focuses on comparing the straightray and curvedraybased surface wave tomography using four data sets under the nearsurface condition. The surface wave tomography technique is a wellestablished method in regional and global seismology, introducing this technique to nearsurface applications could be beneficial for accurately investigating the shallow target, and this topic is within the scope of the journal. However, some points should be addressed before a possible publication. Moreover, there are lots of typos in the manuscript that should be revised properly. The English also needs improvement. I would recommend moderate revision before publication.
Following, I listed some comments regarding the manuscript.
 Line 75, Page 3. Here the authors state the Vp and density are assumed to be known as prior information but in the result section (such as in the figure caption of Fig 2) the term “Poisson ratio” is used instead of the Vp. It is recommended to keep the full text consistent.
 Section 3.1. It is recommended to provide the elastic parameters of the Blocky model to help readers better understand what the model is like.
 Line 115, Page 5. It may simply be stated that 16 shots were chosen to generate the raw data after optimized design. The 441 shots may make the readers confusing.
 Line 130, Page 6. What do the red text ‘a’, ‘b’, and ‘c’ and red arrows in Fig 1a represent? They should be demonstrated in the figure caption.
 Line 140, Page 7. “The same initial model is used as the starting model for the SWT inversion in both straight and curved ray methods”. It is recommended to show the readers the iteration curves and inversion parameters (damping factor and weights used in the regularization matrix). In fact, the inversion parameters of the two inversions should be the same for the sake of comparison.
 Figure 6. For the current version, the red arrow and text are confusing for the readers. It is recommended to look for a better way to show the plot.
 Line 180, Page 12. It is not easy for readers to identify that the boundary in the curvedray tomogram is clear than the one in the straightray tomogram. Providing the model error in this area might be better to support this conclusion.
 Figure 10. Again, it is recommended to provide the iteration curves and inversion parameters. It seems that the inversion using the straightray method becomes unstable and there are some outliers in the tomogram. Do these two methods use the same weights in the smooth regularization?
 For some plots, such as in Figure 12, it is recommended to indicate which data set the plot is related to in the figure caption.
 Line 280, Page 20. Please delete the redundant sentence “For each parameter, the values …”
 Table 3. The abbreviation CR and SR should be mentioned in the context, also, the formula for calculating CRSR should be demonstrated.
 Line 295, Page 21. Again, please delete the redundant sentence “For each parameter, the values …”
 Please check the sentence above section 4.3: Table 3 that using curvedray SWT has increases the computational cost by an average of 23 %.

RC3: 'Reply on RC1', Emanuel Kästle, 10 Jun 2022
A short note on comment 8/Fig 10: I also thought that maybe the straight ray ray tomography became unstable which is causing these extreme velocity jumps. But I think it could be an actual effect only caused by the 'wrong' ray paths. Since the straight rays are forced through the slow, central part of the medium (while the true rays curve around and never saw any of the slow medium), the inversion needs to compensate by putting higher velocities in other grid cells, which results in the strong irregularities. I would suggest that the authors discuss this effect and add the information required by Reviewer 1.
 AC4: 'Reply on RC3', Mohammadkarim Karimpour, 03 Aug 2022

AC1: 'Reply on RC1', Mohammadkarim Karimpour, 03 Aug 2022
Dear Reviewer,
Thank you for your comments and suggestions. We have corrected the typos and provided a response for each of your comments.
 Following this suggestion, we have modified the text and used “Poisson ratio” to keep the full text consistent.
 Following this comment, we have provided the elastic parameters of the Blocky model in Table 1 of the revised version of the manuscript.
 We have removed the sentence “To optimize the shot locations, we defined 441 shots …” to avoid confusing the readers. In the revised version of the manuscript, we have added Section 2.1 to clarify the employed criteria to pick the shot positions.
 We have added the following explanation in the caption to clarify the purpose of using the red arrows “a”, “b”, and “c”:
“The red arrows and letters represent the location of 2D slices in subplots (bd).”
 We have added the iteration curves for all the examples in Figures 3b, 6b, 9, and 12 of the revised version of the manuscript. We have added the clarifying sentence that all the inversion parameters are kept the same for the sake of comparison in lines 185187 of the revised version of the manuscript as:
“For each example, the straightray and curvedray SWT inversions start from the same initial model. Other inversion parameters (C_Rpand λ) are also the same for the sake of comparison.”
 We received similar comment from Reviewer#3 (Fabrizio Magrini). To avoid confusion, we have removed the red arrows from the reconstructed VS models from the inversion (Figures 4, 7, 10, and 13 of the revised version of the manuscript). The intention to put them was to clarify the position of each crosssection but it seemed to add confusion for the readers.
 The distribution of the model error is displayed in the attaches supplement file.
As shown is in the figure of the supplement file, the difference between the model errors from the straightray and curvedray is very small. We have not included this figure in the text of the manuscript because it does not show a clearer difference between straight and curvedray approaches than what has been already shown in Fig. 7 of the revised version of the manuscript. We have compared the model errors in Table 4, which shows that curvedray approach has produced a more accurate model. This agrees with what we had stated in the original manuscript lines 177178, that is the difference between the VS model from straightray and curvedray approaches in this example is not significant:
“The vertical slices at X (Figure 6a and d) and Y directions (Figure 6b and e) do not display significant differences.”
We have also modified these sentences in lines 260262 of the revised version of the manuscript for clarification:
“The areas marked in dashed black in the horizontal slices (Fig. 7c and f) shows that the boundaries of the anomaly are slightly clearer in the curvedray approach (Fig. 7f) and also the VS values in these areas are closer to the true VS value (Fig. 5d).”
 We have responded to this comment in the responses to comment#5. We have added the iteration curve for this example. As we explained in the response to comment#5, the inversion parameters are the same for both straightray and curvedray methods. Moreover, the same regularization values (i.e., same weights) have been used for both approaches.
As commented correctly by Reviewer#2 (Emanuel Kästle), the observed difference is due to the ‘wrong’ ray paths in the straightray approach. At the edges of the velocity anomaly, the assumed paths by the straightray are shorter than the true paths and therefore the velocities are (wrongly) high.
We have added the following explanation to the text (lines 324327 of the revised version of the manuscript) to clarify this issue:
“The black arrow in Fig. 13c shows the high velocity cells inside the loose sand body in the retrieved model from the straightray SWT. The reason is that in this area (close to the interface of the sand body and the background medium) the assumed paths in the straightray approach are much shorter than the true paths and therefore, the obtained VS from the inversion becomes unrealistically high.”
 We have added this clarification in Figures 1517 of the revised version of the manuscript.
 We have deleted the redundant sentence from the revised version of the manuscript.
 We have shown the difference between the parameters from curvedray and straightray by , added Eq. (16) to show how to compute this parameter, and updated Table 4 in the revised version of the manuscript. We have clarified it in lines 379383 of the revised version of the manuscript.
 We have removed the redundant sentence.
 We have corrected the sentence and deleted the redundant phrase.

RC2: 'Comment on egusphere2022279', Emanuel Kästle, 10 Jun 2022
Dear Editor,
Dear Authors,the manuscript entitled "A comparison of straightray and curvedray surface wave tomography approaches at nearsurface studies" provides an informative study that is certainly of interest to tomographers who consider applying similar methods. The manuscript is clearly structured and generally well written. The applied methods are appropriate and the authors' conclusions are supported by the results of the two synthetic and of the two real data examples. Nevertheless, there are a couple of points that, in my view, need to be addressed before publication in Solid Earth.
Remarks
In the introduction, a few other studies are mentioned that also apply straight and/or curvedray tomography, but it is not clear for most of these studies whether they performed a comparison of the thwo approaches. There should be an overview of what the conclusions from other authors on the topic were. I am also missing a (short) discussion at the end of the manuscript on whether the results agree with existing literature.
The abstract needs to be rewritten. It gives a very brief introduction and motivation for the study. But an abstract should summarize the key results of the study. The same applies to the conclusions section which should be rewritten.
One of your key results is that in a scenario with low data coverage, the curvedray approach performs significantly better. But your synthetic examples do not prove that since you run no example with low coverage. I would suggest that you take one of your two synthetic tests and test whether this conclusion holds.
l. 10 "exact paths"  the term 'exact' is quite vague. In this case you calculate Eikonal paths since they are the solution to the Eikonal equation. These represent an approximation and not necessarily the true/exact paths.
l. 35 So did Trampert and Spetzler find a difference between the raybased and the finitefrequency approach?
l. 38 It sounds like Gouedard did not perform any comparison between straight and Eikonal ray based models. So why cite them here?
l. 5056 You present several studies where some applied some form of ray tracing and others didn't. But what is your point? Did these studies find any advantage in raytracing? If your point is that researchers have applied different methods to approximate the rays but noone has done a systematic study, then you should write it like that.
l. 70 "shots are defined as a regular grid" sounds wrong to me. I would rather write something like "shot locations located on a regular grid are tested by calculating the number of aligend receivers for each location."
l. 71 How many shots are picked, what are the criteria for the number of picked shot locations?
l. 71 I assume this approach only applies to receiver layouts on a grid and not in case of irregular/random receiver locations? Maybe you should say so.
l. 75 Which values for Vp and rho are you assuming in your study? How are these values chosen?
l. 7680 I think the way you describe the procedure is a bit complicated. Bascially, you take your 3D model defined by Vs,Vp and rho and extract 1D depth profiles at each point of the model. You then calculate the phase dispersion curve for each profile and join all the dispersion curves to get 2D phaseslowness maps at a set of periods (at how many periods, how do you choose the periods?). The ray tracing is then done in each of the phaseslowness maps separately. I would suggest to rewrite this paragraph.
l. 99, 102 "uneven sampling", "nonuniform sampling"; it would be helpful to your readers if you could say more precisely what you mean. If I understand correctly, it is that the number of samples is not the same at different wavelengths, i.e. periods?
You should also mention how you sample your dispersion curves. From the images, it looks like you have a uniform sampling in frequency. This means that you implicitly put a higher weight on high frequencies (a uniform sampling in period would imply a higher weight on low frequencies). Many researchers therefore apply a logspaced sampling.
l. 105 "sigma_i,j is the standard deviation of the ith data point of the jth dispersion curve". I think there is a mistake in that description. In your matrix, only the trace is nonzero. Your sentence would then imply that for each dispersion curve, you only have a single measurement. Instead, I think that you have several measurements (at a set of periods) for each dispersion curve, so that the measurements from disp curve 1 have standard deviations sigma_1,1 to sigma_n,n, and from disp curve 2 from sigma_n+1,n+1 to sigma_n+m,n+m, and so on...
l. 109 What is meant by "closest data point"? Please explain. Also, why the delta in delta lambda_j,max if it is the maximum wavelength? (from the delta I would expect a difference)
l. 110 Did you add any error to your synthetic measurements? Please mention in the text.
l. 150 It would be good to give a more quantitative measure for the quality of reconstruction, fo example by providing the variance reduction or simply the misfit to the input model. (I just saw that you did in table 3, I would suggest that you write down these values here or refer to table 3).
l. 199 Are the values of nu and rho in your Table 2 fixed during the inversion? Please mention somewhere in the text. What influence do you expect from the potential errors in these values?
l. 214 What value for the data standard deviation (sigma, eq 4) do you assume in your synthetic tests and in the real data examples? Is sigma individually determined for each measurement?
l 324 You weighting is based on the wavelength of the signals, but at the same time you argue with the lower number of data at long wavelengths. So should the weight not rather be based on the number of measurements at each frequency? Or, putting the question differently, if I have a dataset with exactly the same number of measurements at each frequency (as is probably the case in your synthetic experiment), do I still need the weighting?
I would like to refer again to my previous comment on the importance of the sampling of your dispersion curves. If you use a uniform sampling in frequency, it is clear to me that the low frequency measurements are underweighted. Maybe run a test with log sampling and compare to the results in Fig. 14.Fig. 7 There should be a scale on (b). Why is the panel in (d) cropped? It seems that some dispersion curves go also to values slower than 50 m/s.

AC2: 'Reply on RC2', Mohammadkarim Karimpour, 03 Aug 2022
Thank you for your comments and suggestions. We have replied to all your comments below. In the following, we have copied your comments and assigned a number to each of them (for clarification) and provided the explanations for every comment.
Remarks
Comment#1: In the introduction, a few other studies are mentioned that also apply straight and/or curvedray tomography, but it is not clear for most of these studies whether they performed a comparison of the thwo approaches. There should be an overview of what the conclusions from other authors on the topic were. I am also missing a (short) discussion at the end of the manuscript on whether the results agree with existing literature.
Response#1: We have modified the Introduction and added the following sentences to clarify this issue and provide an overview of what the conclusion from other authors on the topic were in the lines 3753 of the revised version of the manuscript as:
“SWT has been used in seismological studies for decades and different SWT approaches have been compared by seismologists. For instance, Laske (1995) studied deviations from straight line in the propagation of longperiod surface waves and concluded that they usually have small effects on the propagation phase. Spetzler et al. (2001) applied both straightray and curvedray SWT methods. They computed the maximum deviations of ray paths from straight lines and pointed out that this maximum is typically below the estimated resolution, except for long paths at short periods. Some studies showed that a more complex forward modelling in SWT did not improve the results (Sieminski et al., 2004; Levshin et al., 2005) while other studies reported obtaining better results (Ritzwoller et al., 2002; Yoshizawa and Kennett, 2004; Zhou et al., 2005). Trampert and Spetzler (2006) pointed out that the choice of regularization has a major impact on SWT results. They studied SWT methods based on ray theory (straightray and curvedray) and scattering theory in which the integral along the ray path is replaced by the integral over an influence zone. They showed that these methods are statistically alike and any model from one method can be obtained by the other one by changing the value of the regularization. They concluded that the only option to increase the resolution of the model is to increase and homogenize the data coverage. Bozdag and Trampert (2008) compared straightray and curvedray SWT methods in their study and mentioned that performing ray tracing could be so timeconsuming that the potential gain in crustal corrections on a global scale might not be worth the additional computational effort imposed by ray tracing. Despite seismological studies, a comparison between the performance of straightray and curvedray SWT at the nearsurface scale is missing.”
We have also added Section 4.5 to compare the results of our study with previous studies.
Comment#2: The abstract needs to be rewritten. It gives a very brief introduction and motivation for the study. But an abstract should summarize the key results of the study. The same applies to the conclusions section which should be rewritten.
One of your key results is that in a scenario with low data coverage, the curvedray approach performs significantly better. But your synthetic examples do not prove that since you run no example with low coverage. I would suggest that you take one of your two synthetic tests and test whether this conclusion holds.
Response#2: We have rewritten the Abstract and Conclusions Sections. We have considered to better highlight the motivation and the key results of the study in the revised version. We have added several lines (1419 of the revised version of the manuscript) at the end of the Abstract to highlight the key results as:
“In three examples we optimise the shot positions to obtain acquisition layout which can produce high coverage of dispersion curves. In the other example, the data have been acquired using a typical seismic exploration 3D acquisition scheme. We show that if the source positions are optimised, the straightray can produce Swave velocity models similar to the curvedray SWT but with lower computational cost. Otherwise, the improvement of inversion results from curvedray SWT can be significant.”
We have rewritten the whole Conclusions Section as well.
Regarding the data coverage, we have made several changes to explain this issue more clearly. As we have stated in line 1418 of the revised version of the manuscript, for three examples in this study (the Blocky model, Sand Bar mode, and Pijnacker example) we have optimized the shot positions to obtain high data coverage. We have devoted Section 2.1 to describe the employed algorithm to optimize the source positions. For case of the CNR example, the acquisition layout mimics at a smaller scale the classical seismic exploration 3D crossspread acquisition scheme with orthogonal lines of sources and receivers. This dataset, not being optimised will help analysing the criticalities introduced by a nonoptimal acquisition scheme. We believe that the subject of the data coverage has been more clearly explained in the revised version and therefore, we did not repeat a test with one of the synthetic examples with lower data coverage (because the problem is not only about having a low data coverage. It’s about the low data coverage that may be obtained from a typical exploration acquisition scheme)
Comment#3: l. 10 "exact paths"  the term 'exact' is quite vague. In this case you calculate Eikonal paths since they are the solution to the Eikonal equation. These represent an approximation and not necessarily the true/exact paths.
Response#3: We have removed the term “exact” and modified the sentence in lines 910 to:
“Alternatively, curvedray SWT can be employed by computing the paths between the receiver pairs using a raytracing algorithm.”
Comment#4: l. 35 So did Trampert and Spetzler find a difference between the raybased and the finitefrequency approach?Response#4: We have added the following explanation to show the highlights of the study by Trampert and Spetzler (2006) in lines 4449 of the revised version of the manuscript:
“Trampert and Spetzler (2006) pointed out that the choice of regularization has a major impact on SWT results. They studied SWT methods based on ray theory (straightray and curvedray) and scattering theory in which the integral along the ray path is replaced by the integral over an influence zone. They showed that these methods are statistically alike and any model from one method can be obtained by the other one by changing the value of the regularization. They concluded that the only option to increase the resolution of the model is to increase and homogenize the data coverage.”
Comment#5: l. 38 It sounds like Gouedard did not perform any comparison between straight and Eikonal ray based models. So why cite them here?Response#5: Following this suggestion, that work (Gouedard et al., 2010) has been deleted from the Introduction in the revised version of the manuscript.
Comment#6: l. 5056 You present several studies where some applied some form of ray tracing and others didn't. But what is your point? Did these studies find any advantage in raytracing? If your point is that researchers have applied different methods to approximate the rays but noone has done a systematic study, then you should write it like that.Response#6: We have rewritten the Introduction and devoted a paragraph to refer to the previous seismological studies where these two approaches were compared. We have clarified in lines 5253 of the revised version of the manuscript that such comparison is missing at the nearsurface scale. This is one of the key points of this study to compare the two methods at the nearsurface scale.
Comment#7: l. 70 "shots are defined as a regular grid" sounds wrong to me. I would rather write something like "shot locations located on a regular grid are tested by calculating the number of aligend receivers for each location."Response#7: Following this comment, we have removed “shots are defined as a regular grid”. In the revised version of the manuscript, the Method Section has been expanded largely as suggested by Reviewer#3. We have dedicated Section 2.1 to explain the employed procedure to pick the shot positions.
Comment#8: l. 71 How many shots are picked, what are the criteria for the number of picked shot locations?Response#8: As mentioned in the response to the previous comment, we have explained the criteria to pick the shot positions in Section 2.1 of the revised version of the manuscript.
Comment#9: l. 71 I assume this approach only applies to receiver layouts on a grid and not in case of irregular/random receiver locations? Maybe you should say so.Response#9: It can be applied also to irregular/ random receiver locations. We have used the guidelines proposed by Da Col et al. (2020). In that study, the receivers are not put as a regular grid.
We have clarified it in Section 2.1, lines 8788 of the revised version of the manuscript:
“For a given (random or regular) array configuration, we can optimise the locations of shots to ensure having high coverage DCs with minimum number of shots based on the guidelines by Da Col et al. (2020).”
Comment#10: l. 75 Which values for Vp and rho are you assuming in your study? How are these values chosen?Response#10: As requested by Reviewer#1, we have changed Vp to Poisson ratio for the sake of consistency. For the synthetic examples, we have used the true values of Poisson ratio and densities, as mentioned in lines 216217 and 247248 of the revised version of the manuscript. For the Pijnacker example, we have added the following clarification in lines 279280 of the revised version of the manuscript as:
“Since the medium was (almost) saturated, a high ν value (0.45) was chosen for the initial model. The ρ values in the medium were assumed to be low (1700 kg m^{3}) because it consisted of unconsolidated materials.”
In case of the CNR example, the following clarification has been added in lines 309310 of the revised version of the manuscript:
“… ν is approximated based on a previous study (Khosro Anjom et al., 2019) on the site and fixed at 0.33, and density is fixed at 2000 kg m^{3} since the site mainly consists of loos sand material.”
Comment#11: l. 7680 I think the way you describe the procedure is a bit complicated. Bascially, you take your 3D model defined by Vs,Vp and rho and extract 1D depth profiles at each point of the model. You then calculate the phase dispersion curve for each profile and join all the dispersion curves to get 2D phaseslowness maps at a set of periods (at how many periods, how do you choose the periods?). The ray tracing is then done in each of the phaseslowness maps separately. I would suggest to rewrite this paragraph.Response#11: We have deleted this paragraph from the manuscript. Instead, we have expanded the Method Section as suggested by Reviewer#3. We have provided more details and clarifications in the Method Section of the revised version of the manuscript.
Comment#12: l. 99, 102 "uneven sampling", "nonuniform sampling"; it would be helpful to your readers if you could say more precisely what you mean. If I understand correctly, it is that the number of samples is not the same at different wavelengths, i.e. periods?Response#12: We did not mean that the number of samples is not the same at different wavelength. We estimated the DCs in the frequency domain and each DC is sampled uniformly in frequency (delta_f is constant). This means that for each DC, the adjacent sampled points have the same “delta_f” but not the same wavelength difference. In fact, the portion of the DC with lower wavelength are more sampled than the portion with higher wavelength.
We have modified this part of the manuscript and added the following explanation to clarify this subject in lines 172175 of the revised version of the manuscript:
“To estimate the DCs from raw data, we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency. This means that each DC is nonuniformly sampled in terms of wavelength which can drive the inversion algorithms to the shallowest part of the subsurface without any significant updates in the deeper portion of the initial velocity model (Khosro Anjom and Socco, 2019).”
Comment#13: You should also mention how you sample your dispersion curves. From the images, it looks like you have a uniform sampling in frequency. This means that you implicitly put a higher weight on high frequencies (a uniform sampling in period would imply a higher weight on low frequencies). Many researchers therefore apply a logspaced sampling.Response#13: We have mentioned in lines 172173 of the revised version of the manuscript that we sample each DC uniformly in frequency:
‘”… we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency.”
We have added explanations regarding the employed processing tool (the autopicking code described in Papadopoulou, 2021) and devoted a subsection (2.2) to explain the applied methodology to estimate the DCs from the raw data. As this code samples each DC uniformly in frequency, we impose the wavelengthbased weights to increase the weights of the points with lower wavelength.
Comment#14: l. 105 "sigma_i,j is the standard deviation of the ith data point of the jth dispersion curve". I think there is a mistake in that description. In your matrix, only the trace is nonzero. Your sentence would then imply that for each dispersion curve, you only have a single measurement. Instead, I think that you have several measurements (at a set of periods) for each dispersion curve, so that the measurements from disp curve 1 have standard deviations sigma_1,1 to sigma_n,n, and from disp curve 2 from sigma_n+1,n+1 to sigma_n+m,n+m, and so on...Response#14: In fact, we agree that the Eq. 4 in the original version (equivalent of Eq. (12) in the revised version of the manuscript) could be confusing. To clarify, we have modified the previous equation to:
Comment#15: What is meant by "closest data point"? Please explain. Also, why the delta in delta lambda_j,max if it is the maximum wavelength? (from the delta I would expect a difference)
Response#15: The weights in Eq. (13) in the revised version of the manuscript, are computed separately for each DC. For each point of the DC, a wavelength value can be computed based on its phase velocity and frequency. For the generic i^{th} point of the j^{th} DC, the “closest data point” is defined as the point from which the i^{th} point has the smallest wavelength distance. delta lambda_j,max represents the maximum computed wavelength difference for the j^{th} DC. We have clarified these terms in lines 182183 of the revised version of the manuscript as:“where delta lambda_i,j represents the wavelength difference between the data point i of the j^{th} DC and the data point with the smallest wavelength distance from i, and delta lambda_j,max is the maximum computed wavelength difference for the j^{th} DC.”
Comment#16: l. 110 Did you add any error to your synthetic measurements? Please mention in the text.Response#16: We have not added any error to the synthetic data. We have clarified it in lines 197 and 239 of the revised version of the manuscript.
Comment#17: l. 150 It would be good to give a more quantitative measure for the quality of reconstruction, fo example by providing the variance reduction or simply the misfit to the input model. (I just saw that you did in table 3, I would suggest that you write down these values here or refer to table 3).Response#17: We have added the reference to Table 3 (equivalent of Table 4 in the revised version of the manuscript).
Comment#18: l. 199 Are the values of nu and rho in your Table 2 fixed during the inversion? Please mention somewhere in the text. What influence do you expect from the potential errors in these values?Response#18: Yes, the values of nu and rho are fixed during the inversion. We have explained it in lines 187191 of the revised version of the manuscript:
“It should be noted that only VS values are updated during the inversion and the other parameters (h, ν, and ρ) are fixed. In case of the synthetic examples, the true values of ν and ρ are used in the inversion. For the field examples, ν and ρ are approximated based on the available a priori information. Having erroneous values of ν and ρ can induce errors in the inversion results even though the sensitivity of surface waves to VS is more than ν (and way more than ρ).”
Comment#19: l. 214 What value for the data standard deviation (sigma, eq 4) do you assume in your synthetic tests and in the real data examples? Is sigma individually determined for each measurement?Response#19: We have added Eq. (2) from which the standard deviation of phase velocities are computed in lines 114115 of the revised version of the manuscript:
The proposed equation by Passeri (2019) is used to approximate the standard deviation (σ_{Vj}) of generic j^{th} element of the phase velocity vector (V_{j}) at its corresponding frequency (f_{j}) as …
Comment#20: l 324 You weighting is based on the wavelength of the signals, but at the same time you argue with the lower number of data at long wavelengths. So should the weight not rather be based on the number of measurements at each frequency? Or, putting the question differently, if I have a dataset with exactly the same number of measurements at each frequency (as is probably the case in your synthetic experiment), do I still need the weighting?Response#20: The weighting is different for each DC. The weighting is not based on the number of measurements at each frequency but rather based on the wavelength of each point of a DC. So, even in case of synthetic example, each DC is nonuniformly sampled in terms of wavelength and the wavelengthbased weighting can be applied to compensate this nonuniformity. We believe this concept has been better explained in the lines 172176 of the revised version of the manuscript as:
“To estimate the DCs from raw data, we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency. This means that each DC is nonuniformly sampled in terms of wavelength which can drive the inversion algorithms to the shallowest part of the subsurface without any significant updates in the deeper portion of the initial velocity model (Khosro Anjom and Socco, 2019). To address this issue, a wavelengthbased weighting scheme was applied in the inversion process to compensate for this nonuniformity (see Khosro Anjom et al., 2021, for details).”
Comment#21: I would like to refer again to my previous comment on the importance of the sampling of your dispersion curves. If you use a uniform sampling in frequency, it is clear to me that the low frequency measurements are underweighted. Maybe run a test with log sampling and compare to the results in Fig. 14.Response#21: As we have explained in the Section 2.2 of the revised version of the manuscript, we have used a processing tool uniformly samples the DCs in frequency. We believe that with the provided clarifying explanations on the procedure of weighting in the revised version of the manuscript, the process is described much clearer than the original version. We did not run a test with log sampling because as explained earlier, our processing tool samples DCs uniformly in frequency.
Comment#22: Fig. 7 There should be a scale on (b). Why is the panel in (d) cropped? It seems that some dispersion curves go also to values slower than 50 m/s.
Response#22: We have added the scale for the Pijnacker’s acquisition scheme (Fig. 8b of the revised version of the manuscript). We have modified Figure 8b and decreased the lower limit of the yaxis to avoid confusion.

AC2: 'Reply on RC2', Mohammadkarim Karimpour, 03 Aug 2022

RC4: 'Comment on egusphere2022279', Fabrizio Magrini, 27 Jun 2022
Dear Editor,
The manuscript "A comparison of straightray and curvedray surface wave tomography approaches at nearsurface studies" compares the effectiveness of "straightray" and "curvedray" tomography in seismic imaging tasks. The pros and cons of these two approaches can be considered very well known to the seismological community, but I am not aware of any technical study that discusses them thoroughly and systematically. I suppose that doing so was the purpose of this study.
Unfortunately, I believe that a study of this kind might only be considered useful/appropriate/worth of publication if the technicalities inherent to the methods employed are very clearly explained; this is also very important, of course, to make the work reproducible. The manuscript, however, lacks many details, and does not allow one to fully understand or reproduce the work that was carried out. I suggest that the authors revise their manuscript extensively in this sense. Moreover, the quality of English and of the figures could and should be greatly improved.
The authors will also find my detailed comments/questions below.
Best regards,
Fabrizio Magrini
Abstract: This is too general. It does not display any highlights on the results presented in the paper
Introduction: In general, two different strategies exist to "convert" surfacewave dispersion curves to a 3D Vs model. As explained in the manuscript, the first (i) involves the calculation of phasevelocity maps, which are then converted to Vs by carrying out many 1D inversions. The second, instead, (ii) allows one to invert the dispersion curves directly for the 3D model. In both cases, the data kernels can be calculated (a) by assuming that the waves travel along the greatcircle path connecting a given station pair (raytheory) or (b) by accounting for ray bending (raytracing).
In this study, you carry out a comparison between strategies (a) and (b). Is there a reason why you don't contemplate strategies (i) and (ii)? This would hugely benefit both the paper and the seismological community. Moreover, from the introduction, it is not clear which strategy between (i) and (ii) you intend to focus on, and why.
I feel that the motivation for the study should be discussed more indepth, because the pros and cons of raytheory vs. ray tracing are well known (for reviews, see Rawlinson & Sambridge 2003, Rawlinson et al. 2010).
General Consideration: This study focuses on very shallow crustal structures; I wonder if the iterative nonlinearinversion scheme used to convert the dispersion curves to the 3D Vs model can be considered appropriate in this sense. Wouldn't a globally optimized algorithm be more suited to the solution of a (possibly highly) nonlinear problem such as that of this kind? Adding some consideration on the matter would be useful.
Method: The method, and the related assumptions, should be explained more clearly (see also the points below). In principle, since this is supposed to be a technical work, I believe this part should be highly detailed, so as to make your work reproducible. I suggest that you expand largely this section, and possibly subdivide it into several subsections. It would be good to explain (i) the calculation of dispersion curves, (ii) the raytheory vs. the raytracing algorithm (with the latter one meriting more consideration, especially if you implemented it on your own), (iii) the forward solver that allows you to measure predicted data from a given Vs model, and (iv) the inversion for Vs. In (iv), it would be good to say something about the calculation of sensitivity kernels at different periods. Other points are found below.
 One can only guess that the inversion strategy chosen by the authors involves the direct inversion of surfacewave velocity for the 3D structure.
 Synthetic tests to verify the accuracy of the raytracing algorithm should be presented. What is the relative error as a function of distance from the source based on a homogenous medium?
 When you refer to "straight lines", are you referring to greatcircle paths? If not, it should be explained that you designed your experiment in a cartesian coordinate system.
 Each vector and matrix in equations (2) and (3) should be thoroughly explained, and their dimensions be explicit. For example, is d_obs your slowness, or is it the arrival time obtained from slowness and raypath distance? How do you calculate the roughness operator Rp? Generally, the extent of the roughness is determined by a damping scalar coefficient (e.g., Boschi & Dziewonski 1999, Magrini et al. 2022), but you have the matrix C_{R_{p}}. Can you please be more explicit on its calculation? Can you also provide a reference for your equation (3), or alternatively a derivation for it? As it is, it appears different from, e.g., eqs. (51) and (57) of Rawlinson & Sambridge (2003).
 Is your stopping criterion compatible with previous studies?
Results: An important point that does not seem to be discussed is the choice of damping in the two different inversions. Slightly different values of damping can produce slightly different results. I believe it would be important to discuss in some depth this choice, and to demonstrate, to some degree, that the result of your comparisons is not biased by improper use of regularization. (Note that a given value of roughness damping might be ideal for the raytheory case but not for the raytracing case, and vice versa).
 For example, consider Fig 10: I have the feeling that the large differences between the two inversions might derive from the choice of the roughness damping (the straightray tomography seems slightly underdamped). Have the authors experimented with different values?
 Are your synthetic data generated with the mentioned 3D finitedifference code only in the Case study 1 or also in the Case study 2? Eventually, a brief explanation of this code could go in the Method section.
 I am struggling to understand the meaning of the red arrows/letters in Fig. 8, and the caption is not helping me. Probably I am missing something simple, but this suggests to me that the explanation in the caption should be extended or made clearer.
Discussion: The relative misfit in equation (6) is a function of fw(m_final). May you please be more explicit on the forward calculation of your dispersion curves (predicted data) based on a given model? Did you use SOFI3D?
Minor
 In the introduction, you refer to Boschi and Dziewonski (1999) while speaking of seismic ambient noise. Clearly, in 1999 ambientnoise tomography did not exist
References
Nicholas Rawlinson and Malcolm Sambridge. Seismic traveltime tomography of the crust and
lithosphere. Advances in geophysics, 46:81–199, 2003.Nicholas Rawlinson, S Pozgay, and S Fishwick. Seismic tomography: a window into deep earth.
Physics of the Earth and Planetary Interiors, 178(34):101–135, 2010.Fabrizio Magrini, Sebastian Lauro, Emanuel Kästle, Lapo Boschi. Surfacewave tomography using SeisLib: a Python package for multiscale seismic imaging, Geophysical Journal International, ggac236, https://doi.org/10.1093/gji/ggac236, 2022

AC3: 'Reply on RC4', Mohammadkarim Karimpour, 03 Aug 2022
Thank you for your suggestions. We have added more details, particularly in the Method Section, to allow the readers to fully understand the applied methodology and increase the reproducibility of the work.
We have copied your comments below and assigned a number to each of them. We have provided a response for every comment.
Comment#1: Abstract: This is too general. It does not display any highlights on the results presented in the paper
Response#1: As also suggested by Reviewer#2, we have modified the Abstract and better highlighted the key results of the study.
Comment#2: Introduction: In general, two different strategies exist to "convert" surfacewave dispersion curves to a 3D Vs model. As explained in the manuscript, the first (i) involves the calculation of phasevelocity maps, which are then converted to Vs by carrying out many 1D inversions. The second, instead, (ii) allows one to invert the dispersion curves directly for the 3D model. In both cases, the data kernels can be calculated (a) by assuming that the waves travel along the greatcircle path connecting a given station pair (raytheory) or (b) by accounting for ray bending (raytracing).
In this study, you carry out a comparison between strategies (a) and (b). Is there a reason why you don't contemplate strategies (i) and (ii)? This would hugely benefit both the paper and the seismological community. Moreover, from the introduction, it is not clear which strategy between (i) and (ii) you intend to focus on, and why.
Response#2: As you mentioned, this study focuses on the comparison between strategies (a) and (b). The comparison between strategies (i) and (ii) is definitely very interesting and can be a topic for a separate study because it is out of the scope of this work. We have reconstructed 3D VS models by direct inversion of DCs. We have clarified it in lines 7980 of the revised version of the manuscript as:
“For each dataset, 3D VS models from straight and curvedray SWT are obtained by direct inversion of DCs, …”
Comment#3: I feel that the motivation for the study should be discussed more indepth, because the pros and cons of raytheory vs. ray tracing are well known (for reviews, see Rawlinson & Sambridge 2003, Rawlinson et al. 2010).
Response#3: We have rewritten some paragraphs of the Introduction for clarification. The main purpose of this study is to evaluate the performance of straightray and curvedray SWT at the nearsurface scale. As stated in lines 5253 of the revised version of the manuscript, despite seismological studies, such comparison is missing at the nearsurface scale. We have also compared our results with previous seismological studies in Section 4.5 of the revised version of the manuscript, and showed that the previous findings in seismology might not be valid at the nearsurface scale. Moreover, we have explained the applied criteria to optimize the shot positions for a SWT study using active seismic data at the nearsurface scale. This is a key difference between the available seismological studies and nearsurface studies using active seismic data. We show the importance of the shot optimization in the obtained dispersion curves coverage and consequently the obtained VS models from the inversions in the Results Section.
Comment#4: General Consideration: This study focuses on very shallow crustal structures; I wonder if the iterative nonlinearinversion scheme used to convert the dispersion curves to the 3D Vs model can be considered appropriate in this sense. Wouldn't a globally optimized algorithm be more suited to the solution of a (possibly highly) nonlinear problem such as that of this kind? Adding some consideration on the matter would be useful.
Response#4: We had stated in lines 5961 of the original version that:
“The computational efficiency is of great importance in seismic nearsurface since, compared to seismological studies, the abundance of data at active seismic nearsurface projects can increase the computational cost significantly.”
The computational cost is a key factor for professionals particularly in the nearsurface studies with high amount of data. We had shown in Table 3 of the original version, the cost of the SWT using the employed deterministic approach which can be as much of 80 $. This number can increase drastically by using stochastic methods.
Moreover, we have shown in Table 4 of the revised version of the manuscript that the final VS model obtained from the iterative nonlinear inversion can be quite accurate even though the inversions started from homogeneous initial models.
Comment#5: Method: The method, and the related assumptions, should be explained more clearly (see also the points below). In principle, since this is supposed to be a technical work, I believe this part should be highly detailed, so as to make your work reproducible. I suggest that you expand largely this section, and possibly subdivide it into several subsections. It would be good to explain (i) the calculation of dispersion curves, (ii) the raytheory vs. the raytracing algorithm (with the latter one meriting more consideration, especially if you implemented it on your own), (iii) the forward solver that allows you to measure predicted data from a given Vs model, and (iv) the inversion for Vs. In (iv), it would be good to say something about the calculation of sensitivity kernels at different periods. Other points are found below.
Response#5: Following this comment, we have largely expanded the Method Section in the revised version of the manuscript and added way more details and equations. We have also divided this section into:
2.1 Optimisation of source layout
2.2 Estimation of DCs
2.3 1D forward modelling
2.4 Computation of forward response
2.5 Inversion algorithm
We have supported each subsection by related explanations and Equations.
Comment#6:  One can only guess that the inversion strategy chosen by the authors involves the direct inversion of surfacewave velocity for the 3D structure.
Response#6: We have clarified it in lines 7980 of the revised version of the manuscript:
“For each dataset, 3D VS models from straight and curvedray SWT are obtained by direct inversion of DCs, …”
Comment#7:  Synthetic tests to verify the accuracy of the raytracing algorithm should be presented. What is the relative error as a function of distance from the source based on a homogenous medium?
Response#7: To test the accuracy of the ray tracing algorithm, we used a homogeneous medium (the Blocky model with constant VS equal to 200 m/s in the whole medium) and computed the ray paths. As an example, we show the computed paths for a DC where the receivers are located at (19 m, 5 m) and (10 m, 5 m) in the figure below:
Figure 1. The computed ray paths for different frequency component of the DC with receivers’ positions at (19 m, 5 m) and (10 m, 5 m).
The average relative error of the paths (i.e., deviation from straightline) is almost zero (7.2e16). We have not shown this figure in the revised manuscript. Nonetheless, we have added the following clarification in lines 129130 of the revised version of the manuscript:
“To evaluate the accuracy of the raytracing algorithm, we have applied it in a to a homogeneous media and noticed that the error (i.e., deviation from straightline) in this condition is almost zero (not shown here).”
Comment#8:  When you refer to "straight lines", are you referring to greatcircle paths? If not, it should be explained that you designed your experiment in a cartesian coordinate system.
Response#8: We have added the following clarification sentence in line 118 of the revised version of the manuscript:
“We carry out our experiments in a Cartesian coordinate system.”
Comment#9:  Each vector and matrix in equations (2) and (3) should be thoroughly explained, and their dimensions be explicit. For example, is d_obs your slowness, or is it the arrival time obtained from slowness and raypath distance? How do you calculate the roughness operator Rp? Generally, the extent of the roughness is determined by a damping scalar coefficient (e.g., Boschi & Dziewonski 1999, Magrini et al. 2022), but you have the matrix C_{R_{p}}. Can you please be more explicit on its calculation? Can you also provide a reference for your equation (3), or alternatively a derivation for it? As it is, it appears different from, e.g., eqs. (51) and (57) of Rawlinson & Sambridge (2003).
Response#9: We have added the reference for Eq. (3) (equivalent of Eq. 11 in the revised version of the manuscript), which is Boiero (2009).
We have extended the Method Section largely and added way more details in this section. We explained in line 112 of the revised version of the manuscript that the dimension of the vector of the experimental data (d_obs) is (phase) velocity. We have also clarified in lines 160162 of the revised version of the manuscript that we have assigned a large value (10^{6}) to the covariance matrix of the spatial regularization matrix:
“To reduce the impact of spatial regularization on the inversion results, in all four examples in this study, a large value (10^{6}) is assigned to C_{Rp}. It means that the VS difference between the neighbouring cells is constrained to 1000 m/s.”
Comment#10:  Is your stopping criterion compatible with previous studies?
Response#10: In some surface waves studies, the inversion stops when the misfit function reduces less than 1% with respect to the value at the previous iteration (e.g., Garofalo et al., 2015). Since our computational facilities have been improved compared to before, we have defined a lower threshold in our study (0.01%) to make sure that the inversion reaches a local minima. As can be seen in Fig. 6b of the revised version of the manuscript, the misfit value of straightray inversion shows a sudden decrease at iteration 27, while the inversion would have stopped at iteration 26 if a higher threshold had been chosen as the stopping criterion.
Comment#11: Results: An important point that does not seem to be discussed is the choice of damping in the two different inversions. Slightly different values of damping can produce slightly different results. I believe it would be important to discuss in some depth this choice, and to demonstrate, to some degree, that the result of your comparisons is not biased by improper use of regularization. (Note that a given value of roughness damping might be ideal for the raytheory case but not for the raytracing case, and vice versa).
Response#11: We have clarified the choice of regularization 160162 of the revised version of the manuscript. As pointed out by Trampert and Spetzler (2006), the choice of regularization has a major impact on SWT results. Therefore, we have assigned a very large number (10^{6}) to the regularization values so that the final VS model is not biased by the regularization values and the comparison between straightray and curvedray methods are fair.
Comment#12:  For example, consider Fig 10: I have the feeling that the large differences between the two inversions might derive from the choice of the roughness damping (the straightray tomography seems slightly underdamped). Have the authors experimented with different values?
Response#12: We have provided a detailed response in the response to the comments of Reviewer#1. The observed difference is due to the ‘wrong’ ray paths in the straightray approach. At the edges of the velocity anomaly, the assumed paths by the straightray are shorter than the true paths and therefore the velocities are (wrongly) high.
All experiments have been done with the same regularization values. As stated in lines 160162 of the revised version of the manuscript.
Comment#13:  Are your synthetic data generated with the mentioned 3D finitedifference code only in the Case study 1 or also in the Case study 2? Eventually, a brief explanation of this code could go in the Method section.
Response#13: The same 3D finite difference code has been used to generate the synthetic data in both case studies 1 and 2. We have added the following sentence in line 238239 of the revised version of the manuscript for clarification:
“The same finite difference code used for the Blocky model was used to obtain the Sand Bar synthetic dataset …”
We have added a brief explanation of this code (SOFI3D) in lines 197201 of the revised version of the manuscript as:
“The code is an FD modelling program based on the FD approach described by Virieux (1986) and Levander (1988) with some extensions. It can consider viscoelastic wave propagation effects such as attenuation and dispersion, employ higher order FD operators, 200 apply perfectly matched layer (PML) boundary conditions at the edges of the model, and it works in message passing interface (MPI) parallel environment which reduces the running time of the simulations”
Comment#14:  I am struggling to understand the meaning of the red arrows/letters in Fig. 8, and the caption is not helping me. Probably I am missing something simple, but this suggests to me that the explanation in the caption should be extended or made clearer.
Response#14: Since Reviewer#1 had the same struggle, we have removed the red arrows from the reconstructed VS models from the inversion (Figures 4, 7, 10, and 13 of the revised version of the manuscript).
Comment#15: Discussion: The relative misfit in equation (6) is a function of fw(m_final). May you please be more explicit on the forward calculation of your dispersion curves (predicted data) based on a given model? Did you use SOFI3D?
Response#15: No, we have not used SOFI3D for this purpose. Following your previous comments on the Method Section, the process for the forward calculation of the dispersion curves have been thoroughly explained in Section 2.4 of the revised version of the manuscript.
Comment#16: In the introduction, you refer to Boschi and Dziewonski (1999) while speaking of seismic ambient noise. Clearly, in 1999 ambientnoise tomography did not exist
Response#16: We have put it its right place, that is at line 24 of the revised version of the manuscript.

AC3: 'Reply on RC4', Mohammadkarim Karimpour, 03 Aug 2022
Interactive discussion
Status: closed

RC1: 'Comment on egusphere2022279', Anonymous Referee #1, 09 Jun 2022
The submitted manuscript focuses on comparing the straightray and curvedraybased surface wave tomography using four data sets under the nearsurface condition. The surface wave tomography technique is a wellestablished method in regional and global seismology, introducing this technique to nearsurface applications could be beneficial for accurately investigating the shallow target, and this topic is within the scope of the journal. However, some points should be addressed before a possible publication. Moreover, there are lots of typos in the manuscript that should be revised properly. The English also needs improvement. I would recommend moderate revision before publication.
Following, I listed some comments regarding the manuscript.
 Line 75, Page 3. Here the authors state the Vp and density are assumed to be known as prior information but in the result section (such as in the figure caption of Fig 2) the term “Poisson ratio” is used instead of the Vp. It is recommended to keep the full text consistent.
 Section 3.1. It is recommended to provide the elastic parameters of the Blocky model to help readers better understand what the model is like.
 Line 115, Page 5. It may simply be stated that 16 shots were chosen to generate the raw data after optimized design. The 441 shots may make the readers confusing.
 Line 130, Page 6. What do the red text ‘a’, ‘b’, and ‘c’ and red arrows in Fig 1a represent? They should be demonstrated in the figure caption.
 Line 140, Page 7. “The same initial model is used as the starting model for the SWT inversion in both straight and curved ray methods”. It is recommended to show the readers the iteration curves and inversion parameters (damping factor and weights used in the regularization matrix). In fact, the inversion parameters of the two inversions should be the same for the sake of comparison.
 Figure 6. For the current version, the red arrow and text are confusing for the readers. It is recommended to look for a better way to show the plot.
 Line 180, Page 12. It is not easy for readers to identify that the boundary in the curvedray tomogram is clear than the one in the straightray tomogram. Providing the model error in this area might be better to support this conclusion.
 Figure 10. Again, it is recommended to provide the iteration curves and inversion parameters. It seems that the inversion using the straightray method becomes unstable and there are some outliers in the tomogram. Do these two methods use the same weights in the smooth regularization?
 For some plots, such as in Figure 12, it is recommended to indicate which data set the plot is related to in the figure caption.
 Line 280, Page 20. Please delete the redundant sentence “For each parameter, the values …”
 Table 3. The abbreviation CR and SR should be mentioned in the context, also, the formula for calculating CRSR should be demonstrated.
 Line 295, Page 21. Again, please delete the redundant sentence “For each parameter, the values …”
 Please check the sentence above section 4.3: Table 3 that using curvedray SWT has increases the computational cost by an average of 23 %.

RC3: 'Reply on RC1', Emanuel Kästle, 10 Jun 2022
A short note on comment 8/Fig 10: I also thought that maybe the straight ray ray tomography became unstable which is causing these extreme velocity jumps. But I think it could be an actual effect only caused by the 'wrong' ray paths. Since the straight rays are forced through the slow, central part of the medium (while the true rays curve around and never saw any of the slow medium), the inversion needs to compensate by putting higher velocities in other grid cells, which results in the strong irregularities. I would suggest that the authors discuss this effect and add the information required by Reviewer 1.
 AC4: 'Reply on RC3', Mohammadkarim Karimpour, 03 Aug 2022

AC1: 'Reply on RC1', Mohammadkarim Karimpour, 03 Aug 2022
Dear Reviewer,
Thank you for your comments and suggestions. We have corrected the typos and provided a response for each of your comments.
 Following this suggestion, we have modified the text and used “Poisson ratio” to keep the full text consistent.
 Following this comment, we have provided the elastic parameters of the Blocky model in Table 1 of the revised version of the manuscript.
 We have removed the sentence “To optimize the shot locations, we defined 441 shots …” to avoid confusing the readers. In the revised version of the manuscript, we have added Section 2.1 to clarify the employed criteria to pick the shot positions.
 We have added the following explanation in the caption to clarify the purpose of using the red arrows “a”, “b”, and “c”:
“The red arrows and letters represent the location of 2D slices in subplots (bd).”
 We have added the iteration curves for all the examples in Figures 3b, 6b, 9, and 12 of the revised version of the manuscript. We have added the clarifying sentence that all the inversion parameters are kept the same for the sake of comparison in lines 185187 of the revised version of the manuscript as:
“For each example, the straightray and curvedray SWT inversions start from the same initial model. Other inversion parameters (C_Rpand λ) are also the same for the sake of comparison.”
 We received similar comment from Reviewer#3 (Fabrizio Magrini). To avoid confusion, we have removed the red arrows from the reconstructed VS models from the inversion (Figures 4, 7, 10, and 13 of the revised version of the manuscript). The intention to put them was to clarify the position of each crosssection but it seemed to add confusion for the readers.
 The distribution of the model error is displayed in the attaches supplement file.
As shown is in the figure of the supplement file, the difference between the model errors from the straightray and curvedray is very small. We have not included this figure in the text of the manuscript because it does not show a clearer difference between straight and curvedray approaches than what has been already shown in Fig. 7 of the revised version of the manuscript. We have compared the model errors in Table 4, which shows that curvedray approach has produced a more accurate model. This agrees with what we had stated in the original manuscript lines 177178, that is the difference between the VS model from straightray and curvedray approaches in this example is not significant:
“The vertical slices at X (Figure 6a and d) and Y directions (Figure 6b and e) do not display significant differences.”
We have also modified these sentences in lines 260262 of the revised version of the manuscript for clarification:
“The areas marked in dashed black in the horizontal slices (Fig. 7c and f) shows that the boundaries of the anomaly are slightly clearer in the curvedray approach (Fig. 7f) and also the VS values in these areas are closer to the true VS value (Fig. 5d).”
 We have responded to this comment in the responses to comment#5. We have added the iteration curve for this example. As we explained in the response to comment#5, the inversion parameters are the same for both straightray and curvedray methods. Moreover, the same regularization values (i.e., same weights) have been used for both approaches.
As commented correctly by Reviewer#2 (Emanuel Kästle), the observed difference is due to the ‘wrong’ ray paths in the straightray approach. At the edges of the velocity anomaly, the assumed paths by the straightray are shorter than the true paths and therefore the velocities are (wrongly) high.
We have added the following explanation to the text (lines 324327 of the revised version of the manuscript) to clarify this issue:
“The black arrow in Fig. 13c shows the high velocity cells inside the loose sand body in the retrieved model from the straightray SWT. The reason is that in this area (close to the interface of the sand body and the background medium) the assumed paths in the straightray approach are much shorter than the true paths and therefore, the obtained VS from the inversion becomes unrealistically high.”
 We have added this clarification in Figures 1517 of the revised version of the manuscript.
 We have deleted the redundant sentence from the revised version of the manuscript.
 We have shown the difference between the parameters from curvedray and straightray by , added Eq. (16) to show how to compute this parameter, and updated Table 4 in the revised version of the manuscript. We have clarified it in lines 379383 of the revised version of the manuscript.
 We have removed the redundant sentence.
 We have corrected the sentence and deleted the redundant phrase.

RC2: 'Comment on egusphere2022279', Emanuel Kästle, 10 Jun 2022
Dear Editor,
Dear Authors,the manuscript entitled "A comparison of straightray and curvedray surface wave tomography approaches at nearsurface studies" provides an informative study that is certainly of interest to tomographers who consider applying similar methods. The manuscript is clearly structured and generally well written. The applied methods are appropriate and the authors' conclusions are supported by the results of the two synthetic and of the two real data examples. Nevertheless, there are a couple of points that, in my view, need to be addressed before publication in Solid Earth.
Remarks
In the introduction, a few other studies are mentioned that also apply straight and/or curvedray tomography, but it is not clear for most of these studies whether they performed a comparison of the thwo approaches. There should be an overview of what the conclusions from other authors on the topic were. I am also missing a (short) discussion at the end of the manuscript on whether the results agree with existing literature.
The abstract needs to be rewritten. It gives a very brief introduction and motivation for the study. But an abstract should summarize the key results of the study. The same applies to the conclusions section which should be rewritten.
One of your key results is that in a scenario with low data coverage, the curvedray approach performs significantly better. But your synthetic examples do not prove that since you run no example with low coverage. I would suggest that you take one of your two synthetic tests and test whether this conclusion holds.
l. 10 "exact paths"  the term 'exact' is quite vague. In this case you calculate Eikonal paths since they are the solution to the Eikonal equation. These represent an approximation and not necessarily the true/exact paths.
l. 35 So did Trampert and Spetzler find a difference between the raybased and the finitefrequency approach?
l. 38 It sounds like Gouedard did not perform any comparison between straight and Eikonal ray based models. So why cite them here?
l. 5056 You present several studies where some applied some form of ray tracing and others didn't. But what is your point? Did these studies find any advantage in raytracing? If your point is that researchers have applied different methods to approximate the rays but noone has done a systematic study, then you should write it like that.
l. 70 "shots are defined as a regular grid" sounds wrong to me. I would rather write something like "shot locations located on a regular grid are tested by calculating the number of aligend receivers for each location."
l. 71 How many shots are picked, what are the criteria for the number of picked shot locations?
l. 71 I assume this approach only applies to receiver layouts on a grid and not in case of irregular/random receiver locations? Maybe you should say so.
l. 75 Which values for Vp and rho are you assuming in your study? How are these values chosen?
l. 7680 I think the way you describe the procedure is a bit complicated. Bascially, you take your 3D model defined by Vs,Vp and rho and extract 1D depth profiles at each point of the model. You then calculate the phase dispersion curve for each profile and join all the dispersion curves to get 2D phaseslowness maps at a set of periods (at how many periods, how do you choose the periods?). The ray tracing is then done in each of the phaseslowness maps separately. I would suggest to rewrite this paragraph.
l. 99, 102 "uneven sampling", "nonuniform sampling"; it would be helpful to your readers if you could say more precisely what you mean. If I understand correctly, it is that the number of samples is not the same at different wavelengths, i.e. periods?
You should also mention how you sample your dispersion curves. From the images, it looks like you have a uniform sampling in frequency. This means that you implicitly put a higher weight on high frequencies (a uniform sampling in period would imply a higher weight on low frequencies). Many researchers therefore apply a logspaced sampling.
l. 105 "sigma_i,j is the standard deviation of the ith data point of the jth dispersion curve". I think there is a mistake in that description. In your matrix, only the trace is nonzero. Your sentence would then imply that for each dispersion curve, you only have a single measurement. Instead, I think that you have several measurements (at a set of periods) for each dispersion curve, so that the measurements from disp curve 1 have standard deviations sigma_1,1 to sigma_n,n, and from disp curve 2 from sigma_n+1,n+1 to sigma_n+m,n+m, and so on...
l. 109 What is meant by "closest data point"? Please explain. Also, why the delta in delta lambda_j,max if it is the maximum wavelength? (from the delta I would expect a difference)
l. 110 Did you add any error to your synthetic measurements? Please mention in the text.
l. 150 It would be good to give a more quantitative measure for the quality of reconstruction, fo example by providing the variance reduction or simply the misfit to the input model. (I just saw that you did in table 3, I would suggest that you write down these values here or refer to table 3).
l. 199 Are the values of nu and rho in your Table 2 fixed during the inversion? Please mention somewhere in the text. What influence do you expect from the potential errors in these values?
l. 214 What value for the data standard deviation (sigma, eq 4) do you assume in your synthetic tests and in the real data examples? Is sigma individually determined for each measurement?
l 324 You weighting is based on the wavelength of the signals, but at the same time you argue with the lower number of data at long wavelengths. So should the weight not rather be based on the number of measurements at each frequency? Or, putting the question differently, if I have a dataset with exactly the same number of measurements at each frequency (as is probably the case in your synthetic experiment), do I still need the weighting?
I would like to refer again to my previous comment on the importance of the sampling of your dispersion curves. If you use a uniform sampling in frequency, it is clear to me that the low frequency measurements are underweighted. Maybe run a test with log sampling and compare to the results in Fig. 14.Fig. 7 There should be a scale on (b). Why is the panel in (d) cropped? It seems that some dispersion curves go also to values slower than 50 m/s.

AC2: 'Reply on RC2', Mohammadkarim Karimpour, 03 Aug 2022
Thank you for your comments and suggestions. We have replied to all your comments below. In the following, we have copied your comments and assigned a number to each of them (for clarification) and provided the explanations for every comment.
Remarks
Comment#1: In the introduction, a few other studies are mentioned that also apply straight and/or curvedray tomography, but it is not clear for most of these studies whether they performed a comparison of the thwo approaches. There should be an overview of what the conclusions from other authors on the topic were. I am also missing a (short) discussion at the end of the manuscript on whether the results agree with existing literature.
Response#1: We have modified the Introduction and added the following sentences to clarify this issue and provide an overview of what the conclusion from other authors on the topic were in the lines 3753 of the revised version of the manuscript as:
“SWT has been used in seismological studies for decades and different SWT approaches have been compared by seismologists. For instance, Laske (1995) studied deviations from straight line in the propagation of longperiod surface waves and concluded that they usually have small effects on the propagation phase. Spetzler et al. (2001) applied both straightray and curvedray SWT methods. They computed the maximum deviations of ray paths from straight lines and pointed out that this maximum is typically below the estimated resolution, except for long paths at short periods. Some studies showed that a more complex forward modelling in SWT did not improve the results (Sieminski et al., 2004; Levshin et al., 2005) while other studies reported obtaining better results (Ritzwoller et al., 2002; Yoshizawa and Kennett, 2004; Zhou et al., 2005). Trampert and Spetzler (2006) pointed out that the choice of regularization has a major impact on SWT results. They studied SWT methods based on ray theory (straightray and curvedray) and scattering theory in which the integral along the ray path is replaced by the integral over an influence zone. They showed that these methods are statistically alike and any model from one method can be obtained by the other one by changing the value of the regularization. They concluded that the only option to increase the resolution of the model is to increase and homogenize the data coverage. Bozdag and Trampert (2008) compared straightray and curvedray SWT methods in their study and mentioned that performing ray tracing could be so timeconsuming that the potential gain in crustal corrections on a global scale might not be worth the additional computational effort imposed by ray tracing. Despite seismological studies, a comparison between the performance of straightray and curvedray SWT at the nearsurface scale is missing.”
We have also added Section 4.5 to compare the results of our study with previous studies.
Comment#2: The abstract needs to be rewritten. It gives a very brief introduction and motivation for the study. But an abstract should summarize the key results of the study. The same applies to the conclusions section which should be rewritten.
One of your key results is that in a scenario with low data coverage, the curvedray approach performs significantly better. But your synthetic examples do not prove that since you run no example with low coverage. I would suggest that you take one of your two synthetic tests and test whether this conclusion holds.
Response#2: We have rewritten the Abstract and Conclusions Sections. We have considered to better highlight the motivation and the key results of the study in the revised version. We have added several lines (1419 of the revised version of the manuscript) at the end of the Abstract to highlight the key results as:
“In three examples we optimise the shot positions to obtain acquisition layout which can produce high coverage of dispersion curves. In the other example, the data have been acquired using a typical seismic exploration 3D acquisition scheme. We show that if the source positions are optimised, the straightray can produce Swave velocity models similar to the curvedray SWT but with lower computational cost. Otherwise, the improvement of inversion results from curvedray SWT can be significant.”
We have rewritten the whole Conclusions Section as well.
Regarding the data coverage, we have made several changes to explain this issue more clearly. As we have stated in line 1418 of the revised version of the manuscript, for three examples in this study (the Blocky model, Sand Bar mode, and Pijnacker example) we have optimized the shot positions to obtain high data coverage. We have devoted Section 2.1 to describe the employed algorithm to optimize the source positions. For case of the CNR example, the acquisition layout mimics at a smaller scale the classical seismic exploration 3D crossspread acquisition scheme with orthogonal lines of sources and receivers. This dataset, not being optimised will help analysing the criticalities introduced by a nonoptimal acquisition scheme. We believe that the subject of the data coverage has been more clearly explained in the revised version and therefore, we did not repeat a test with one of the synthetic examples with lower data coverage (because the problem is not only about having a low data coverage. It’s about the low data coverage that may be obtained from a typical exploration acquisition scheme)
Comment#3: l. 10 "exact paths"  the term 'exact' is quite vague. In this case you calculate Eikonal paths since they are the solution to the Eikonal equation. These represent an approximation and not necessarily the true/exact paths.
Response#3: We have removed the term “exact” and modified the sentence in lines 910 to:
“Alternatively, curvedray SWT can be employed by computing the paths between the receiver pairs using a raytracing algorithm.”
Comment#4: l. 35 So did Trampert and Spetzler find a difference between the raybased and the finitefrequency approach?Response#4: We have added the following explanation to show the highlights of the study by Trampert and Spetzler (2006) in lines 4449 of the revised version of the manuscript:
“Trampert and Spetzler (2006) pointed out that the choice of regularization has a major impact on SWT results. They studied SWT methods based on ray theory (straightray and curvedray) and scattering theory in which the integral along the ray path is replaced by the integral over an influence zone. They showed that these methods are statistically alike and any model from one method can be obtained by the other one by changing the value of the regularization. They concluded that the only option to increase the resolution of the model is to increase and homogenize the data coverage.”
Comment#5: l. 38 It sounds like Gouedard did not perform any comparison between straight and Eikonal ray based models. So why cite them here?Response#5: Following this suggestion, that work (Gouedard et al., 2010) has been deleted from the Introduction in the revised version of the manuscript.
Comment#6: l. 5056 You present several studies where some applied some form of ray tracing and others didn't. But what is your point? Did these studies find any advantage in raytracing? If your point is that researchers have applied different methods to approximate the rays but noone has done a systematic study, then you should write it like that.Response#6: We have rewritten the Introduction and devoted a paragraph to refer to the previous seismological studies where these two approaches were compared. We have clarified in lines 5253 of the revised version of the manuscript that such comparison is missing at the nearsurface scale. This is one of the key points of this study to compare the two methods at the nearsurface scale.
Comment#7: l. 70 "shots are defined as a regular grid" sounds wrong to me. I would rather write something like "shot locations located on a regular grid are tested by calculating the number of aligend receivers for each location."Response#7: Following this comment, we have removed “shots are defined as a regular grid”. In the revised version of the manuscript, the Method Section has been expanded largely as suggested by Reviewer#3. We have dedicated Section 2.1 to explain the employed procedure to pick the shot positions.
Comment#8: l. 71 How many shots are picked, what are the criteria for the number of picked shot locations?Response#8: As mentioned in the response to the previous comment, we have explained the criteria to pick the shot positions in Section 2.1 of the revised version of the manuscript.
Comment#9: l. 71 I assume this approach only applies to receiver layouts on a grid and not in case of irregular/random receiver locations? Maybe you should say so.Response#9: It can be applied also to irregular/ random receiver locations. We have used the guidelines proposed by Da Col et al. (2020). In that study, the receivers are not put as a regular grid.
We have clarified it in Section 2.1, lines 8788 of the revised version of the manuscript:
“For a given (random or regular) array configuration, we can optimise the locations of shots to ensure having high coverage DCs with minimum number of shots based on the guidelines by Da Col et al. (2020).”
Comment#10: l. 75 Which values for Vp and rho are you assuming in your study? How are these values chosen?Response#10: As requested by Reviewer#1, we have changed Vp to Poisson ratio for the sake of consistency. For the synthetic examples, we have used the true values of Poisson ratio and densities, as mentioned in lines 216217 and 247248 of the revised version of the manuscript. For the Pijnacker example, we have added the following clarification in lines 279280 of the revised version of the manuscript as:
“Since the medium was (almost) saturated, a high ν value (0.45) was chosen for the initial model. The ρ values in the medium were assumed to be low (1700 kg m^{3}) because it consisted of unconsolidated materials.”
In case of the CNR example, the following clarification has been added in lines 309310 of the revised version of the manuscript:
“… ν is approximated based on a previous study (Khosro Anjom et al., 2019) on the site and fixed at 0.33, and density is fixed at 2000 kg m^{3} since the site mainly consists of loos sand material.”
Comment#11: l. 7680 I think the way you describe the procedure is a bit complicated. Bascially, you take your 3D model defined by Vs,Vp and rho and extract 1D depth profiles at each point of the model. You then calculate the phase dispersion curve for each profile and join all the dispersion curves to get 2D phaseslowness maps at a set of periods (at how many periods, how do you choose the periods?). The ray tracing is then done in each of the phaseslowness maps separately. I would suggest to rewrite this paragraph.Response#11: We have deleted this paragraph from the manuscript. Instead, we have expanded the Method Section as suggested by Reviewer#3. We have provided more details and clarifications in the Method Section of the revised version of the manuscript.
Comment#12: l. 99, 102 "uneven sampling", "nonuniform sampling"; it would be helpful to your readers if you could say more precisely what you mean. If I understand correctly, it is that the number of samples is not the same at different wavelengths, i.e. periods?Response#12: We did not mean that the number of samples is not the same at different wavelength. We estimated the DCs in the frequency domain and each DC is sampled uniformly in frequency (delta_f is constant). This means that for each DC, the adjacent sampled points have the same “delta_f” but not the same wavelength difference. In fact, the portion of the DC with lower wavelength are more sampled than the portion with higher wavelength.
We have modified this part of the manuscript and added the following explanation to clarify this subject in lines 172175 of the revised version of the manuscript:
“To estimate the DCs from raw data, we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency. This means that each DC is nonuniformly sampled in terms of wavelength which can drive the inversion algorithms to the shallowest part of the subsurface without any significant updates in the deeper portion of the initial velocity model (Khosro Anjom and Socco, 2019).”
Comment#13: You should also mention how you sample your dispersion curves. From the images, it looks like you have a uniform sampling in frequency. This means that you implicitly put a higher weight on high frequencies (a uniform sampling in period would imply a higher weight on low frequencies). Many researchers therefore apply a logspaced sampling.Response#13: We have mentioned in lines 172173 of the revised version of the manuscript that we sample each DC uniformly in frequency:
‘”… we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency.”
We have added explanations regarding the employed processing tool (the autopicking code described in Papadopoulou, 2021) and devoted a subsection (2.2) to explain the applied methodology to estimate the DCs from the raw data. As this code samples each DC uniformly in frequency, we impose the wavelengthbased weights to increase the weights of the points with lower wavelength.
Comment#14: l. 105 "sigma_i,j is the standard deviation of the ith data point of the jth dispersion curve". I think there is a mistake in that description. In your matrix, only the trace is nonzero. Your sentence would then imply that for each dispersion curve, you only have a single measurement. Instead, I think that you have several measurements (at a set of periods) for each dispersion curve, so that the measurements from disp curve 1 have standard deviations sigma_1,1 to sigma_n,n, and from disp curve 2 from sigma_n+1,n+1 to sigma_n+m,n+m, and so on...Response#14: In fact, we agree that the Eq. 4 in the original version (equivalent of Eq. (12) in the revised version of the manuscript) could be confusing. To clarify, we have modified the previous equation to:
Comment#15: What is meant by "closest data point"? Please explain. Also, why the delta in delta lambda_j,max if it is the maximum wavelength? (from the delta I would expect a difference)
Response#15: The weights in Eq. (13) in the revised version of the manuscript, are computed separately for each DC. For each point of the DC, a wavelength value can be computed based on its phase velocity and frequency. For the generic i^{th} point of the j^{th} DC, the “closest data point” is defined as the point from which the i^{th} point has the smallest wavelength distance. delta lambda_j,max represents the maximum computed wavelength difference for the j^{th} DC. We have clarified these terms in lines 182183 of the revised version of the manuscript as:“where delta lambda_i,j represents the wavelength difference between the data point i of the j^{th} DC and the data point with the smallest wavelength distance from i, and delta lambda_j,max is the maximum computed wavelength difference for the j^{th} DC.”
Comment#16: l. 110 Did you add any error to your synthetic measurements? Please mention in the text.Response#16: We have not added any error to the synthetic data. We have clarified it in lines 197 and 239 of the revised version of the manuscript.
Comment#17: l. 150 It would be good to give a more quantitative measure for the quality of reconstruction, fo example by providing the variance reduction or simply the misfit to the input model. (I just saw that you did in table 3, I would suggest that you write down these values here or refer to table 3).Response#17: We have added the reference to Table 3 (equivalent of Table 4 in the revised version of the manuscript).
Comment#18: l. 199 Are the values of nu and rho in your Table 2 fixed during the inversion? Please mention somewhere in the text. What influence do you expect from the potential errors in these values?Response#18: Yes, the values of nu and rho are fixed during the inversion. We have explained it in lines 187191 of the revised version of the manuscript:
“It should be noted that only VS values are updated during the inversion and the other parameters (h, ν, and ρ) are fixed. In case of the synthetic examples, the true values of ν and ρ are used in the inversion. For the field examples, ν and ρ are approximated based on the available a priori information. Having erroneous values of ν and ρ can induce errors in the inversion results even though the sensitivity of surface waves to VS is more than ν (and way more than ρ).”
Comment#19: l. 214 What value for the data standard deviation (sigma, eq 4) do you assume in your synthetic tests and in the real data examples? Is sigma individually determined for each measurement?Response#19: We have added Eq. (2) from which the standard deviation of phase velocities are computed in lines 114115 of the revised version of the manuscript:
The proposed equation by Passeri (2019) is used to approximate the standard deviation (σ_{Vj}) of generic j^{th} element of the phase velocity vector (V_{j}) at its corresponding frequency (f_{j}) as …
Comment#20: l 324 You weighting is based on the wavelength of the signals, but at the same time you argue with the lower number of data at long wavelengths. So should the weight not rather be based on the number of measurements at each frequency? Or, putting the question differently, if I have a dataset with exactly the same number of measurements at each frequency (as is probably the case in your synthetic experiment), do I still need the weighting?Response#20: The weighting is different for each DC. The weighting is not based on the number of measurements at each frequency but rather based on the wavelength of each point of a DC. So, even in case of synthetic example, each DC is nonuniformly sampled in terms of wavelength and the wavelengthbased weighting can be applied to compensate this nonuniformity. We believe this concept has been better explained in the lines 172176 of the revised version of the manuscript as:
“To estimate the DCs from raw data, we have used the autopicking code (Papadopoulou, 2021) in which the DCs are sampled uniformly in frequency. This means that each DC is nonuniformly sampled in terms of wavelength which can drive the inversion algorithms to the shallowest part of the subsurface without any significant updates in the deeper portion of the initial velocity model (Khosro Anjom and Socco, 2019). To address this issue, a wavelengthbased weighting scheme was applied in the inversion process to compensate for this nonuniformity (see Khosro Anjom et al., 2021, for details).”
Comment#21: I would like to refer again to my previous comment on the importance of the sampling of your dispersion curves. If you use a uniform sampling in frequency, it is clear to me that the low frequency measurements are underweighted. Maybe run a test with log sampling and compare to the results in Fig. 14.Response#21: As we have explained in the Section 2.2 of the revised version of the manuscript, we have used a processing tool uniformly samples the DCs in frequency. We believe that with the provided clarifying explanations on the procedure of weighting in the revised version of the manuscript, the process is described much clearer than the original version. We did not run a test with log sampling because as explained earlier, our processing tool samples DCs uniformly in frequency.
Comment#22: Fig. 7 There should be a scale on (b). Why is the panel in (d) cropped? It seems that some dispersion curves go also to values slower than 50 m/s.
Response#22: We have added the scale for the Pijnacker’s acquisition scheme (Fig. 8b of the revised version of the manuscript). We have modified Figure 8b and decreased the lower limit of the yaxis to avoid confusion.

AC2: 'Reply on RC2', Mohammadkarim Karimpour, 03 Aug 2022

RC4: 'Comment on egusphere2022279', Fabrizio Magrini, 27 Jun 2022
Dear Editor,
The manuscript "A comparison of straightray and curvedray surface wave tomography approaches at nearsurface studies" compares the effectiveness of "straightray" and "curvedray" tomography in seismic imaging tasks. The pros and cons of these two approaches can be considered very well known to the seismological community, but I am not aware of any technical study that discusses them thoroughly and systematically. I suppose that doing so was the purpose of this study.
Unfortunately, I believe that a study of this kind might only be considered useful/appropriate/worth of publication if the technicalities inherent to the methods employed are very clearly explained; this is also very important, of course, to make the work reproducible. The manuscript, however, lacks many details, and does not allow one to fully understand or reproduce the work that was carried out. I suggest that the authors revise their manuscript extensively in this sense. Moreover, the quality of English and of the figures could and should be greatly improved.
The authors will also find my detailed comments/questions below.
Best regards,
Fabrizio Magrini
Abstract: This is too general. It does not display any highlights on the results presented in the paper
Introduction: In general, two different strategies exist to "convert" surfacewave dispersion curves to a 3D Vs model. As explained in the manuscript, the first (i) involves the calculation of phasevelocity maps, which are then converted to Vs by carrying out many 1D inversions. The second, instead, (ii) allows one to invert the dispersion curves directly for the 3D model. In both cases, the data kernels can be calculated (a) by assuming that the waves travel along the greatcircle path connecting a given station pair (raytheory) or (b) by accounting for ray bending (raytracing).
In this study, you carry out a comparison between strategies (a) and (b). Is there a reason why you don't contemplate strategies (i) and (ii)? This would hugely benefit both the paper and the seismological community. Moreover, from the introduction, it is not clear which strategy between (i) and (ii) you intend to focus on, and why.
I feel that the motivation for the study should be discussed more indepth, because the pros and cons of raytheory vs. ray tracing are well known (for reviews, see Rawlinson & Sambridge 2003, Rawlinson et al. 2010).
General Consideration: This study focuses on very shallow crustal structures; I wonder if the iterative nonlinearinversion scheme used to convert the dispersion curves to the 3D Vs model can be considered appropriate in this sense. Wouldn't a globally optimized algorithm be more suited to the solution of a (possibly highly) nonlinear problem such as that of this kind? Adding some consideration on the matter would be useful.
Method: The method, and the related assumptions, should be explained more clearly (see also the points below). In principle, since this is supposed to be a technical work, I believe this part should be highly detailed, so as to make your work reproducible. I suggest that you expand largely this section, and possibly subdivide it into several subsections. It would be good to explain (i) the calculation of dispersion curves, (ii) the raytheory vs. the raytracing algorithm (with the latter one meriting more consideration, especially if you implemented it on your own), (iii) the forward solver that allows you to measure predicted data from a given Vs model, and (iv) the inversion for Vs. In (iv), it would be good to say something about the calculation of sensitivity kernels at different periods. Other points are found below.
 One can only guess that the inversion strategy chosen by the authors involves the direct inversion of surfacewave velocity for the 3D structure.
 Synthetic tests to verify the accuracy of the raytracing algorithm should be presented. What is the relative error as a function of distance from the source based on a homogenous medium?
 When you refer to "straight lines", are you referring to greatcircle paths? If not, it should be explained that you designed your experiment in a cartesian coordinate system.
 Each vector and matrix in equations (2) and (3) should be thoroughly explained, and their dimensions be explicit. For example, is d_obs your slowness, or is it the arrival time obtained from slowness and raypath distance? How do you calculate the roughness operator Rp? Generally, the extent of the roughness is determined by a damping scalar coefficient (e.g., Boschi & Dziewonski 1999, Magrini et al. 2022), but you have the matrix C_{R_{p}}. Can you please be more explicit on its calculation? Can you also provide a reference for your equation (3), or alternatively a derivation for it? As it is, it appears different from, e.g., eqs. (51) and (57) of Rawlinson & Sambridge (2003).
 Is your stopping criterion compatible with previous studies?
Results: An important point that does not seem to be discussed is the choice of damping in the two different inversions. Slightly different values of damping can produce slightly different results. I believe it would be important to discuss in some depth this choice, and to demonstrate, to some degree, that the result of your comparisons is not biased by improper use of regularization. (Note that a given value of roughness damping might be ideal for the raytheory case but not for the raytracing case, and vice versa).
 For example, consider Fig 10: I have the feeling that the large differences between the two inversions might derive from the choice of the roughness damping (the straightray tomography seems slightly underdamped). Have the authors experimented with different values?
 Are your synthetic data generated with the mentioned 3D finitedifference code only in the Case study 1 or also in the Case study 2? Eventually, a brief explanation of this code could go in the Method section.
 I am struggling to understand the meaning of the red arrows/letters in Fig. 8, and the caption is not helping me. Probably I am missing something simple, but this suggests to me that the explanation in the caption should be extended or made clearer.
Discussion: The relative misfit in equation (6) is a function of fw(m_final). May you please be more explicit on the forward calculation of your dispersion curves (predicted data) based on a given model? Did you use SOFI3D?
Minor
 In the introduction, you refer to Boschi and Dziewonski (1999) while speaking of seismic ambient noise. Clearly, in 1999 ambientnoise tomography did not exist
References
Nicholas Rawlinson and Malcolm Sambridge. Seismic traveltime tomography of the crust and
lithosphere. Advances in geophysics, 46:81–199, 2003.Nicholas Rawlinson, S Pozgay, and S Fishwick. Seismic tomography: a window into deep earth.
Physics of the Earth and Planetary Interiors, 178(34):101–135, 2010.Fabrizio Magrini, Sebastian Lauro, Emanuel Kästle, Lapo Boschi. Surfacewave tomography using SeisLib: a Python package for multiscale seismic imaging, Geophysical Journal International, ggac236, https://doi.org/10.1093/gji/ggac236, 2022

AC3: 'Reply on RC4', Mohammadkarim Karimpour, 03 Aug 2022
Thank you for your suggestions. We have added more details, particularly in the Method Section, to allow the readers to fully understand the applied methodology and increase the reproducibility of the work.
We have copied your comments below and assigned a number to each of them. We have provided a response for every comment.
Comment#1: Abstract: This is too general. It does not display any highlights on the results presented in the paper
Response#1: As also suggested by Reviewer#2, we have modified the Abstract and better highlighted the key results of the study.
Comment#2: Introduction: In general, two different strategies exist to "convert" surfacewave dispersion curves to a 3D Vs model. As explained in the manuscript, the first (i) involves the calculation of phasevelocity maps, which are then converted to Vs by carrying out many 1D inversions. The second, instead, (ii) allows one to invert the dispersion curves directly for the 3D model. In both cases, the data kernels can be calculated (a) by assuming that the waves travel along the greatcircle path connecting a given station pair (raytheory) or (b) by accounting for ray bending (raytracing).
In this study, you carry out a comparison between strategies (a) and (b). Is there a reason why you don't contemplate strategies (i) and (ii)? This would hugely benefit both the paper and the seismological community. Moreover, from the introduction, it is not clear which strategy between (i) and (ii) you intend to focus on, and why.
Response#2: As you mentioned, this study focuses on the comparison between strategies (a) and (b). The comparison between strategies (i) and (ii) is definitely very interesting and can be a topic for a separate study because it is out of the scope of this work. We have reconstructed 3D VS models by direct inversion of DCs. We have clarified it in lines 7980 of the revised version of the manuscript as:
“For each dataset, 3D VS models from straight and curvedray SWT are obtained by direct inversion of DCs, …”
Comment#3: I feel that the motivation for the study should be discussed more indepth, because the pros and cons of raytheory vs. ray tracing are well known (for reviews, see Rawlinson & Sambridge 2003, Rawlinson et al. 2010).
Response#3: We have rewritten some paragraphs of the Introduction for clarification. The main purpose of this study is to evaluate the performance of straightray and curvedray SWT at the nearsurface scale. As stated in lines 5253 of the revised version of the manuscript, despite seismological studies, such comparison is missing at the nearsurface scale. We have also compared our results with previous seismological studies in Section 4.5 of the revised version of the manuscript, and showed that the previous findings in seismology might not be valid at the nearsurface scale. Moreover, we have explained the applied criteria to optimize the shot positions for a SWT study using active seismic data at the nearsurface scale. This is a key difference between the available seismological studies and nearsurface studies using active seismic data. We show the importance of the shot optimization in the obtained dispersion curves coverage and consequently the obtained VS models from the inversions in the Results Section.
Comment#4: General Consideration: This study focuses on very shallow crustal structures; I wonder if the iterative nonlinearinversion scheme used to convert the dispersion curves to the 3D Vs model can be considered appropriate in this sense. Wouldn't a globally optimized algorithm be more suited to the solution of a (possibly highly) nonlinear problem such as that of this kind? Adding some consideration on the matter would be useful.
Response#4: We had stated in lines 5961 of the original version that:
“The computational efficiency is of great importance in seismic nearsurface since, compared to seismological studies, the abundance of data at active seismic nearsurface projects can increase the computational cost significantly.”
The computational cost is a key factor for professionals particularly in the nearsurface studies with high amount of data. We had shown in Table 3 of the original version, the cost of the SWT using the employed deterministic approach which can be as much of 80 $. This number can increase drastically by using stochastic methods.
Moreover, we have shown in Table 4 of the revised version of the manuscript that the final VS model obtained from the iterative nonlinear inversion can be quite accurate even though the inversions started from homogeneous initial models.
Comment#5: Method: The method, and the related assumptions, should be explained more clearly (see also the points below). In principle, since this is supposed to be a technical work, I believe this part should be highly detailed, so as to make your work reproducible. I suggest that you expand largely this section, and possibly subdivide it into several subsections. It would be good to explain (i) the calculation of dispersion curves, (ii) the raytheory vs. the raytracing algorithm (with the latter one meriting more consideration, especially if you implemented it on your own), (iii) the forward solver that allows you to measure predicted data from a given Vs model, and (iv) the inversion for Vs. In (iv), it would be good to say something about the calculation of sensitivity kernels at different periods. Other points are found below.
Response#5: Following this comment, we have largely expanded the Method Section in the revised version of the manuscript and added way more details and equations. We have also divided this section into:
2.1 Optimisation of source layout
2.2 Estimation of DCs
2.3 1D forward modelling
2.4 Computation of forward response
2.5 Inversion algorithm
We have supported each subsection by related explanations and Equations.
Comment#6:  One can only guess that the inversion strategy chosen by the authors involves the direct inversion of surfacewave velocity for the 3D structure.
Response#6: We have clarified it in lines 7980 of the revised version of the manuscript:
“For each dataset, 3D VS models from straight and curvedray SWT are obtained by direct inversion of DCs, …”
Comment#7:  Synthetic tests to verify the accuracy of the raytracing algorithm should be presented. What is the relative error as a function of distance from the source based on a homogenous medium?
Response#7: To test the accuracy of the ray tracing algorithm, we used a homogeneous medium (the Blocky model with constant VS equal to 200 m/s in the whole medium) and computed the ray paths. As an example, we show the computed paths for a DC where the receivers are located at (19 m, 5 m) and (10 m, 5 m) in the figure below:
Figure 1. The computed ray paths for different frequency component of the DC with receivers’ positions at (19 m, 5 m) and (10 m, 5 m).
The average relative error of the paths (i.e., deviation from straightline) is almost zero (7.2e16). We have not shown this figure in the revised manuscript. Nonetheless, we have added the following clarification in lines 129130 of the revised version of the manuscript:
“To evaluate the accuracy of the raytracing algorithm, we have applied it in a to a homogeneous media and noticed that the error (i.e., deviation from straightline) in this condition is almost zero (not shown here).”
Comment#8:  When you refer to "straight lines", are you referring to greatcircle paths? If not, it should be explained that you designed your experiment in a cartesian coordinate system.
Response#8: We have added the following clarification sentence in line 118 of the revised version of the manuscript:
“We carry out our experiments in a Cartesian coordinate system.”
Comment#9:  Each vector and matrix in equations (2) and (3) should be thoroughly explained, and their dimensions be explicit. For example, is d_obs your slowness, or is it the arrival time obtained from slowness and raypath distance? How do you calculate the roughness operator Rp? Generally, the extent of the roughness is determined by a damping scalar coefficient (e.g., Boschi & Dziewonski 1999, Magrini et al. 2022), but you have the matrix C_{R_{p}}. Can you please be more explicit on its calculation? Can you also provide a reference for your equation (3), or alternatively a derivation for it? As it is, it appears different from, e.g., eqs. (51) and (57) of Rawlinson & Sambridge (2003).
Response#9: We have added the reference for Eq. (3) (equivalent of Eq. 11 in the revised version of the manuscript), which is Boiero (2009).
We have extended the Method Section largely and added way more details in this section. We explained in line 112 of the revised version of the manuscript that the dimension of the vector of the experimental data (d_obs) is (phase) velocity. We have also clarified in lines 160162 of the revised version of the manuscript that we have assigned a large value (10^{6}) to the covariance matrix of the spatial regularization matrix:
“To reduce the impact of spatial regularization on the inversion results, in all four examples in this study, a large value (10^{6}) is assigned to C_{Rp}. It means that the VS difference between the neighbouring cells is constrained to 1000 m/s.”
Comment#10:  Is your stopping criterion compatible with previous studies?
Response#10: In some surface waves studies, the inversion stops when the misfit function reduces less than 1% with respect to the value at the previous iteration (e.g., Garofalo et al., 2015). Since our computational facilities have been improved compared to before, we have defined a lower threshold in our study (0.01%) to make sure that the inversion reaches a local minima. As can be seen in Fig. 6b of the revised version of the manuscript, the misfit value of straightray inversion shows a sudden decrease at iteration 27, while the inversion would have stopped at iteration 26 if a higher threshold had been chosen as the stopping criterion.
Comment#11: Results: An important point that does not seem to be discussed is the choice of damping in the two different inversions. Slightly different values of damping can produce slightly different results. I believe it would be important to discuss in some depth this choice, and to demonstrate, to some degree, that the result of your comparisons is not biased by improper use of regularization. (Note that a given value of roughness damping might be ideal for the raytheory case but not for the raytracing case, and vice versa).
Response#11: We have clarified the choice of regularization 160162 of the revised version of the manuscript. As pointed out by Trampert and Spetzler (2006), the choice of regularization has a major impact on SWT results. Therefore, we have assigned a very large number (10^{6}) to the regularization values so that the final VS model is not biased by the regularization values and the comparison between straightray and curvedray methods are fair.
Comment#12:  For example, consider Fig 10: I have the feeling that the large differences between the two inversions might derive from the choice of the roughness damping (the straightray tomography seems slightly underdamped). Have the authors experimented with different values?
Response#12: We have provided a detailed response in the response to the comments of Reviewer#1. The observed difference is due to the ‘wrong’ ray paths in the straightray approach. At the edges of the velocity anomaly, the assumed paths by the straightray are shorter than the true paths and therefore the velocities are (wrongly) high.
All experiments have been done with the same regularization values. As stated in lines 160162 of the revised version of the manuscript.
Comment#13:  Are your synthetic data generated with the mentioned 3D finitedifference code only in the Case study 1 or also in the Case study 2? Eventually, a brief explanation of this code could go in the Method section.
Response#13: The same 3D finite difference code has been used to generate the synthetic data in both case studies 1 and 2. We have added the following sentence in line 238239 of the revised version of the manuscript for clarification:
“The same finite difference code used for the Blocky model was used to obtain the Sand Bar synthetic dataset …”
We have added a brief explanation of this code (SOFI3D) in lines 197201 of the revised version of the manuscript as:
“The code is an FD modelling program based on the FD approach described by Virieux (1986) and Levander (1988) with some extensions. It can consider viscoelastic wave propagation effects such as attenuation and dispersion, employ higher order FD operators, 200 apply perfectly matched layer (PML) boundary conditions at the edges of the model, and it works in message passing interface (MPI) parallel environment which reduces the running time of the simulations”
Comment#14:  I am struggling to understand the meaning of the red arrows/letters in Fig. 8, and the caption is not helping me. Probably I am missing something simple, but this suggests to me that the explanation in the caption should be extended or made clearer.
Response#14: Since Reviewer#1 had the same struggle, we have removed the red arrows from the reconstructed VS models from the inversion (Figures 4, 7, 10, and 13 of the revised version of the manuscript).
Comment#15: Discussion: The relative misfit in equation (6) is a function of fw(m_final). May you please be more explicit on the forward calculation of your dispersion curves (predicted data) based on a given model? Did you use SOFI3D?
Response#15: No, we have not used SOFI3D for this purpose. Following your previous comments on the Method Section, the process for the forward calculation of the dispersion curves have been thoroughly explained in Section 2.4 of the revised version of the manuscript.
Comment#16: In the introduction, you refer to Boschi and Dziewonski (1999) while speaking of seismic ambient noise. Clearly, in 1999 ambientnoise tomography did not exist
Response#16: We have put it its right place, that is at line 24 of the revised version of the manuscript.

AC3: 'Reply on RC4', Mohammadkarim Karimpour, 03 Aug 2022
Peer review completion
Journal article(s) based on this preprint
Mohammadkarim Karimpour et al.
Mohammadkarim Karimpour et al.
Viewed
HTML  XML  Total  BibTeX  EndNote  

247  94  16  357  5  4 
 HTML: 247
 PDF: 94
 XML: 16
 Total: 357
 BibTeX: 5
 EndNote: 4
Viewed (geographical distribution)
Country  #  Views  % 

Total:  0 
HTML:  0 
PDF:  0 
XML:  0 
 1
The requested preprint has a corresponding peerreviewed final revised paper. You are encouraged to refer to the final revised version.
 Preprint
(2598 KB)  Metadata XML