the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
GC Insights: Breaking the silos – leveraging NLP to encourage interdisciplinary interaction at the EGU
Abstract. Thousands of abstracts from various geoscience sub-fields are presented annually at the EGU General Assembly (GA), offering a rich resource for tracking scientific progress. However, rigid session groupings can limit cross-disciplinary exploration. Here, we show that participants focusing only on their broad disciplinary session miss an average of 44 % of the 10 most relevant contributions. To break this compartmentalization, we propose using natural language processing (NLP), enabling the geoscience community to explore the full breadth of knowledge beyond traditional disciplinary boundaries.
- Preprint
(1286 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 12 Feb 2025)
-
RC1: 'Comment on egusphere-2024-3430', Lina Stein, 17 Jan 2025
reply
The authors present an analysis of all abstracts submitted to the EGU General Assembly in the last five years. The abstracts are clustered according to semantic similarity of the abstract text. For each scientific division, the distribution of abstracts across the cluster space is evaluated to demonstrate that similar research is submitted across different scientific divisions within the EGU GA. The authors claim, that since attendees primarily attend sessions from their own scientific division, this means attendees might miss relevant research (bubble effect). I found the overview of abstract similarity across scientific divisions very informative. It demonstrates that the scientific divisions have more overlap than the strict separation necessary for conference organization might suggest.
Additionally, the authors propose a webtool, which can recommend EGU GA submissions based on similarity to a search string. I have actually used the prototype during the 2024 EGU and found it immensely helpful.
Overall, I find this article timely, relevant, and well-written. I only have a few minor suggestions for improvement.
Specific comments:
Your main assumption is that people will not attend presentations of research similar to their own submission due to it being in another scientific division. E.g. Line 10 “We hypothesize that this compartmentalization may inadvertently create knowledge silos, as EGU GA attendants tend to focus on their own scientific divisions, potentially missing relevant developments from other disciplines.” From my own (probably biased) experience I would say that cross-division attendance does take place a lot during EGU. And the EGU website does already offer a search tool that allows people to find abstracts across scientific divisions according to keyword searches. However, text similarity offers a much better tool to identify presentations of interest. Simple keyword searches often still produce a large number of results. While it is, of course, impossible to quantify which scientific division sessions are attended by whom during the GA, it would help your claim to offer some more numbers to demonstrate the infeasibility of manually checking all sections for relevant contributions. How many abstracts are submitted per section? How many sessions are registered per section?
On thing I found irritating is that the 22 scientific divisions or scientific sections are referred to as sessions. There are multiple sessions organised within each scientific division. But the authors split their data into scientific divisions and called them session. That the selection on the EGU GA website offers “Disciplinary sessions” is not a name, but means that they are the session related to one specific division.
Please clarify in the abstract and text what you mean by participant exploration or participant focus. It should be stated early on and more clearly, that you refer to choice of session/division attendance during the GA. (And not, for example, choice of session during the abstract submission process).
Lastly, it would be nice to add the number of people who have used the webtool to find relevant sessions during EGU24.
Citation: https://doi.org/10.5194/egusphere-2024-3430-RC1 -
AC1: 'Reply on RC1', Jan Sodoge, 27 Jan 2025
reply
Thanks for your positive feedback. Below we provide answers to your comments and how we will address these in the updated manuscript.
Comment: Your main assumption is that people will not attend presentations of research similar to their own submission due to it being in another scientific division. E.g. Line 10 “We hypothesize that this compartmentalization may inadvertently create knowledge silos, as EGU GA attendants tend to focus on their own scientific divisions, potentially missing relevant developments from other disciplines.” From my own (probably biased) experience I would say that cross-division attendance does take place a lot during EGU. And the EGU website does already offer a search tool that allows people to find abstracts across scientific divisions according to keyword searches. However, text similarity offers a much better tool to identify presentations of interest. Simple keyword searches often still produce a large number of results. While it is, of course, impossible to quantify which scientific division sessions are attended by whom during the GA, it would help your claim to offer some more numbers to demonstrate the infeasibility of manually checking all sections for relevant contributions. How many abstracts are submitted per section? How many sessions are registered per section?
Answer: This is a good suggestion. Indeed, as you point out our initial assumption is very stylized- which might not reflect reality at EGU GA. In the updated manuscript, we added to this hypothesis that it is likely not reflecting reality yet that such simplification is necessary for our assessment. This is a limitation of our study that requires clearer acknowledgment. In fact, for the next EGU, we plan to survey individual users' attributes (e.g., which division(s) does one belong to) to assess the quality of recommendations and select the most appropriate model. Regarding the current manuscript, we agree and will implement your suggestion of adding some numbers on the volume of abstracts to emphasize the 'infeasibility of manually checking'.
Comment: On thing I found irritating is that the 22 scientific divisions or scientific sections are referred to as sessions. There are multiple sessions organised within each scientific division. But the authors split their data into scientific divisions and called them session. That the selection on the EGU GA website offers “Disciplinary sessions” is not a name, but means that they are the session related to one specific division.
Please clarify in the abstract and text what you mean by participant exploration or participant focus. It should be stated early on and more clearly, that you refer to choice of session/division attendance during the GA. (And not, for example, choice of session during the abstract submission process).Answer: We agree with your criticism of the unclear terminology. We used the disciplinary sessions throughout the manuscript as it is the name on the GA website but understand that "scientific division" is a much clearer terminology. As such, we revised the entire manuscript and clarified in the text that we use the term "scientific divisions" to refer to what sometimes is called disciplinary sessions.
Comment: Lastly, it would be nice to add the number of people who have used the webtool to find relevant sessions during EGU24.
Answer: Sadly, we have no clear evidence of the number of users based on the platform and subscription we used for hosting this app (shinyapps.io). The only metric we could quantify is 45 hours of usage time. However, this does not correspond to a particular number of users. For the upcoming EGU25 though, we plan to quantify such metrics more in-depth by hosting the application via our computational resources (see also the first comment on our ambitions to quantify more aspects to improve the evaluation).
On behalf of the authors,
Jan SodogeCitation: https://doi.org/10.5194/egusphere-2024-3430-AC1
-
AC1: 'Reply on RC1', Jan Sodoge, 27 Jan 2025
reply
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
280 | 102 | 7 | 389 | 4 | 4 |
- HTML: 280
- PDF: 102
- XML: 7
- Total: 389
- BibTeX: 4
- EndNote: 4
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1