Preprints
https://doi.org/10.5194/egusphere-2026-1960
https://doi.org/10.5194/egusphere-2026-1960
20 May 2026
 | 20 May 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

GeoGen3D 1.0: An LMM-Based Reasoning Agent Framework for 3D Geological Model Generation

Jiateng Guo, Junkun Li, Mark Jessell, Zhibin Liu, Luyuan Wang, and Xulei Wang

Abstract. 3D Geological models provide conceptual and specific frameworks for a range of theoretical and applied geoscience activities, from theoretical research to the search for new resources. In many scenarios, the scarcity of data is common, and researchers must infer corresponding 3D geological structures based only on textual descriptions or outcrop images, for which standard 3D modelling approaches are poorly suited. Although current generative artificial intelligence can already generate pictures, videos, and 3D object models as required, there is still no feasible method to directly convert geologists' ideas or real photos of rock outcrops into 3D geological models. Here we present GeoGen3D, an intelligent Agent for text-image multimodal-driven 3D geological modeling. (1) Based on an improved ReAct agent framework, and by constructing a comprehensive collection of Noddy-based agent tools, we leverage the deep text and image understanding capabilities of large multimodal models (LMMs) to enable intelligent generation of 3D geological models from textual or visual inputs. (2) We introduce MMGM-Eval, a multimodal 3D geological model generation benchmark, to systematically evaluate the ability of LMMs to generate geological models from multimodal prompts. Our analyses demonstrate that GeoGen3D significantly outperforms direct prompt engineering approaches combining LMMs on the MMGM-Eval benchmark. GeoGen3D thus provides an efficient and intelligent modeling paradigm for multimodal-driven 3D geological model generation, especially suitable for scenarios lacking sufficient data.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jiateng Guo, Junkun Li, Mark Jessell, Zhibin Liu, Luyuan Wang, and Xulei Wang

Status: open (until 16 Jul 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jiateng Guo, Junkun Li, Mark Jessell, Zhibin Liu, Luyuan Wang, and Xulei Wang

Data sets

Data Sets Jiateng Guo and Junkun Li https://doi.org/10.5281/zenodo.19493634

Model code and software

Source Code Jiateng Guo and Junkun Li https://doi.org/10.5281/zenodo.19493634

Video supplement

Video Demo Jiateng Guo and Junkun Li https://doi.org/10.5281/zenodo.19493634

Jiateng Guo, Junkun Li, Mark Jessell, Zhibin Liu, Luyuan Wang, and Xulei Wang

Viewed

Total article views: 211 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
148 51 12 211 17 8 10
  • HTML: 148
  • PDF: 51
  • XML: 12
  • Total: 211
  • Supplement: 17
  • BibTeX: 8
  • EndNote: 10
Views and downloads (calculated since 20 May 2026)
Cumulative views and downloads (calculated since 20 May 2026)

Viewed (geographical distribution)

Total article views: 209 (including HTML, PDF, and XML) Thereof 209 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 08 Jun 2026
Download
Short summary
This study proposes a method for generating three-dimensional geological models from text and rock outcrop images. Through multiple experimental cases, it has been proved that by using this method, a three-dimensional geological model corresponding to the geological structure can be generated based on the language descriptions of geologists and the actual rock images. This method enables geologists to directly develop geological structural hypotheses from a three-dimensional perspective.
Share