Preprints
https://doi.org/10.5194/egusphere-2026-1944
https://doi.org/10.5194/egusphere-2026-1944
20 Apr 2026
 | 20 Apr 2026
Status: this preprint is open for discussion and under review for SOIL (SOIL).

Use of Spatial Embeddings in Genosoil Identification

Julio César Pachón Maldonado, José Padarian, Quentin Styc, and Alex McBratney

Abstract. Genosoils are minimally disturbed reference states within pedogenons, that is, soil units shaped by similar pedogenic processes within the Soil Security framework. They are central to assessing human impacts on soil functions, services, and resistance to threats. At present, genosoil delineation relies on the Human Modification Index (HMI), yet in intensively managed landscapes HMI thresholds may exclude all local pixels, leaving no local reference state available. Because the same pedogenon may occur across geographically distant regions, non-local occurrences may provide an alternative source of reference information. Using the United Kingdom as a case study, we tested whether satellite-derived spatial embeddings can detect genosoil signatures at 10 m resolution and whether these signatures can be transferred to regions with limited or absent local low-human-modification examples. We evaluated two satellite foundation-model embedding products, AlphaEarth and Tessera, across three contrasting pedogenons selected from the Global Pedogenon Map. Within each pedogenon, pixels with lower HMI values were generally more similar to the genosoil reference, indicating that the embeddings capture a reproducible low-modification surface-state signal. At the global scale, similarity to the UK genosoil was largely confined to biogeographically coherent regions. Cross-border substitution of local UK genosoil delineation was mostly limited, with meaningful partial recovery observed primarily in the highly modified agricultural pedogenon. These results indicate that satellite foundation-model embeddings can support higher-resolution genosoil delineation than is currently possible from global human modification products alone, extending the operational framework from 90 m to 10 m. They also suggest a pathway towards future genosoil identification frameworks that rely less on coarse disturbance proxies and more on validated surface-state similarity.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Julio César Pachón Maldonado, José Padarian, Quentin Styc, and Alex McBratney

Status: open (until 01 Jun 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Julio César Pachón Maldonado, José Padarian, Quentin Styc, and Alex McBratney
Metrics will be available soon.
Latest update: 20 Apr 2026
Download
Short summary
Identifying local soil references for research in heavily modified landscapes is challenging. We evaluated whether AI satellite models at 10 m resolution to detect undisturbed soils and apply the findings internationally. Pixels with less human disturbance aligned better with undisturbed references. Cross-border transfer was partly successful. Results indicate AI models may help find reference soils where local examples are scarce.
Share