Preprints
https://doi.org/10.5194/egusphere-2026-2041
https://doi.org/10.5194/egusphere-2026-2041
10 Jun 2026
 | 10 Jun 2026
Status: this preprint is open for discussion and under review for Geoscientific Instrumentation, Methods and Data Systems (GI).

Bitfields encode what data knows about itself

Steffen Ehrmann

Abstract. Every observation a scientist produces carries implicit computational context, from uncertainty estimates and distribution parameters to quality flags and algorithmic decisions. Yet, in most cases, none of this travels when data crosses workflow boundaries. What arrives is typically a mean value. Everything else remains trapped in supplementary materials or institutional memory. Bitfields offer a direct solution. They are compact binary encodings that pack observation-level computational context into standard integer arrays, travel inside existing data formats, and decode on the receiving end without external dependencies. This is demonstrated through three linked workflows spanning livestock density modeling, ecological carrying capacity assessment, and socioeconomic intervention planning, where each project decodes its predecessor's bitfield and enriches it with new information. Statistical distributions, uncertainty metrics, and processing decisions accumulate across project boundaries rather than fragmenting at each handoff. The bitfield R package and a community-driven protocol repository provide the practical tooling. Because encoding integrates directly into analytical code, adoption requires no separate documentation step. Computational context is captured as a byproduct of the analysis itself.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Steffen Ehrmann

Status: open (until 16 Jul 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Steffen Ehrmann
Steffen Ehrmann
Metrics will be available soon.
Latest update: 10 Jun 2026
Download
Short summary
Scientific observations carry context such as uncertainty, quality assessments and modeling decisions, yet usually merely average values survive when data moves between research teams. This paper presents a method for packing that context into compact binary codes inside standard data formats. Three linked projects show network effects when information accumulates across independent teams. The approach merges with existing analyses and works despite current scientific incentive structures.
Share