Preprints
https://doi.org/10.48550/arXiv.2508.11307
https://doi.org/10.48550/arXiv.2508.11307
06 Jan 2026
 | 06 Jan 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Approximating the universal thermal climate index using sparse regression with orthogonal polynomials

Sabin Roman, Gregor Skok, Ljupčo Todorovski, and Sašo Džeroski

Abstract. The Universal Thermal Climate Index (UTCI) is a measure of thermal comfort that quantifies how humans experience environmental conditions. Due to its robustness and versatility as a bioclimatic indicator, it has been extensively employed across a wide range of studies in bioclimatology and is increasingly used as an operational measure of outdoor thermal comfort. At the same time, calculating the UTCI value from the relevant environmental parameters is nominally not straightforward, which is why using a 6th-degree polynomial approximation has become the standard way to calculate UTCI values. At the same time, although it is computationally efficient, the error of this polynomial approximation can be substantial. The goal of this study was to develop an improved version of the polynomial approximation – one that retains comparable computational efficiency but is more robust in terms of numerical stability and substantially more accurate, particularly in reducing the frequency of larger errors. This goal was successfully achieved using sparse orthogonal regression, namely sparse regression with an orthogonal polynomial basis, which not only substantially reduces the average errors (i.e., the mean error, the mean absolute error, and the root mean square error) but also drastically reduces the frequency of large errors. By leveraging Legendre polynomial bases, approximation models could be constructed that efficiently populate a Pareto front of accuracy versus complexity and exhibit stable, hierarchical coefficient structures across varying model capacities. Training the new approximation models over only 20 % of the data, with the testing performed over the remaining 80 %, highlights successful generalization, with the results also being robust under bootstrapping. The decomposition effectively approximates the UTCI as a Fourier-like expansion in an orthogonal basis, yielding results near the theoretical optimum in the L2 (least squares) sense.

Share
Sabin Roman, Gregor Skok, Ljupčo Todorovski, and Sašo Džeroski

Status: open (until 03 Mar 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Sabin Roman, Gregor Skok, Ljupčo Todorovski, and Sašo Džeroski

Model code and software

Code for Approximating the universal thermal climate index (UTCI) using sparse regression with orthogonal polynomials Sabin Roman https://zenodo.org/records/17465548

Sabin Roman, Gregor Skok, Ljupčo Todorovski, and Sašo Džeroski

Viewed

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 43 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
43 0 0 43 0 0
  • HTML: 43
  • PDF: 0
  • XML: 0
  • Total: 43
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 06 Jan 2026)
Cumulative views and downloads (calculated since 06 Jan 2026)

Viewed (geographical distribution)

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 24 (including HTML, PDF, and XML) Thereof 24 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 09 Jan 2026
Download
Short summary
This study aimed to improve how the Universal Thermal Climate Index, a key measure of human thermal comfort, is calculated. Existing methods use a simplified polynomial approximation that is straightforward to apply but can introduce errors. We developed a new version using sparse regression with orthogonal polynomials, which keeps computational efficiency while improving accuracy and stability. The results enable more reliable assessments of outdoor thermal comfort and climate analyses.
Share