1 Introduction
Permafrost contains between 677 and 949 of soil organic carbon (SOC) in the upper few metres, roughly twice as much carbon as the atmosphere . As permafrost thaws with increased temperature, SOC becomes available for microbial decomposition, resulting in the release of large amounts of greenhouse gases into the atmosphere, which, in turn, increase surface temperatures. This permafrost–carbon feedback will likely accelerate climate change; however, the precise magnitude and timing of these emissions and their subsequent impact on the global climate system remain uncertain .
A key aspect of this uncertainty is the complex quantification of the rate and extent of permafrost thaw. Predicting how the permafrost thermal regime will respond to ongoing climate change is particularly challenging, given its high sensitivity to surface properties . Among these, snow cover acts as an important moderator by directly influencing surface energy fluxes between the air and the soil. Functioning as a thermal insulator, snow cover can limit heat loss from the ground during winter , but its insulating properties are highly variable and insufficiently detailed in Earth system models .
The insulating efficiency of snow cover increases with thickness, reaching its peak insulation capacity at around 25 in depth depending on the (micro)structure and stratigraphy of the snowpack. As denser snow has fewer air voids, resulting in fewer insulating air pockets, thermal conductivity also tends to increase with density . As a result, heat is transferred more efficiently through a dense snow matrix. Snowpack in Arctic tundra environments typically consists of two main parts: depth hoar and wind slab . Depth hoar forms towards the base of the snowpack due to strong vertical temperature gradients and water vapour fluxes. Wind slab forms due to snow compaction from the strong Arctic wind transport and deposition. Depth hoar crystals have large, faceted, and often cup-shaped grains with low density, making them poor heat conductors, while wind slab layers have higher density, resulting in better heat conductivity and decreased insulation properties.
Studies show that state-of-the-art land surface models (LSMs) and snowpack models, including the Community Land Model
The insulating capacity of a snowpack is determined by the snow thermal conductivity: a critical parameter influencing heat exchange between the soil and atmosphere. Previous studies have highlighted the high sensitivity of LSM soil temperature simulations to this parameter , identifying it as a significant source of uncertainty . In models, it is expressed as the effective snow thermal conductivity , which aims to account for all heat-transfer processes in a single vertical dimension. Snow exhibits a low , generally falling within the range of 0.01–0.7 ; tundra snowpacks typically display values toward the lower end of this range . Numerous studies describe empirical relationships between and snow density based on experiments made in laboratories on different snowpacks around the world. Among them, derived a regression equation relating density and thermal conductivity based on 488 measurements of pan-Arctic and Antarctic seasonal snow:
1 where is the snow density (in ). The Sturm equation stands out due to its notably lower compared to other relationships based on non-Arctic snowpacks (Fig. ), particularly within the range of typical Arctic tundra snowpack densities, 150 to 300 .
demonstrate that the equation better fits their measurements in the Qarlikturvik Valley because it is specifically based on tundra snow characteristics. In contrast, equations commonly used by many LSMs (e.g. the equation in ORCHIDEE , the equation in CLASSIC1.0 , the equation in ISBA and JULES , and the equation in CLM5 and ELMv0 ) are more adapted to alpine conditions and may not accurately represent pan-Arctic environments. conducted a sensitivity experiment involving five modified settings in Crocus, one of which incorporated the equation. Their assessment demonstrated only slight improvements in soil temperature; however, it is difficult to isolate the specific impact of the equation in their study amongst the other modified parameters. Conversely, conducted a comparative analysis of different snow thermal conductivity schemes with CLM5.0 using in situ measurements from Trail Valley Creek, Northwest Territories, Canada, and found that the CLM5.0 default scheme overestimates snow thermal conductivity by a factor of 3 compared to observations, consequently inducing a cold bias in the wintertime soil temperature simulations. When replacing the default scheme with the formulation proposed by , significant improvements were observed in wintertime soil temperature simulations. In addition, and studied the effects of integrating the equation into the LSMs CLASS and ELM , respectively, further underscoring the significant sensitivity of soil temperatures to snow thermal conductivity. Moreover, demonstrate that the scheme effectively mitigates winter soil temperature biases.
Our study aims to extend the assessment to evaluate the applicability of the scheme in CLM5.0 across a broader regional climatological context. We hypothesise that a modification to the CLM5.0 snow thermal conductivity scheme will more effectively capture the sensitivity inherent in Arctic tundra snow, thereby restoring a more accurate thermal insulating function of the snowpack and improving the soil temperature and permafrost dynamics represented by the model. To realise this endeavour, we present a CLM5.0 sensitivity experiment using the snow thermal conductivity scheme and evaluate simulations using Arctic-wide in situ observations and remote sensing data for soil temperature. Additionally, we conduct a sensitivity analysis of snow density to test the robustness of our results for potentially lower bulk snow densities characteristic of tundra environments.
2 Methods and data2.1 Model description
This study uses the Community Land Model (CLM5.0), which is part of the Community Terrestrial Systems Model (CTSM;
2.1.1 Soil
The model soil stratigraphy includes 25 soil layers distributed geometrically, with thinner layers at shallower depths and larger layers at greater depths up to 50 . CLM5.0 has an increased soil layer resolution compared to CLM4.5, particularly in the upper 3 , to more accurately represent the active-layer thickness (ALT) in permafrost areas .
The heat-transfer equation
The model defines soil thermal and hydraulic conductivities using mineral soil parameterisations dependent on soil texture (sand, clay, and silt fractions) and organic matter density derived from . These fractions vary across the first 10 layers but remain constant in the subsequent 15 layers.
2.1.2 Snow
The snow module in CLM5.0, described in and , includes physical processes such as snow accumulation, compaction (due to overburden pressure and drifting snow), refreezing, melting, and sublimation. However, the snow module does not take into account water vapour flux through snow. The CLM5.0 snow module uses a multi-layer approach that discretises the snowpack into a maximum of 12 layers. Fresh-snow density is parameterised by combining a temperature term with a linear wind-dependent density term . Snow can densify via four distinct processes: compaction by overburden pressure, compaction by drifting snow, destructive metamorphism, or melt metamorphism. Furthermore, snow thermal conductivity is solely dependent on snow density and is calculated following the scheme by default:
2 where and are the thermal conductivity of air, 0.023 , and ice, 2.29 , respectively. Improvements to the CLM5.0 snow module have led to increased bulk snow density across most of the Arctic tundra compared to CLM4.5 .
2.2 Model set-up and experimentsThe version of CLM used throughout this study is ctsm5.1.dev086. The domain for this study is between 57 and 90° N and consists of 204 086 grid points with a triangular resolution that varies between 116.3 and 179.4 , giving a rectangular resolution of around 12 . This is a similar domain to that of , who used a coarser resolution.
Default CLM5.0 meteorological forcing data (CRU/GSWP3) are replaced with the finer 31 spatial resolution ERA5 forcings from 1980 to 2021 at an hourly time step. To our knowledge, this is the second time that CLM5.0 has been used with ERA5 forcings, after . While this increase in resolution should represent a substantial improvement over previous global reanalysis methods used , it also introduces additional uncertainty since the model was not parameterised with these settings as its default configuration. To start the run in an equilibrium state, a spin-up of 30 years using the ERA5 reanalysis (looping from 1980 to 1989 three times) was done before running the model from 1980 to 2021 (42 years).
To reduce computation time, this study uses the satellite phenology (SP) set-up, which does not include complex carbon cycle interactions and deactivates the land–ice and river routing models. In order to prevent unrealistically high values of snow heights observed in pan-Arctic non-glaciated islands, the snow initialisation protocol was recalibrated with the snow water equivalent (SWE) reverted to its original value of 0.8 , instead of 10 as was later proposed in .
We conducted two simulations: (1) the control run and (2) the Sturm run, where the conventional snow thermal conductivity scheme is replaced with the scheme (Eq. ). To assess the sensitivity of model outputs to snow density, additional simulations were performed using both the Sturm and Jordan thermal conductivity schemes, with adjustment factors of 0.9 and 0.7 applied to the snow density parameterisation to better represent the lower bulk snow densities characteristic of tundra environments. In CLM5.0, the snow density is computed as follows:
3 where af is the adjustment factor used in this sensitivity analysis, is the ice lens mass per unit area (in ), is the liquid water mass per unit area (in ), is the fractional snow-covered area, and is the snow layer depth (in ).
The choice of adjustment factors is based on observed snow density values in Arctic tundra regions. CLM5.0 simulates an average bulk snow density of 311 over our study domain (Fig. ), whereas observational studies indicate that tundra snow densities should be significantly lower. reported an average tundra bulk snow density of 225 using a large dataset of Arctic-wide snow sites, while depth hoar density measurements from multi-site (Derksen et al., 2014) and single-site studies (Woolley et al., 2024) both report values around 228 . To align model outputs with these observations, an af of 0.7 was chosen to represent the lower range of observed densities, yielding a modelled bulk snow density of 217 . Additionally, an af of 0.9 was selected as an intermediate value between the CLM5.0 simulated densities and the observed tundra densities.
The simulations were conducted exclusively for the 2006–2010 period, selected due to its robust observational data availability, to balance computational efficiency with model reliability. The four additional runs include (1) Sturm with af 0.9, (2) Jordan with af 0.9, (3) Sturm with af 0.7, and (4) Jordan with af 0.7. These sensitivity runs were compared to the baseline simulations (with af 1.0) as part of a broader analysis of snow density impacts on model performance.
2.3 Data for model evaluationThe Arctic tundra has long been recognised as a difficult region to study due to its inherent remoteness and the scarcity of observations . Accordingly, the lack of information on snow properties in Arctic tundra regions places a major limitation on permafrost and climate modelling . To address this challenge, this paper uses two observation datasets as constraints for the CLM5.0 outputs: one derived from remote sensing products and the other obtained through in situ measurements. Both datasets offer complementary perspectives, enabling a thorough integration and analysis of soil temperature assessment, including (1) temporal-scale variations covering seasonal and annual averages, (2) spatial distributions across a wide geographical area, and (3) depth variations throughout the entire soil column.
2.3.1 Remote sensing data
We use grid-based products from the European Space Agency (ESA) Climate Change Initiative (CCI) Essential Climate Variables (ECVs) product database from the CCI+ Permafrost project . ESA-CCI products encompass ECVs with a high spatial resolution of 1 and include mean annual ground temperature (MAGT) at distinct ground depths of 1, 5, and 10 ; permafrost fraction (PFR) – proportion of an area covered by permafrost within a grid point; and the ALT – the top layer of soil that thaws during the warm season and freezes during the colder months. Product validation is documented in , with further details on the methods available in . The geographical extent of these products spans the Northern Hemisphere above 30° N within an Arctic stereographic circumpolar projection. The temporal coverage for the MAGT, ALT, and PFR time series is from 1997 to 2019 at an annual resolution.
To compare CLM5.0 simulations to ESA-CCI products, we aggregated ESA-CCI products to the domain grid using a conservative second-order regridding equation described in . Following the definition of permafrost as ground that remains at or below 0 for at least 2 consecutive years, the presence or absence of permafrost (PFR) at each grid point within CLM5.0 is determined by
4 where is the number of years covered by ESA-CCI product (1997–2019), is the index for the soil depth, is the number of depths, is the index for the days in the year and the next year (), is the number of days in a year, and is the temperature depending on the day, depth, and grid cell. We first calculated the maximum temperature over a 2-year period for each grid cell and each layer. Then, we calculated the vertical soil temperature minimum to see if there is one continually frozen layer over these 2 years. From this, we obtained a temperature data grid for each year, which we then averaged over the period spanning 1997 to 2019 to match the duration of the ESA-CCI product period. Subsequently, we classified grid points into two categories: those with temperatures below 0 were designated permafrost, while those with temperatures above 0 were classified as non-permafrost. It is worth noting that this method provides a binary definition of permafrost, in contrast to the ESA-CCI classification, which offers a quantitative representation of permafrost ranging from 0 % to 100 % resulting from their ensemble-member experiments. To reconcile this difference, we adopted three permafrost classes for the ESA-CCI data: continuous if greater than 90 %, discontinuous if between 50 % and 90 %, and permafrost-free if less than 50 %.
To calculate ALT at each grid point within CLM5.0 for each year, a grid of maximum annual soil temperature was computed to identify the first thawed layer (above 0 ) from the basal layer. Subsequently, a spline curve was calculated using the layers above and below the first thawed layer to estimate the actual depth of transition between frozen and thawed soil layers. The resulting ALTs for both CLM5.0 and ESA-CCI were then averaged between 1997 and 2019.
To obtain the maps presented in the results section, we subtracted the ESA-CCI grid data from the CLM5.0 simulations for the MAGT, PFR, and ALT period-averaged products. In addition, we calculated the mean absolute deviation (MAD) and root-mean-square error (RMSE) for MAGT and ALT, where predicted values are the results from the model and observed values are the ESA-CCI products.
2.3.2 In situ soil temperaturesWe expanded upon the dataset used by using data from the Permafrost Laboratory website (
Figure 1
Locations of the 295 borehole stations used. The size of each point represents the number of data records per station over the whole period and for all depths. The datasets are sourced from the Permafrost Laboratory website, the GTN-P database, Nordicana D, and the Roshydromet network.
[Figure omitted. See PDF]
3 Results3.1 Snow insulation
The winter offset, as defined by , quantifies the difference between the mean soil temperature at 0.2 and the mean air temperature during the December to February period. This metric provides valuable insight into the snow insulation capacity and the transfer of heat from the air to the soil during the winter season as represented by an LSM.
Figure 2
Period-averaged (1980 to 2021) winter offset for the control run (a) and Sturm run (b), following the methodology.
[Figure omitted. See PDF]
The Sturm run demonstrates substantially higher snow insulation across most of the domain, notably in tundra regions, when compared to the control run (Fig. ). Offset values range from 20 to 35 over Siberia and 15 to 25 over Canada and Alaska for the Sturm run compared to 10 to 20 over most regions for the control run.
Figure 3
Variations in the winter offset according to snow depth between the control (a) and Sturm (b) runs calculated from the 295GT Russian site locations ( 178) and 41 individual winters (1981–2021), following a methodology similar to the model comparison undertaken by . Each box plot represents 5 snow depth bins, and colours indicate different air temperature regimes.
[Figure omitted. See PDF]
Following the methodology outlined by , Fig. illustrates the snow insulation effect between the control and Sturm runs across the 295GT Russian site locations ( 178), with colours representing various temperature regimes. The disparity in results between the runs is most notable in the cold-temperature regime (tundra regions), where the winter offset linearly increases up to 40 in snow depth and stabilises thereafter in the Sturm run. Conversely, the relationship between snow depth and winter offset is close to linear across all snow depths in the control run.
Figure 4
Period-averaged (1980–2021) soil temperature differences between the Sturm and control runs at 1 in depth for four seasons: (a) December, January, and February (DJF); (b) March, April, and May (MAM); (c) June, July, and August (JJA); and (d) September, October, and November (SON). Darker red indicates that the Sturm run is warmer than the control run. The grey mask represents glaciers. Hatched areas represent non-significant results compared to the time series ( values 0.95).
[Figure omitted. See PDF]
3.2 Soil temperatureOur initial hypothesis suggests that the cold bias in the control run is caused by the Jordan scheme's limitations in associating snow density with thermal conductivity under Arctic conditions, leading to higher-than-expected thermal conductivities that result in lower ground temperatures. To rectify this cold bias, we replaced the Jordan scheme with the Sturm scheme in the Sturm run, aiming to test whether this adjustment can improve the model's representation of ground temperature.
Figure 5
Period-averaged (1997 to 2019) MAGT at 1 depth, with the difference between CTSM and ESA-CCI in for the control run (a) and the Sturm run (b). Darker blue indicates that CTSM soil temperature is colder than ESA-CCI. ESA-CCI data are aggregated on the CTSM grid using a conservative second-order regridding method.
[Figure omitted. See PDF]
3.2.1 Comparison between the Sturm and control runsDuring DJF, a significant temperature increase is observed in the Sturm run when compared to the control run (Fig. ). In the Siberian permafrost region, temperatures increase by 4 to 10 , while in northern Canada and Alaska, they rise by up to 5 . In MAM, there is an increase of up to 3 found mostly over high-altitude areas across the whole domain and on the southwestern side of Hudson Bay. In JJA and SON, the increase in temperature is much less marked over the whole domain with an increase in temperature from 1 to 2 , except for mountainous areas and the western Hudson Bay. In general, we observed a substantial increase in soil temperature in DJF and MAM when snow cover is important. This outcome aligns with our hypothesis that the increased snow insulation in the Sturm run would result in higher DJF soil temperatures.
3.2.2 Comparison between the Sturm run and ESA-CCI
The evaluation of the 1 year-averaged soil temperature (Fig. ) compares results from the control and Sturm runs against the ESA-CCI dataset. The Sturm run significantly reduces the cold bias observed in the control run within tundra regions, including the West Siberian Plain, Central Siberian Plateau, Yakutsk Basin, Kolyma Lowland, and northern Canada. Similar improvements were observed at soil depths of 5 and 10 (not shown here). Most regions only have a small cold bias of up to 2 .
The MAD and the spread of the temperature (RMSE) show a noteworthy improvement, decreasing from 2.63 in the control run to 1.73 in the Sturm run for MAD and from 3.17 to 2.4 for RMSE. However, the RMSE values still remain high. This is probably linked to the pronounced warm bias observed over high-altitude areas (e.g. the Central Siberian Plateau, the Verkhoyansk Range, most of eastern Siberia, the northern regions of Baffin Island, and the Brooks Range), which was present in the control run but greatly amplified in the Sturm simulation.
Figure 6
Period-averaged (1980–2021) monthly soil temperature for the observations (black), control run (blue), and Sturm run (red) at 4 different depths: (a) 20 , (b) 80 , (c) 60 , and (d) 320 . Each of these represents an average of depth ranges as follows: 20 is 0–40 , 80 is 41–120 , 160 is 121–200 , and 320 is 201–440 . The shaded areas represent the standard deviation over all years. All values and skill scores (MAD, RMSE) come from an average of the 295 stations throughout the full period.
[Figure omitted. See PDF]
3.2.3 Comparison between the Sturm run and the 295GT datasetIn general, the control run captures the attenuation and delay of the seasonal cycle in soil temperature for period-averaged monthly soil temperatures (Fig. ) at various depth levels (20, 80, 160, and 320 ) reasonably well. However, it consistently exhibits a cold bias of a similar amplitude across all seasons and depths (MAD 3.23 and RMSE 3.32 for 20 ; MAD 4.35 and RMSE 4.35 for 320 ). The Sturm run effectively minimised the bias gap introduced by the control run, particularly during DJF and within the uppermost soil layers (MAD 1.76 and RMSE 1.93 for 20 ). Once the snow had melted out in JJA, the impact of our experiment on snow thermal conductivity decreased, as expected. The slight bias reduction that persists after snowmelt can be attributed to soil temperature memory. In addition, the improvement is less pronounced in deeper layers (MAD 2.55 and RMSE 2.57 for 320 ), as the properties of soil increasingly dominate snow insulation properties at depth. Furthermore, there is a notable positive bias of up to 2 observed in the top 20 soil layer during DJF. On average, the RMSE across the four soil layers decreases from 3.9 in the control run to 2.19 in the Sturm run.
Figure 7
Period-averaged (2006–2010) differences in monthly soil temperature RMSEs (Sturm minus Jordan) across the 295 stations. Each row represents a different depth (20, 80, 160, and 320 ), while each column represents the average of a different month. Each cell represents a different adjustment factor: 0.7 (top); 0.9 (middle); and no adjustment factor – default (bottom). Cells with positive MAD values in the Sturm run (overshoots) are marked with an asterisk (∗). Darker blue indicates improved RMSE scores in Sturm relative to Jordan.
[Figure omitted. See PDF]
3.3 Sensitivity analysis to snow densityThe sensitivity analysis to snow density shows that the Sturm parameterisation regularly yields lower RMSE values compared to those of Jordan (blue cells in Fig. ). This improvement is most pronounced during winter months (FMA) in deeper layers of soil. As snow density is reduced, the relative benefit of Sturm over Jordan diminishes, particularly in JFMA months at soil depths of 20 and 80 . However, the Sturm parameterisation leads to a lower soil temperature error for most months and depths. During summer months (without snow cover), the winter influence of the Sturm parameterisation continues, simulating a lower temperature error than that of Jordan, particularly in deeper soil layers.
Figure 8
The permafrost extent area mask difference between CTSM and ESA-CCI for the control run (a) and the Sturm run (b). ESA-CCI data are aggregated on the CTSM grid using a conservative second-order regridding method.
[Figure omitted. See PDF]
Figure 9
Active-layer thickness difference between CTSM and ESA-CCI (in ) for the control run (a) and the Sturm run (b). Darker red indicates that CTSM ALT is deeper than ESA-CCI. ESA-CCI data are aggregated on the CTSM grid using a conservative second-order regridding method. Only regions considered permafrost in the Sturm simulation are shown to facilitate comparison between the two simulations.
[Figure omitted. See PDF]
3.4 Permafrost extentThere is strong agreement between the control run and ESA-CCI permafrost extents, with 93 % of the two datasets overlapping, including the discontinuous Arctic permafrost regions (Fig. ). However, the control run slightly overestimates permafrost extent in the southern regions of Alaska, Canada, and particularly Siberia.
For the Sturm run, the overestimation of permafrost made by the control run has been resolved to the detriment of mountainous regions (in red) that have been reclassified as non-permafrost (Fig. ). In addition, the Sturm run shows a marked loss of discontinuous permafrost (in orange). In total, the Sturm run simulates a permafrost extent area equal to 9.489 106 , a strong decrease compared to the control run (13.358 106 ) and ESA-CCI (12.544 106 ) values.
To supplement our analysis with ESA-CCI permafrost extent products, we compare the results of the control and Sturm runs to the International Permafrost Association (IPA) map in Fig. .
3.5 Active-layer thickness (ALT)
Differences between the CLM5.0 and ESA-CCI ALT products indicate a noticeable positive bias increase (Fig. ) that varies across regions. While minor biases are observed over tundra areas, biases are significantly amplified over mountainous regions and in the southern Siberian regions with deep active layers. MAD and RMSE scores increase from 0.5 to 1.32 and from 0.82 to 2.13 , respectively. Note that we calculated these statistics only within regions identified as permafrost in the Sturm simulation to ensure a direct comparison of identical areas. This approach means that we excluded large regions classified as non-permafrost in the Sturm run from our analysis.
4 Discussion
4.1 Snow insulation
Earlier findings show that there is a logarithmic relationship between the winter offset and snow depth, reaching an asymptote at a snow depth of approximately 25 according to in situ observations. In Fig. , only the Sturm run accurately represents this logarithmic relationship in cold-temperature regimes. The control run exhibits a trend closer to a linear relationship, often resulting in an underestimation of snow insulation, which is consistent with findings from other modelling groups . Interestingly, CESM (using CLM5.0) shows a degradation in the representation of that relationship compared to its previous version using CLM4.5 . We hypothesise that the underestimation of snowpack density by CLM4.5 combined with the high-thermal-conductivity scheme from artificially resulted in adequate snow insulation represented by the model over Arctic tundra regions. The introduction of the new fresh-snow-density function by in CLM5.0 may have had unintended consequences, making the bulk snow density too high in Arctic tundra regions, where specific tundra snowpack features like depth hoar are not represented by the model . As the snow thermal conductivity scheme remained unchanged from CLM4.5 to CLM5.0, higher snow densities mean that heat energy from the soil can be lost to the atmosphere more efficiently, which may explain the notable cold bias observed in CLM5.0.
The spatial distribution of the winter offset in the Sturm run better aligns with previous findings compared to the control run, despite the minimal difference in effective snow depth between the two runs (below 5 % in most regions; see Fig. ). This supports our hypothesis that the snow insulation in the Sturm simulation is considerably increased and is generally more representative of tundra snowpacks.
4.2 Soil temperature
The magnitude of the cold bias observed in the control run is similar to what other modelling groups have shown , especially over colder regions, and tends to be more pronounced in deeper layers. On the other hand, some evaluations of LSMs have reported the absence of such a bias . However, these studies rely on sparse in situ measurements (often with an absence of observations in high-latitude regions) that may not fully represent the entire pan-Arctic domain. Other studies evaluating coupled LSM–Snowpack models have shown very good performance in soil temperature representation in the pan-Arctic region , underscoring the importance of accurate snow physics, albeit at a higher computational cost. Our results reveal a bias amplitude consistent across all seasons and depths, reflecting findings from prior research . This contrasts with several model studies that show larger biases in winter compared to summer. Interestingly, our findings align with similar trends observed in the study by , which examined the performance of reanalysis soil temperature data across the pan-Arctic domain and noted a prevalent cold bias.
The results of the Sturm run are consistent with a comparable experiment on snow thermal conductivity conducted by , showing a decrease in wintertime soil temperature bias and a diminishing improvement with depth. However, our results show closer alignment with the observations. Conversely, the model study by using the equation indicates an underestimation of soil temperature in winter, although their model uses a basic snowpack model with a single layer.
The persistent cold bias in simulated soil temperature in deeper layers may be attributed to several missing snow processes, including more realistic snow metamorphism or upward water vapour mass transfers within the snowpack . Recent studies have explored these missing processes . Additionally, soil processes such as the inclusion of excess ground ice , an improved phase-change scheme , and the development of adapted frozen-soil thermal conductivity models offer greater potential to improve the soil temperature accuracy in summer and at depth.
In general, the model skill scores perform better against grid-based-observation datasets rather than against in situ observations (RMSE 3.17–3.24 against ESA-CCI, RMSE 3.32–4.35 against 295GT for the control run). The divergence between model outputs and in situ observations is often attributed to the inherent scale differences. While the model operates at a coarse resolution (12 ), observations are site-specific. Comparing point observations to model grid points covering a wide area can lead to inaccuracies because individual observations may not fully represent the characteristics of the model grid-point-covered area . Scale disparities commonly stem from variations in elevation, climate, soil composition, and landscape characteristics, resulting in considerable diversity in soil thermal and hydraulic properties and, consequently, in soil temperature patterns.
Large positive-soil-temperature biases of up to 8 are particularly noticeable over high-altitude regions in our ESA-CCI evaluation. This discrepancy arises in part from variations in atmospheric forcing resolution between CLM5.0 (12 ) and ESA-CCI (1 ); lower-resolution models smooth out complex mountain terrain features into larger grid cells, leading to an inadequate representation of temperature in mountain environments . Secondly, the parameterisation of the Sturm scheme assumes the presence of basal depth hoar and overlying wind slab, potentially leading to inaccurate representation of the thermal conductivity of the basal and mid-depth snow types typically found in mountainous regions . The application of different empirical snow thermal conductivity schemes based on snow types (e.g. tundra or alpine) may address this challenge. However, identifying both the meteorological and land surface conditions needed for accurate application of such schemes in a global model like CLM would be challenging.
4.3 Sensitivity analysis to snow density
As previously stated, studies show that state-of-the-art LSMs and snowpack models, including CLM5.0, have vertical density profiles often exhibiting significant discrepancies from observed snow density, in both the top wind slab and bottom depth hoar layers of the snowpack. Such discrepancies lead to over-densification in the simulated tundra snowpack. The misrepresentation arises because the scheme does not account for the temperature-gradient metamorphism, a process that creates low-density depth hoar layers in tundra snowpacks . Without this mechanism, the simulated snow can only increase in density with age, leading to bulk densities that exceed observed values in these regions. Incorporating temperature-gradient metamorphism in future model developments would likely result in lower simulated snow densities, improving agreement with field observations .
Our sensitivity analysis shows that the RMSE reductions achieved by the Sturm parameterisation remain robust, even if future improvements are made to tundra snow densification processes that result in lower bulk densities. This improvement is most pronounced in deeper layers during winter months (FMA), when the cold wave penetrates deeply, emphasising the relevance for permafrost modelling. This suggests that the improved performance of the Sturm model over that of Jordan does not rely on unrealistically high bulk snow density values. However, the increase in RMSE caused by the overestimation of soil temperatures in upper layers during winter months is amplified when snow density is reduced. While this highlights a limitation of the Sturm scheme in certain scenarios, the overall benefits for permafrost modelling outweigh this drawback, particularly in the context of deeper soil layers where winter thermal dynamics are critical.
4.4 Permafrost extent
The comparison between the ESA-CCI permafrost data and our model results involves inherent uncertainties due to differences in spatial resolution. Our land model's grid cells are approximately 100 times larger than those of the ESA-CCI product, leading to blurred boundaries when aggregating the data. Although the ESA-CCI data itself has uncertainties, with most grid cells having uncertainties below 50 %, these are unlikely to outweigh the uncertainties introduced by the resolution mismatch.
Several other modelling groups observe an overestimation of the permafrost extent similar to the control run, as indicated by the Coupled Model Intercomparison Project version 6 (CMIP6) on permafrost physics , although not all models show this behaviour. While the Sturm run provides some mitigation of this pattern, some continuous and discontinuous permafrost areas over mountains and southern Alaska, Canada, and Siberia are lost. The issue may arise from the presence of warm permafrost at the southern edge where ground temperatures approach 0 and the soil moisture content is high. Over those regions, the accuracy of the ESA-CCI products is affected because latent heat effects slow down potential thaw, which increases the disequilibrium between atmospheric and ground temperatures . The area simulated in this study is similar to that modelled by in their Sturm experiment; however, their high-altitude regions remain classified as permafrost.
4.5 Active layer
In general, both CLM5.0 configurations show a tendency to overestimate maximum thaw depth, a trend exacerbated in the Sturm run in high-altitude and southern regions. This discrepancy has been observed in many other LSM studies . Using a knowledge-based hierarchical optimisation strategy on a series of parameters (precipitation-phase partitioning, snow compaction, and snow thermal conductivity) and input data (climate forcings and SOC density profile), effectively enhanced ALT results across more than 100 pan-Arctic sites in their LSM. While their methodology shows promise, its implementation across various model set-ups and models will require thoughtful adaptation and adjustments.
CLM5.0 performs better in high-latitude tundra regions compared to other modelling groups, which often display more pronounced regional biases. Notably, our study is the first to evaluate an LSM's ALT against a grid-based observation product, whereas most other studies to date compare their ALT results to in situ station data, e.g. CALM in . The discrepancy observed in southern regions may also be attributed to challenges faced by ESA-CCI data methods, such as probing and ground-penetrating radar, in accurately measuring ALT in regions with deeper active layers . Our findings highlight the critical need for diverse, regionally tailored observational datasets to refine model performance and better capture the complexities of permafrost dynamics.
5 Conclusions
With the growing need to assess the substantial impact of permafrost–carbon feedbacks on global climate, it is increasingly important for land surface models (LSMs) to accurately represent ground temperature in permafrost tundra regions. Snow plays a critical role over these regions, providing thermal insulation during winter, which has substantial implications for heat exchange between the atmosphere and the soil. However, Earth system models (ESMs) often lack sufficient detail regarding the spatial and temporal variability in snow insulation, among other factors.
Building upon a site experiment at Trail Valley Creek , this paper applies the relationship between snow thermal conductivity and density to the entire pan-Arctic domain, as it is better suited to the snow density profile found over Arctic tundra permafrost regions. Our aim was to study the impact of this scheme on simulated soil temperatures and permafrost dynamics, thereby improving the model's performance in reproducing snow physics over Arctic tundra regions.
The integration of the snow thermal conductivity scheme within CLM5.0 resulted in a reduction in cold biases and a closer alignment of model outputs with observational datasets (against remote sensing data, RMSE decreases from 3.17 to 2.4 ; against in situ data, RMSE decreases from 3.9 to 2.19 ). Our sensitivity analysis of snow density further validates the robustness of the Sturm parameterisation, demonstrating that its improvements persist even when accounting for potentially lower bulk snow densities in tundra environments. Furthermore, the Sturm experiment effectively addresses the overestimation of permafrost observed in the control run in southern Siberia and Canada. However, large areas over discontinuous permafrost and mountainous regions were reclassified as non-permafrost. Altogether, the Sturm run simulates a permafrost extent area of 9.489 106 , a significant decrease compared to both the control run (13.358 106 ) and the ESA-CCI (12.544 106 ) values. In addition, we observed a notable increase in the ALT bias, primarily in mountainous areas. We attribute the bias observed over high-altitude regions to two possible factors: (1) differences in the resolution of the atmospheric forcing data used in ESA-CCI and CLM5.0 and (2) potential lack of suitability in the newly implemented snow scheme in mountainous regions.
While the Sturm parameterisation offers a substantial improvement, addressing cold biases and enhancing the simulation of snow insulation in Arctic regions, it is not a panacea. Future advancements in the CLM snow scheme, particularly in the representation of snow stratigraphy and processes such as water vapour transport, will be necessary to further refine these simulations and improve model accuracy. The value of improved tundra snow thermal representation in an LSM needs testing within a fully coupled ESM to understand how consequent changes in simulated soil temperatures impact vegetation ; river flows ; permafrost-thaw-related emissions ; and consequently, climate feedbacks . Overall, our findings underscore the importance of refining snow-related processes in LSMs to enhance broader understanding of permafrost dynamics in the context of climate change.
Appendix A Additional figures
A1 Snow thermal conductivity schemes
Figure provides a comparison of five different schemes for effective thermal conductivity () across a range of snow densities from 0 to 700 . The Sturm scheme demonstrates lower values in comparison to the other schemes, particularly within the range of snow densities encountered in permafrost regions that typically fall between 200 to 300 .
Figure A1
Comparison of five schemes for from 0 to 700 for snow density. Note that the axis is logarithmic.
[Figure omitted. See PDF]
A2 Bulk snow densityFigure represents the spatial and statistical distribution of bulk snow density for the control run in our domain. The bulk snow density is calculated using the snow water equivalent (SWE) (in ) and snow depth (in ) through the following equation:
A1 where is the density of liquid water (1000 ). The mean density is 311 , with an interquartile range (P25–P75) of 216 to 380 . The histogram reveals a multimodal distribution, indicative of different snowpack types (e.g. tundra, maritime, alpine).
Figure A2
Period-averaged (1980–2021) bulk snow density for (a) the control run and (b) its corresponding histogram.
[Figure omitted. See PDF]
A3 Comparison against the permafrost extent Brown mapThe IPA categorises permafrost into four distinct classes based on its areal coverage: continuous permafrost (90 %–100 %), discontinuous permafrost (50 %–90 %), sporadic permafrost (10 %–50 %), and isolated permafrost (less than 10 %). Similar to our comparison with ESA-CCI, we compare the continuous and discontinuous IPA categories and assumed areas below 50 % coverage to be permafrost-free to align with our binary definition of permafrost.
Figure A3
The permafrost extent area difference between the CTSM control and Sturm runs (1981–1999) and the map.
[Figure omitted. See PDF]
The permafrost extent estimated in surpasses that of ESA-CCI data across southern Siberia, resulting in a nearly negligible overestimation in the control run over this area (Fig. ). However, the model fails to capture a substantial portion of discontinuous permafrost over southern Alaska.
As expected, this discrepancy leads to a more pronounced underestimation of permafrost extent in the Sturm run in many regions, including Alaska, southern Canada, and southern Siberia alongside previously mentioned areas, compared to ESA-CCI products.
It is worth noting that this comparison may be less practical than the comparison with ESA-CCI products. The data, compiled and digitised in the 1990s from historical records, represent an estimate of permafrost extent during the latter half of the 20th century . They are compared with model results covering the period of 1981–1999, suggesting a potentially lower permafrost extent than in the latter half of the 20th century.
A4 Effective snow depth in the Sturm and control runsThe effective snow depth characterises the insulation provided by snow during the cold period . is a cumulative value where the average snow depth in each month, denoted (in ), is adjusted according to its duration: A2
Snow can be present anytime from October ( 1) to March ( 6), with the maximum duration, , being 6 months. This weighting approach favours early snowfall over late snowfall, as it contributes more to the overall insulating effect. When the effective snow depth, , surpasses 0.25 , the insulating capacity of the snow remains relatively constant , and seasons with earlier snowfall typically exhibit higher than seasons with later snowfall.
Figure shows the period-averaged percentage change in effective snow depth between the control and Sturm simulations, highlighting the fact that there are few regions with percent changes higher than +5 or lower than 5. Percentage change is calculated as A3
Figure A4
Percent change in effective snow (1980–2021 period average) between the control and Sturm runs. Darker red indicates that the Sturm run effective snow is lower than that of the control run. The grey mask represents glaciers.
[Figure omitted. See PDF]
Code availability
The model version used in this study is available at
Data availability
Post-processed model simulations and observations products from ESA-CCI, as well as our 295GT dataset, are available at
Author contributions
AD carried out the model experiments and evaluations and wrote the original draft of the paper. AD and HM collected the different data for the model evaluation. HM supervised the project. LW contributed to modifying the code in the model experiment. All authors developed the idea that led to this paper and were involved in the review and editing of the paper.
Competing interests
The contact author has declared that none of the authors has any competing interests.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
Adrien Damseaux would like to thank Evie Morin for contributing to the editing and proofreading of an earlier version of the paper.
Financial support
Adrien Damseaux was supported by the AWI INSPIRES project. Heidrun Matthes was supported by the European Union's Horizon 2020 programme SOCIETAL CHALLENGES (grant agreement no. 869471). Nick Rutter and Leanne Wake have been supported by the Natural Environment Research Council (Carbon Emissions under Arctic Snow, grant no. NE/W003686/1). The article processing charges for this open-access publication were covered by the Alfred-Wegener-Institut Helmholtz-Zentrum für Polar- und Meeresforschung.
Review statement
This paper was edited by Philipp de Vrese and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The precise magnitude and timing of permafrost-thaw-related emissions and their subsequent impact on the global climate system remain highly uncertain. This uncertainty stems from the complex quantification of the rate and extent of permafrost thaw, which is influenced by factors such as snow cover and other surface properties. Acting as a thermal insulator, snow cover directly influences surface energy fluxes and can significantly impact the permafrost thermal regime. However, current Earth system models often inadequately represent the nuanced effects of snow cover in permafrost regions, leading to inaccuracies in simulating soil temperatures and permafrost dynamics. Notably, the Community Land Model (CLM5.0) tends to overestimate snowpack thermal conductivity over permafrost regions, resulting in an underestimation of the snow insulating capacity. Using a snow thermal conductivity scheme better adapted for the snowpack typically found in permafrost regions, we seek to resolve thermal insulation underestimation and assess the influence of snow on simulated soil temperatures and permafrost dynamics. Evaluation using two Arctic-wide soil temperature observation datasets reveals that the new snow thermal conductivity scheme reduces the cold-soil temperature bias (root-mean-square error, RMSE
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Matthes, Heidrun 2
; Dutch, Victoria R 3
; Wake, Leanne 4
; Rutter, Nick 4
1 Alfred-Wegener-Insitut (AWI), Potsdam, Germany; Karlsruhe Institute of Technology (KIT), IMK-IFU, Garmisch-Partenkirchen, Germany; Institute of Physics and Astronomy, University of Potsdam, Potsdam, Germany
2 Alfred-Wegener-Insitut (AWI), Potsdam, Germany
3 School of Environmental Sciences, University of East Anglia, Norwich, UK; Department of Geography and Environmental Sciences, Northumbria University, Newcastle, UK
4 Department of Geography and Environmental Sciences, Northumbria University, Newcastle, UK





