1 Introduction
Sea ice dynamics in highly compact ice result from the interaction between surface stress on the ice supplied by wind and ocean currents and the emerging internal stress in the ice. In sea ice models, the internal stress is calculated by a set of equations commonly referred to as rheology. Virtually all large-scale sea ice models used for sea ice forecasting and climate modelling use the so-called viscous–plastic (VP) rheology of or more numerically efficient derivatives thereof. Additionally, the elastic–plastic–anisotropic (EAP) approach was introduced by parameterising the anisotropy of the ice stress through interactions of diamond-shaped floes
A new branch of brittle rheologies has been proposed and extended by , and , with the latest version, i.e. the brittle Bingham–Maxwell (BBM) rheology, implemented and used in the next-generation sea ice model (neXtSIM)
The BBM rheology, like the other brittle rheologies, is a damage-propagation model. It parameterises the density of fractures (the mechanical weakness of sea ice) at the subgrid scale with a scalar damage variable. In this framework, undamaged ice is fully elastic, and damage increases when the local stresses reach the Mohr–Coulomb failure criterion. An increase in damage results in a decrease in elasticity, simulating the fracturing of the ice. Once fractured, the ice can also deform viscously, simulating the permanent deformation of fractured ice. This permanent viscous deformation is limited in convergence by resistance to ridge formation, which is also accounted for by the BBM rheology.
As pointed out in , the BBM rheology has many parameters, with some well-defined constants and some poorly constrained. These parameters strongly and nonlinearly impact the patterns of sea ice drift and deformation simulated by the model. Visually, the differences between observed and simulated deformation fields can guide the selection of a model parameter value. Still, such manual tuning can become complicated when several parameters must be considered. This work aims to develop a set of metrics for quantitative comparison of the simulated and observed sea ice deformation fields and to use these metrics for tuning BBM parameters utilising a deep learning approach.
2 A brief introduction to the BBM rheology
The constitutive model of the BBM rheology consists of a parallel dashpot and a friction element, connected in series with a spring (see Fig. ). The spring represents elastic ice deformation, the dashpot represents viscous deformation when the ice is fractured and the friction element represents the resistance of broken ice to ridge formation. In a simple 1D case, these regimes can be summarised as follows (with , and denoting total, elastic and viscous internal stresses, and , and denoting total, elastic and viscous deformations). Note that due to the serial connectivity, the elastic stress always equals the total stress, i.e. .
-
When sea ice is undamaged, viscous stress is zero, and total deformation is fully reversible (elastic): , and , where damage is a single scalar to parameterise the fracture density at the subgrid scale. The damage value is altered whenever the local stress exceeds the Mohr–Coulomb failure criterion.
-
When ice is damaged and diverging, the friction element is inactive; therefore, the viscous stress equals the elastic and total stress. Deformation is both elastic and viscous: , , and .
-
When ice is damaged and converging with weak internal stress, the friction element is active, viscous stress is zero and all deformations are elastic. , , and , where is a compressive ice strength threshold that separates the elastic behaviour from the elastic and stress-dissipative behaviour of damaged sea ice.
-
When ice is damaged and converging with strong internal stresses, the friction element is inactive; therefore, the viscous stress is equal to the elastic and total stress, and deformation is both elastic and viscous: , , and .
2 where is sea ice thickness, m is a constant reference thickness, is the exponent of the compression factor, is a constant reference stress, is compaction parameter and is ice concentration.
The time derivative of total stress (see details in Ólason et al., 2022) is 3 where elasticity is a function of damage and concentration: 4 is the stiffness tensor operation: 5 where is the viscous relaxation time: 6 with (undamaged elasticity), (Poisson's ratio), (undamaged viscous relaxation time) and being constants.
Damage occurs in the BBM rheology whenever the simulated stress in a grid cell or element is outside the failure envelope (or yield curve). The failure envelope of the BBM rheology is the Mohr–Coulomb criterion: 7 where is the internal friction coefficient, and is the cohesion. Following , we let the cohesion scale with model resolution as 8 where is the distance between model node points, and is the cohesion at the reference length scale, i.e. cm, where the cohesion was measured to be of the order of 1 MPa .
The aforementioned parameters of the neXtSIM BBM rheology are summarised in Table . The ice–atmosphere drag coefficient is also added to the table (although, strictly speaking, it is not a rheology parameter) as it controls the amount of energy transferred from wind and ocean to sea ice and strongly affects the sea ice drift speed and, respectively, sea ice deformation. According to , such parameters as and require tuning using satellite observations at large spatial and temporal scales. Given that the rheology parameters nonlinearly affect the field of sea ice deformation, a metric based on satellite-derived deformation should be used, and the tuning should capitalise on nonlinear methods, such as deep learning.
Table 1The default and optimised (see Sect. ) parameters of the BBM rheology.
Parameter | Symbol | Default value | Optimal value |
---|---|---|---|
Undamaged elasticity | Pa | ||
Undamaged viscous relaxation time | s | ||
Reference thickness | 1 m | ||
Damage parameter | |||
Compaction parameter | |||
Scaling parameter for compressive strength | kPa | 5.1 kPa | |
Cohesion at the reference scale | MPa | 1.2 MPa | |
Poisson's ratio | |||
Internal friction angle tangent | 0.7 | 0.7 | |
Exponent of compression factor | 3/2 | ||
Ice–atmosphere drag coefficient |
We used the Lagrangian sea ice motion data from the RADARSAT Geophysical Processing System (RGPS) for tuning the above rheology parameters. The dataset contains trajectories of virtual buoys tracked using pattern-matching techniques on synthetic aperture radar (SAR) imagery from RADARSAT-2. The buoys are initialised by the RGPS in mid-December in the western Arctic on a regular grid with 10 km spacing on individual SAR images. The position of each virtual buoy is tracked from one image to another overlapping image during the entire winter season (December–May). The trajectory is terminated if a virtual buoy cannot be tracked due to a loss of similarity between SAR images or the absence of images. New virtual buoys are initialised in the regions with a low density of the tracked buoys that appear due to sea ice divergence or disappearance of older buoys. The average time between overlapping RADARSAT-2 image acquisitions is 3 d, but it may vary from 0.5 to 10 d. Therefore, the timing of virtual buoy positions is highly heterogeneous, even for neighbour trajectories (see Appendix for details).
4 Methodology
4.1 Overview of the parameter tuning algorithm
In our approach, we compute a set of descriptors that characterise patterns in the fields of observed and simulated sea ice deformation. Since the correlation between the descriptors and model parameters is weak (see Fig. ), we could not use linear-regression-based methods (e.g. Ensemble Kalman filter; ) and chose a deep learning (DL) approach instead. In our DL approach for tuning the neXtSIM parameter values, we train a neural network based on the modelling results and apply it to actual observations. The inputs for the neural network are the descriptors of the sea ice deformation, and the target is a value of a rheological parameter.
The algorithm for DL parameter tuning can be summarised as follows:
-
We choose the neXtSIM rheology parameters for tuning and perturb their values to generate an ensemble. Let denote the th parameter for the th member of the ensemble, then denotes a vector of the th parameter for all members.
-
An ensemble of neXtSIM instances is run with the same forcings but with different rheology parameters:
9where is the sea ice model state (sea ice concentration, thickness, drift, etc.), is neXtSIM and is time.
-
Let denote only one model variable: sea ice drift. Then denotes the operator for computing a sea ice deformation field and a quantitative characterisation of ice deformation pattern :10The size of is much smaller than the ice deformation field. For example, a daily deformation field containing sea ice deformation values can be characterised by a vector with values.
-
Let denote a set of vectors from all members and all time steps. Hereafter, represents the deformation pattern descriptors or simply descriptors. A neural network is trained (operator ) with the deformation pattern descriptors () as input and the rheology parameters () as the target:11
-
Deformation fields and deformation pattern descriptors are computed from the observed sea ice drift for each time step :12
-
The neural network is applied to the deformation pattern descriptors computed from the observed ice drift, and averaging () is applied for computing the optimal value of a neXtSIM parameter:13
Figure 1
Scheme of machine learning (ML)-based tuning of neXtSIM parameters. Blue arrows denote operations with data. Yellow squares denote modelling data. Green squares denote observations. The operators and denote, respectively, neXtSIM simulations and computation of sea ice deformation descriptors.
[Figure omitted. See PDF]
4.2 Running an ensemble of neXtSIM instancesWe ran two ensembles with neXtSIM instances. In the first experiment, the ensemble consisted of 50 members, and the values of four parameters were perturbed using the Latin hypercube method : , , and . In the second experiment with 70 members, the following parameters were perturbed with the same method: , , , and . The and parameters were added as they control the influence of sea ice thickness on . See the ranges of the perturbed parameters in Table .
Table 2
Ranges of parameters perturbed in two experiments.
Experiment | Parameter | Symbol | Min | Max |
---|---|---|---|---|
1 | Scaling parameter for compressive strength | 0 | 20 kPa | |
1 | Cohesion at the reference scale | 0.5 | 3 MPa | |
1 | Poisson's ratio | 0.27 | 0.33 | |
1 | Internal friction angle tangent | 0.55 | 0.75 | |
2 | Scaling parameter for compressive strength | 0 | 15 kPa | |
2 | Cohesion at the reference scale | 0.2 | 2 MPa | |
2 | Exponent of compression factor | 0.5 | 2.5 | |
2 | Compaction parameter | |||
2 | Ice–atmosphere drag coefficient | 0.001 | 0.003 |
The neXtSIM instances were run at a 10 km resolution mesh, covering the central Arctic Ocean. Ocean forcing from the TOPAZ4 reanalysis and atmosphere forcing from the ERA5 reanalysis were used; neXtSIM exports snapshot outputs every hour with coordinates and connectivity of the nodes of the triangular mesh and model variables for each mesh element, including ice concentration, thickness, etc. End-to-end indexing of the model nodes allows for the identification of similar nodes on two snapshots and the computation of the displacement of the node, i.e. the simulated sea ice drift.
4.3 Preprocessing of neXtSIM dataTo ensure comparability of sea ice drift and deformation from RGPS and neXtSIM, we subsample the model mesh using the mesh of satellite observations. First, for a given set of virtual RGPS buoys that have the same starting time and ending time (i.e. a single pair of SAR images is used for ice drift computation for these buoys), two model snapshots with the closest simulation time are selected from the neXtSIM outputs. Next, only the neXtSIM nodes near the RGPS nodes are selected on the first snapshot, and the corresponding nodes are chosen on the second snapshot (see Fig. for an example). Nodes may disappear during simulation, or new nodes map appear due to convergence/divergence and consequent remeshing. In that case, a new Delaunay triangulation connectivity is computed between the nodes existing on the first and second snapshots. Further drift, deformation and descriptor calculations are performed on subsets of trajectories (same for RGPS or neXtSIM) with the same start and end times. They are somewhat limited in space (by the intersection area of two SAR images).
4.4 Computing the descriptors of the sea ice deformation
We compute the divergence and shear components of the deformation tensor and the total deformation rates using a standard method of contour integrals of velocity for each element of the mesh subset mentioned above. The following descriptors of the total deformation field are computed from each subset as described in the subsections below:
-
structure–function from the spatial scaling analysis;
-
image anisotropy at different spatial scales;
-
Haralick texture features at different spatial scales;
-
length, density, and angle of intersection of linear kinematic features (LKFs); and
-
mean and 90th percentile of ice deformation values.
4.4.1 Spatial scaling analysis
As described in , the deformation is computed at different spatial scales by iterative coarse-graining . First, the deformation is calculated on the native resolutions of RGPS and neXtSIM (which are very similar). Next, some nodes are randomly removed, the remaining node positions are triangulated, and deformation is computed again. The last step is repeated several times until at least three nodes remain in the subset. Information about the area of the element used for computing deformation is preserved. This iterative procedure is repeated several times, starting from the initial deformation field, to collect sufficient deformation observations computed at different spatial scales.
The spatial scale is linked with the statistical moments of the total deformation probability density function using the following equation:
14 where and are coefficients found using the least squares method, is the statistical moment order, and the subscript “lg” indicates logarithmic space. The th statistical moment is computed as , where is mean total deformation and denotes averaging.
Coarse-graining is performed on each deformation subset (see Sect. 3.3). Still, the and coefficients are computed using deformation values (and corresponding spatial scales) from all image pairs acquired within 3 d. Hereafter, the offset and scale of the first statistical moment are denoted
Image anisotropy characterises localisation of image intensity in a linear feature . Anisotropy is high (up to 1) for images of bright, thin or long lines and is low (down to 0) for images with dark, thick or short lines. We compute the image anisotropy as
15 where and are the eigenvalues of the inertia matrix : 16 where and are coordinates of the pixels of the image with intensities (i.e. ice deformation in our case).
In our study, anisotropy is computed on triangular mesh elements as illustrated in Fig. . Only elements with deformation above 0.1 d−1 are used to avoid the impact of noise in sea ice drift and deformation . For computing in a selected element, the nearest-neighbour elements are found, and values of deformation and the coordinates of the centres of the elements are used in Eqs. () and () (see Fig. , left column). For larger spatial scales (Fig. , second, third and fourth columns), values of the deformation and coordinates are collected from the neighbours of the neighbours. After processing a single mesh subset, each element is characterised by a vector of image anisotropy computed at spatial scales of 10, 20, 30, 40 and 50 km. For every 3 d, a median and 90th percentile (P90) of anisotropy from all elements of all mesh subsets are computed for each spatial scale and denoted hereafter as
Figure 2
Computation of image anisotropy on a mesh subset. The upper row shows a mesh subset with values of ice deformation (d−1), and the lower row shows values of anisotropy computed at different spatial scales. The blue dot on the upper row shows the location of an arbitrary element for which anisotropy was computed, and the red dots show the neighbours from which the deformation and coordinates were collected.
[Figure omitted. See PDF]
4.4.3 Texture featuresHaralick texture features (TFs) are extensively used for quantitative characterisation of image texture in tasks dealing with image segmentation . A grey-level co-occurrence matrix (GLCM) is computed at the first stage of the texture analysis. The GLCM is a 2D distribution of the probability of a pixel value and its neighbour value. The neighbours can be selected at varying orientations and distances from the central pixel. At the second stage of texture analysis, several simple algebraic formulas are applied to the GLCM to compute statistical moments of the distribution (mean, standard deviation, kurtosis, etc.) and more complex characteristics (energy, entropy, etc.).
In our study, we compute the GLCM from the triangular mesh elements. We accumulate information about an element and all its neighbours at a given distance in all directions in one GLCM. For one-edge distance, we use the values from three immediate neighbours; for two-edge distances, we use the values from neighbours of neighbours (excluding the central element and duplication), and so on, as shown in Fig. . We populate the GLCM with data from all elements from all mesh subsets acquired within 3 d. The following TFs are computed at distances of 1, 2, 4 and 8 edges using the
Figure 3
Scheme of the GLCM computation. The upper left map shows a mesh subset with values of total deformation from neXtSIM. The upper row shows the neXtSIM mesh with the central element coloured orange, and the neighbours at the one-, two-, four- and eight-edge distances are coloured yellow. The lower row of images shows corresponding GLCMs.
[Figure omitted. See PDF]
4.4.4 LKF intersection angleproposed a method for detecting linear kinematic features (LKFs) for RGPS and model data as well as several metrics for the characterisation of LKFs. These metrics are successfully applied to evaluate sea ice models in a large intercomparison experiment . In our work, we rasterised the 3 d deformation maps on 12.5 km resolution grids and applied the LKF detection method of . The number of LKFs, average length of LKFs and average intersection angle of conjugate faults were used as the descriptors (see Table for notation).
In addition to the descriptors listed above, the median and P90 of divergence, convergence, and shear were computed for each of the 3 d. Thus, a vector of descriptors constituted 49 values: median and P90 of deformation; median and P90 of image anisotropy computed at five spatial scales; six texture features at four distances; slopes and offsets of three statistical moments; and length, number, and intersection of LKFs. Table shows the notation used hereafter in detail. Such vectors were generated from neXtSIM simulations for each day (using a sliding window of 3 d) from 5 December 2006 to 11 April 2007 for the first experiment and from 5 December 2006 to 15 May 2007 for the second experiment. Therefore, we had and vectors for training the ML model in the first and second experiments.
Table 3
Descriptor notation.
Description | Notation |
---|---|
Divergence, convergence and shear (median and P90) | |
Anisotropy median at 10, 20, 30, 40 and 50 km spatial scales | |
Anisotropy P90 at 10, 20, 30, 40 and 50 km spatial scales | |
Dissimilarity TF at a distance of 1, 2, 4 and 8 pixels | |
Homogeneity TF at a distance of 1, 2, 4 and 8 pixels | |
Angular second moment TF at a distance of 1, 2, 4 and 8 pixels | |
Energy TF at a distance of 1, 2, 4 and 8 pixels | |
Correlation TF at a distance of 1, 2, 4 and 8 pixels | |
Contrast TF at distances of 1, 2, 4, 8 pixels | |
First, second and third statistical moments (slope and offset) | |
Average LKF intersection angle, length and number |
We test the applicability of these descriptors in two steps: on the one hand, we compare probability density functions (PDFs) for descriptors from RGPS and neXtSIM, and on the other hand, we use an autoencoder. In the first step, we scale the values of descriptors from the RGPS using the mean and standard deviation of the values from neXtSIM: , where represents all descriptors from the RGPS, and and are the mean and standard deviation, respectively, for all descriptors from neXtSIM.
The mean and standard deviation of the scaled descriptors are analysed, and only the descriptors with scaled standard deviation below 3 remain for further use. Six descriptors computed from RGPS data have significantly different values from the neXtSIM descriptors and are expected to mislead the training.
We trained an autoencoder with dense layers with 32, 16, 8, 16 and 32 neurons on the down-selected descriptors from neXtSIM. Due to the bottleneck, the autoencoder acts as a nonlinear principal component analysis. It can be used for anomaly detection either in the input features or in input records . We applied the autoencoder to the down-selected neXtSIM and RGPS descriptors and computed the root-mean-square difference (RMSD) between the input vector and the autoencoder output for neXtSIM and the RGPS. We excluded seven descriptors with high RMSE in RGPS data from further processing as these are anomalous compared to neXtSIM training data.
4.6 Training of machine learning algorithms
We trained two types of ML models with the values of deformation pattern descriptors as input and a single value of a neXtSIM parameter as a target: a linear regression (LR) model and a deep neural network (DNN). For both models, we split the dataset from neXtSIM into two parts () for training and validation. Training and validation data are taken from different months (selected randomly). The models are trained on neXtSIM data and then applied to all RGPS descriptors. We repeated this procedure 10 times with a new random permutation and averaged the inference results on the RGPS from each repetition. Eqs. () and () can be rewritten as follows with being the index of repetition: The LR model can be formulated as
18 where is the vector of the th model parameter for all 3 d periods, is a matrix with down-selected descriptors for all periods and is a matrix with linear regression coefficients for that model parameter. Values in are found using the least squares method. The LR model does not require a split into training/validation datasets. However, only the training dataset was compared with the DNN results.
The DNN model contains three hidden dense layers with 32, 16 and 8 fully connected neurons. We found this architecture optimal in a simple hyperparameter tuning experiment. The hidden layers use the rectified linear unit (ReLU) activation function, and the output layer uses linear activation. We trained the DNN with the Adam optimiser , and the validation loss (computed as absolute error) decreased.
5 Results and discussions5.1 Sea ice deformation fields from neXtSIM
identified and as two parameters of their rheology that are poorly constrained and have a substantial visual impact on the deformation fields. Figure compares total sea ice deformation derived from RGPS (first column) and neXtSIM runs from the first experiments with the highest and lowest values of and . Three dates were chosen for Fig. in 2007 with low (15 February), moderate (25 January) and high (3 February) deformation events. These maps illustrate that both parameters significantly affect the pattern of sea ice deformation, but their influence is manifested differently. For example, the increase in results in broader and longer LKFs with higher deformation rates. These pronounced LKFs surround quite large floes; the background deformation remains relatively low. The increase in seems to affect the background deformation more – at the lowest value, the deformation between the main LKFs is mostly zero, and it increases with higher to become almost spatially homogeneous. Visually, it is hard to say which of these maps better matches the RGPS data, but we can use the similarity of deformation descriptors in PDFs as the metric. Nevertheless, optimisation of multiple rheology parameters is required to find the best match.
Figure 4
Maps of total deformation from RGPS and neXtSIM for three selected dates representing low, moderate and strong deformation events. Each map represents a 3 d mosaic; that is, the deformation is derived from pairs of RADARSAT-1 images (and corresponding neXtSIM snapshots) accumulated over 3 d, starting from the indicated date.
[Figure omitted. See PDF]
5.2 Deformation pattern descriptorsFigure shows the PDFs of the deformation pattern descriptors computed from RGPS and the first neXtSIM experiment. Comparison of the PDFs from all neXtSIM runs (blue shaded area) with the PDFs from the run with the lowest (orange line) or the highest (green line) value illustrates that some descriptors (
The PDF of most RGPS-derived descriptors lies well within the range of neXtSIM-derived descriptors and peaks between the highest and lowest PDFs (
Figure shows the correlation of all descriptors with the values of all parameters from the first experiment. The correlations are generally relatively low, except for , which correlates with
Figure presents the mean and standard deviation of the RGPS-derived descriptors normalised by the mean and standard deviation of the neXtSIM descriptors. We use it at the first step of evaluating usability and selecting the descriptors. Only 43 descriptors computed from RGPS data show a relative mean value between and 1.5 standard deviations of the neXtSIM descriptors. We exclude the following six descriptors from further processing:
Figure compares root-mean-square difference (RMSD) between input and predictions of the autoencoder trained on normalised neXtSIM data and applied to neXtSIM and RGPS descriptors. The RMSD of neXtSIM-derived descriptors is below 1, showing that even a relatively shallow autoencoder (five layers with eight neurons in the bottleneck) can reproduce the variability of the descriptors well. When the same autoencoder is applied to the RGPS data, some of the descriptors (
Figure 5
PDFs of a few deformation descriptors for RGPS (red), all neXtSIM runs (blue), and runs with lowest (orange) or highest values of . The descriptor
[Figure omitted. See PDF]
Figure 6
Correlations between four parameters of the first experiment and the deformation descriptors.
[Figure omitted. See PDF]
Figure 7
Relative values of the mean (shown by bar height) and standard deviation (shown by error bars) of descriptors computed from RGPS data. The blue colour shows the excluded descriptors.
[Figure omitted. See PDF]
Figure 8
RMSD between input and output of the autoencoder trained on neXtSIM data and applied to neXtSIM (orange) and RGPS (blue or green). The blue colour shows RGPS descriptors excluded from further analysis.
[Figure omitted. See PDF]
5.3 Training and inference of ML modelsFigure shows the DNN training results for the first (upper row) and second (lower row) experiments (see also Table ). These scatterplots compare the actual and the predicted neXtSIM parameters from the test dataset from all 10 repetitions. In the first experiment, the DNN derives only the and with sufficient accuracy (correlations are 0.75 and 0.88, respectively). In the second experiment, the accuracy of the retrieval is somewhat lower (), while the accuracy of the and retrievals is high (correlations are 0.83 and 0.75, respectively). The DNN does not show any skill in retrieving or parameters, and the accuracy of retrieving and is somewhat higher for a short range of values. Still, overall, these four parameters cannot be derived with the machine learning approach. LR accuracy is lower (see Table ), and the scatterplots are not shown.
Figure 9
Comparison of the actual neXtSIM parameters ( axis) and the retrievals by the DNN. The upper row shows results from the first experiment, and the lower row shows the second experiment. The black lines show a relation.
[Figure omitted. See PDF]
Figure shows PDFs of parameters used for training (blue line) and derived from the RGPS data using DNN (orange line) and LR (green line) models in the two experiments mentioned above (upper and lower rows). In both experiments, PDFs of and parameters have a clear peak at kPa and MPa, respectively. Notably, these peaks do not correspond to the centre of the distribution of the parameters used for training (blue line). We can observe similar behaviour for and parameters, which also have relatively high retrieval accuracy for the testing data. For , and parameters, the situation is different – the accuracy of the retrievals for the testing data was low, and the retrievals from the RGPS data are centred on the distribution of training values.
Figure 10
PDFs of parameters used for training (blue lines) and derived from RGPS descriptors using DNN (orange) and LR (green). The upper row shows the results from the first experiment, and the lower row shows results from the second experiment.
[Figure omitted. See PDF]
5.4 Optimal rheology parameters for neXtSIMSince the training accuracy was high, the peaks of PDFs of RGPS-derived parameters are pronounced and have an offset from the centre of the input data distribution; we can conclude that the values of , , and parameters derived from RGPS can be used for optimal representation of the deformation patterns by neXtSIM. Table lists the parameter values derived in different experiments and the accuracy of the model training. Note that the DNN always has higher accuracy than the LR model.
Table 4
Average values of neXtSIM parameters (Param.) derived from the RGPS. The optimal values are marked in bold.
Param. | Exp. | Method | Mean | |
---|---|---|---|---|
training | inference | |||
1 | LR | 0.66 | 4.88 kPa | |
1 | DNN | 0.75 | 5.11 kPa | |
1 | LR | 0.83 | 1.10 MPa | |
1 | DNN | 0.88 | 1.21 MPa | |
1 | LR | 0.36 | 0.305 | |
1 | DNN | 0.35 | 0.305 | |
1 | LR | 0.4 | 0.70 | |
1 | DNN | 0.5 | 0.70 | |
2 | LR | 0.59 | 4.72 kPa | |
2 | DNN | 0.61 | 5.26 kPa | |
2 | LR | 0.78 | 1.11 MPa | |
2 | DNN | 0.83 | 1.19 MPa | |
2 | LR | 0.3 | 1.49 | |
2 | DNN | 0.35 | 1.38 | |
2 | LR | 0.47 | ||
2 | DNN | 0.54 | ||
2 | LR | 0.69 | ||
2 | NN | 0.75 |
We ran neXtSIM with the optimal parameter values, and Fig. shows a comparison with RGPS for the three exact dates. We can see that the patterns of the divergence and shear fields are very similar for neXtSIM and RGPS concerning the density and orientation of LKFs, the magnitude of deformation, and (overall) the texture of the deformation field, thus confirming that the found parameters are indeed optimal.
We compared the PDFs of the deformation descriptors using the Kolmogorov–Smirnov (KS) test. The KS test is applied to the PDFs of deformation descriptors computed from neXtSIM and RGPS and is averaged over all usable descriptors. The average KS test is the lowest for the neXtSIM run with the optimal parameters (0.41) and is slightly lower than for the default parameters (0.49).
The simulated sea ice drift was validated against the RGPS drift by comparing the velocity vectors of each virtual buoy from RGPS data and the matching node on the neXtSIM mesh. The ice drift root-mean-square error for the run with optimal parameters is 0.04 m s−1, slightly lower than for the run with the default parameters (0.05 m s−1).
Sea ice thickness (SIT) from different runs was compared to the monthly average ice thickness from ICESat-1 in March 2007 . SIT RMSE is the highest ( m) for the runs with MPa, but no significant differences between the other runs were found (RMSE m).
It is interesting to note similarities and differences between the parameters used by . We note that the values we get for , and are very similar to those used by . This is to be expected for and , which are based on well-established values
In terms of the stress balance in the ice, , and are the most important, as they determine the momentum transfer from atmosphere to ice, the resistance to ridging and the shear strength of the ice, respectively. Interestingly, our estimate of is higher than the value used by ; vs. , resulting in more momentum transfer from the atmosphere to the ice. At the same time, both and are lower ( vs. kPa and vs. MPa, respectively). This results in an overall weaker ice cover compared to the parameters used by . It would, therefore, be reasonable to expect that our set of parameters would lead to an overestimate of the deformation, but this is not the case. This underlines the system's complexity and indicates that there may be multiple local minima in the parameter space, giving reasonable results. For such a system, using a systematic approach which samples the full parameter space – similar to what we propose here – is extremely beneficial.
Figure 11
Maps of divergence and shear from the neXtSIM run with default parameters (a, d), optimal parameters (b, e) and from RGPS (c, f) for 3 February 2007.
[Figure omitted. See PDF]
5.5 Temporal variations of neXtSIM parameter valuesThe PDFs of parameters presented in Fig. are derived from all RGPS descriptors computed in winter 2006–2007. However, we can also apply the ML model trained on neXtSIM data from winter 2006/07 to the RGPS data acquired since 1998. We applied the autoencoder described in Sects. 4.5 and 5.3 to the RGPS data from the earlier period to test if the trained ML model has sufficient generalisation skills. We could not detect any significant anomalies in these data. Since the encoder section of the autoencoder has the same architecture as the ML model used for inferring the rheology parameters, we can conclude that the ML model trained on data from winter 2006/07 is general enough to be applied to the earlier RGPS data. Figure shows temporal variations of the derived optimal parameters on daily and interannual timescales. To create the latter plot, we computed the descriptors from the RGPS data for 1997–2008 and applied the DNN models trained on neXtSIM data from 2006–2007.
On daily timescales, the derived parameters show very high variability, reflecting frequent changes in the pattern of sea ice deformation due to varying atmospheric forcing. The DNN model from the first experiment is more sensitive to these variations and even produces unphysical negative values of during very high deformation at the beginning of February 2007. Despite substantial variability, the parameters are stable on the annual scale and do not show any significant trends.
Figure 12
Time series of parameter values derived from RGPS for 1 year (left column) and several years (right column). Colour denotes the experiment, and the shaded area shows the standard deviation of samples produced by 10 neural networks for the daily values (left column) or the samples collected from the entire year (right column).
[Figure omitted. See PDF]
On the interannual scale, the parameters tend to change; and slightly decrease, while and gradually increase. A gradual change in the observed pattern of sea ice deformation can explain this observation. In the 1990s, thicker ice had sparser and more pronounced LKFs (see Fig. : high and low maps). As ice became thinner, more background deformation appeared, LKFs became denser and the magnitude of deformation in LKFs slightly decreased (see Fig. : low and high maps). The impact of internal friction angle or air drag is not shown in Fig. , but the mechanisms are similar: higher Mohr–Coulomb slope allows the ice to resist breakup longer and creates larger floes (mimicking the earlier, thicker ice situation); a low air coefficient decreases ice drift and, consequently, ice deformation, which again corresponds to an earlier period of RGPS observations.
After the first experiments, we observed the interannual trends in the and parameters (see Fig. ). This leads us to conclude that the weak dependence of these rheology parameters on ice thickness (and potentially concentration) indicates that the parameterisation of (see Eq. ) requires optimised tuning of and parameters. That was the motivation to run the second experiment and to retrieve values of , and . Unfortunately, as Table shows, the accuracy of ML models for these parameters is not sufficiently high, and we cannot derive their optimal values. Nevertheless, accounting for these parameters in the second experiment allowed us to train ML models that show fewer diurnal variations and are more stable on interannual timescales. Moreover, despite low accuracy, the ML models predict lower values of (1.38) and () than the default ones. Dedicated experiments are needed to tune these parameters further.
6 ConclusionsWe developed a new set of metrics for characterising the patterns in the sea ice deformation fields. These metrics are sensitive to the parameters of a sea ice model rheology and can be used to compare simulated and observed ice deformation for model evaluation or parameter tuning.
We developed a new method for tuning model parameters using machine learning. In our process, we train an ML model using simulated data and apply it to observations. In our case, the inputs to the ML model are the descriptors of sea ice deformation, and the targets are the sea ice rheology parameters. We tested a linear regression (LR) and a deep neural network (DNN) as ML models, and DNN always showed higher accuracy. This method can be applied to tune the parameters of any other model.
Using the new set of metrics and the new ML-based method, we found values of four BBM rheology parameters that were poorly constrained previously: scaling parameter for compressive strength ( kPa), cohesion at the reference scale ( MPa), internal friction angle tangent () and ice–atmosphere drag coefficient ().
Our experiments cover a wide range of weather and sea ice conditions: from thin young ice in the eastern Arctic to thick multiyear ice (MYI) near the Canadian Arctic Archipelago, from the beginning to the end of the freezing period and from calm days to winter storms. Presumably, the ML model trained on such heterogeneous data is general enough to be applied in regions with similar ice conditions, e.g. Laptev, Kara, Barents, and Lincoln seas in the Arctic or Weddell and Ross seas in the Antarctic. Applying neXtSIM in the Antarctic shows that the model reproduces the seasonal cycle of sea ice extent and that the BBM rheology simulates the sea ice drift with higher accuracy . In other regions (e.g. the marginal ice zone), where the conditions are quite different, the sea ice concentration is lower; the ice elasticity drops substantially (see Eq. 4); and the rheology is no longer sensitive to , , or other parameters.
The tuned parameters exhibit weak interannual drift related to changes in sea ice thickness and corresponding changes in ice deformation patterns. Improving the dependence on thickness and concentration in the BBM rheology or tuning the corresponding parameters may reduce the drift and make the rheology completely independent of the ice thickness influence. Recent observations of ice drift and deformation obtained from Sentinel-1 SAR data processing are recommended for tuning the rheology to reflect the current situation and provide higher-accuracy forecasts.
Other parameters in our experiments (exponent of compression factor, Poisson's ratio, compaction parameter) did not impact the pattern of sea ice deformation, or the influence of other parameters masked their impact. Therefore, their values could not be derived using our method. Dedicated experiments are required to study the sensitivity of the proposed metrics to these parameters and to tune their values.
Appendix A Additional figures
Figure presents a scheme of the Bingham–Maxwell rheology, where a spring is connected in series with a block, where a dashpot and a friction element are connected in parallel.
Figure .A shows examples of RGPS trajectories spanning 150–180 d, and Fig. b shows positions of virtual buoys detected on SAR images acquired between 1 and 5 January 2007. Points are coloured by the time of image acquisition. Panel (c) in Fig. shows the shear component of deformation computed from the drift of buoys shown on panel (b). The figure illustrates how heterogeneous the RGPS Lagrangian ice motion data are in space and time.
Figure shows a colocation of a neXtSIM mesh (shown in black) and a triangulated subset from the RGPS dataset (shown in red). The RGPS subset is created by selecting starting and ending virtual buoy positions belonging to the same RADARSAT-1 images separated by 3 d. All nodes on the neXtSIM mesh within 10 km from the RGPS subset are selected (blue) and used for the computation of deformation.
Figure A1
A schematic of the Bingham–Maxwell constitutive model showing a dashpot and a friction element connected in parallel, with both connected to a spring in series. The figure is adapted from .
[Figure omitted. See PDF]
Figure A2
(a) Example trajectories of virtual buoys detected on RADARSAT-1 data by the RGPS between 1 December 2006 and 15 May 2007. (b) Position of virtual buoys on SAR images acquired between 1 and 5 January 2007. Points are coloured by the starting and ending image acquisition times, as shown in the legend. (c) Shear computed from the RADARSAT-1 image pairs shown on (b).
[Figure omitted. See PDF]
Figure A3
Illustration of the RGPS and neXtSIM mesh colocation. The neXtSIM mesh from one snapshot is shown in black. The RGPS mesh created by triangulation of virtual drifters detected on a pair of RADARSAT-2 images is shown in red. The neXtSIM mesh subsampled from the original mesh for colocation with the RGPS mesh is shown in blue.
[Figure omitted. See PDF]
Figure A4
PDFs of all deformation descriptors for RGPS (red), all neXtSIM runs (blue), and runs with lowest (orange) or highest values of .
[Figure omitted. See PDF]
Code and data availability
The code for processing RGPS and neXtSIM data and computing ice deformation descriptors is publicly available on GitHub and at 10.5281/zenodo.13301869 .
The code for tuning the neXtSIM BBM parameters and preparing the figures for the paper is publicly available on GitHub and at 10.5281/zenodo.13302227 .
Samples of neXtSIM data used in the study are publicly available on Zenodo
RGPS data are publicly available on the Alaska Satellite Facility website:
Author contributions
AK, YY and EÓ designed the experiment and wrote the paper. EÓ developed the BBM rheology and largely contributed to the development of neXtSIM. AK developed the code for computing the metrics and for running the ML-based parameter tuning. AK ran all the experiments.
Competing interests
The contact author has declared that none of the authors has any competing interests.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
We acknowledge support from the Research Council of Norway (project “Multi-scale Sea Ice Code”) and support from the Norwegian Space Agency (project “Synergy of SAR, radiometry, and altimetry for Digital Arctic Sea Ice Twin”. We are thankful to Pierre Rampal for fruitful discussions on the methodology for image anisotropy computation and the results of the experiments.
Financial support
This research has been supported by the Norges Forskningsråd (grant no. 325292) and the Norsk Romsenter (grant no. 74CO2221).
Review statement
This paper was edited by Alex Megann and reviewed by William Gregory and one anonymous referee.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
We developed a new method for tuning sea ice rheology parameters, which consists of two components: a new metric for characterising sea ice deformation patterns and a machine learning (ML)-based approach for tuning rheology parameters. We applied the new method to tune the brittle Bingham–Maxwell rheology (BBM) parameterisation, which was implemented and used in the next-generation sea ice model (neXtSIM). As a reference dataset, we used sea ice drift and deformation observations from the RADARSAT Geophysical Processing System (RGPS).
The metric characterises a field of sea ice deformation with a vector of values. It includes well-established descriptors such as the mean and standard deviation of deformation, the structure–function of the spatial scaling analysis, and the density and intersection of linear kinematic features (LKFs). We added more descriptors to the metric that characterises the pattern of ice deformation, including image anisotropy and Haralick texture features. The developed metric can describe ice deformation from any model or satellite platform.
In the parameter tuning method, we first run an ensemble of neXtSIM members with perturbed rheology parameters and then train a machine learning model using the simulated data. We provide the descriptors of ice deformation as input to the ML model and rheology parameters as targets. We apply the trained ML model to the descriptors computed from RGPS observations. The developed ML-based method is generic and can be used to tune the parameters of any model.
We ran experiments with tens of members and found optimal values for four neXtSIM BBM parameters: scaling parameter for compressive strength (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer