1 Introduction
Seismic event detection is an important initial processing step in the analysis of signals from a seismic network. Using human analysts or automated techniques with analyst review, the objective is to form an event catalogue representative of the seismicity during a time period. Event catalogues have value as they may be used for seismicity studies or as a working database for further scientific investigation. For these purposes, catalogues aid the comparison between localities and improve the detection of change over time. Equivalent to human “picking”, automated techniques detect seismic signals above background noise and calculate the magnitude and other quantitative characteristics of events using computational means. With regard to earthquake seismology, automated event detection techniques such as STA LTA (“short-term average over long-term average”; see Sect. for an explanation of how uppercase and lowercase STA/sta and LTA/lta abbreviations are differentiated) and correlation type have been used with increasing success and continue to be developed . These techniques are suitable for the seismological analysis of specific examples of the smaller and more varied signals from volcanoes, landslides, mine activity, and glacier processes.
In contrast to targeting the event detection to a specific signal or event type, we take an alternative approach to the detection process, intending it to be “catch-all” such that it includes both events and event-like noise. We use the term “event” broadly to include impulsive signals and waveform changes (such as an amplitude increase or frequency content change) with a less distinct onset. In some glacier environments, event-like noise is of as much interest as impulsive cryoseismic events, as both signal types yield insight into glacier and/or ice shelf processes. The workflow that we develop through this contribution thus aims to capture the wide variety of seismic events and event-like noise present in a glacier environment and any adjacent ice shelf (Part 1), and we subsequently undertake a reconnaissance of event types using unsupervised machine learning
The ObsPy Python project is a software package, widely used at the time of writing, for observational seismology including the implementation of the STA/LTA algorithm (or STA/LTA for short) and other data analysis tools in seismology research. As a framework for processing observational seismological data, ObsPy provides a community-wide resource, and we have drawn upon the nomenclature used therein unless otherwise stated. The STA/LTA algorithm calculates average values of absolute waveform amplitude in two time windows, one short and one long, and compares their ratio to a threshold value for detection . An event is triggered (i.e., an event arrival time is picked) at the point where the ratio rises above the trigger threshold value and is detriggered (i.e., an event stop time is picked) once it falls below the detrigger threshold value. When discussing the STA/LTA algorithm, we are explicitly referring to the recursive STA/LTA algorithm, which may be regarded as a current standard approach of this type. The recursive STA/LTA algorithm improves upon the classic algorithm as it reduces memory usage, yields a decaying exponential impulse response instead of a rectangular one, and limits shadow zones. Shadow zones occur when short event bursts (transients) in the STA cause a large LTA, and the recursive algorithm limits the dominating effect that such transients could have on successive event detections. All together, this produces a more efficient and smoother result . A shortcoming of the STA/LTA algorithm is that it is sensitive to changing noise levels. In environmental seismology studies, this leads to missed detections because the signals of interest have low signal-to-noise ratios and/or are diverse with regard to maximum amplitude and timescales .
In comparison to STA/LTA, correlation-type algorithms assume similar events will occur within a catalogue and so can be less prone to missed detections of specified events because they typically include some type of template matching . This involves computing a normalized correlation coefficient of a template for a previously known waveform and searching a dataset for similar waveforms . Under the umbrella of correlation-type algorithms, seismic studies have employed stacking algorithms and artificial neural networks . Other advanced techniques include assigning nonlinear filters and the computationally efficient similarity search . Despite successful applications in earthquake seismology, correlation-type algorithms can lead to missed detections in environmental seismology because of the varied and previously undetermined waveforms of the signals of interest.
Since STA/LTA and correlation-type algorithms have enjoyed only limited success when applied to environmental seismology, there is no community-wide consensus on a best event detection technique for generating event catalogues for cryoseismic signals. In fact, the many different methods that have been applied demonstrate the experimental and individualized approach to detection in cryoseismology . The most basic method is manual detection , which avoids missed and false detections associated with automated event detection. However, visually picking event arrival times in continuous cryoseismic data is time-consuming and analyst dependent. In terms of automated detection, many studies use the STA/LTA algorithm, which can be restrictive (i.e., only recognizes a single set of parameters) and so is difficult to apply across the diversity of signals that can come from various glacier environments. Therefore, several cryoseismic and environmental seismic studies have adapted STA/LTA to certain applications. For example, design a variation of STA/LTA based on the detection requirements of the seismometers in their study. In this case, a small sample of manually detected events are used to optimize parameters. Similarly, other hybridized approaches to STA/LTA are presented in and for the purpose of more accurate detection in diverse signal-and-noise environments, like those of microseismic networks. Accounting for the possibility of missed detections, rely on post-processing events detected by STA/LTA (e.g., extending an event's length, accounting for potential low-amplitude events that can be left undetected). apply STA/LTA to data filtered by two separate frequency bands such that parameters can be adjusted for event types characterized by different frequencies. Ultimately, most studies determine the parameters by trial and error
Alternatives to the amplitude-dependent STA/LTA are the use of other measures of an event, such as kurtosis, i.e., quantifying deviations from a standard distribution of waveform amplitudes , and spectrograms for frequency-related thresholds for detection . QuakeMigrate, an advanced extension of the STA/LTA algorithm and waveform stacking, relies on a coalescence (i.e., joint) energy approach that quantifies the cumulative seismic energy recorded by a network of stations . While QuakeMigrate and spectral-based methods have been shown to be effective for detecting basal icequakes, they are less effective at capturing longer tremors . Some studies also use STA/LTA and correlation-type methods together
The wide variety of techniques for the detection of icequakes highlights the extent of analytical challenges in event-based cryoseismology. Where the area of interest is an ice stream or other ice sheet outlet glacier, the challenge is increased by the remote location together with the need to undertake a reconnaissance across a relatively large area. The diversity of event types in glacier environments therefore suggests the need for a workflow comprising a (Part 1) catch-all algorithm that can automatically capture heterogeneous seismicity characteristics. Given the advent of machine learning research, a rapid and broad event detection method is a critical tool for preparing datasets for subsequent (Part 2) semi-automated reconnaissance. In terms of any type of analysis, a consistent approach from glacier to glacier will be particularly useful. The high potential utility of comparable event catalogues being generated from different seismic deployments provides motivation to develop a generalized approach for event detection and catalogue compilation for cryoseismology data.
In this contribution, as Part 1 of the workflow proposed above, we outline the development and testing of a novel event detection method, the multi-STA/LTA algorithm, which uses a set of sta–lta pairs to optimize the detection of diverse cryogenic signals. We form a synthetic set of test waveforms, using a Monte Carlo approach, and then perform a grid search to inform our choice of parameters for the subsequent application of the algorithm. We also compare the event detection performance of the multi-STA/LTA algorithm with the recursive STA/LTA method. In a subsequent section, we apply the multi-STA/LTA algorithm to a dataset from the Whillans Ice Stream to form an event catalogue, and we demonstrate how our semi-automated event detections are substantiated by a number of visual-based event detections. Our broad use of the term “event” includes both impulsive signals and waveform changes with a less distinct onset. The new event catalogue may be considered sufficiently comprehensive to allow for an appraisal of the recorded glacier seismicity and lends itself to subsequent analysis using unsupervised machine learning
Table 1
Parameter definitions for the multi-STA/LTA algorithm, based on their usage in ObsPy; see the main text for clarification. The right-hand column shows the ranges of the parameter values sampled in a fine-grid search for the best parameter value choice (Sect. S2.2b), the step size within the given ranges is uniformly distributed in space.
| Parameter name (units) | Definition | Range in parameter search |
|---|---|---|
| sta (seconds) | span of the minimum short time window | 0.001–100 |
| lta (seconds) | span of the minimum long time window | 1–100 |
| (unitless) | multiplier of sta (used to compute the span of the maximum short time window) | 10–1000 |
| (unitless) | multiplier of lta (used to compute the span of the maximum long time window) | 10–1000 |
| (unitless) | the target ratio between sta–lta pairs (used as a tolerance value to limit the | 1.78–100 |
| number of time windows that are generated to ensure computational efficiency) |
In this section, we develop the multi-STA/LTA algorithm to detect events with a range of durations and maximum amplitudes. This algorithm is based on the principles of the established STA/LTA procedure . We use a set of simulated waveforms for algorithm testing using a Monte Carlo approach and carry out a fine-grid search to inform the choice of multi-STA/LTA algorithm parameters, defined in Table . The relevant functions of the ObsPy package use nomenclature including the use of “sta” (lower case) as a duration for a short time window, and correspondingly “lta” refers to the span of a long time window, with both given in seconds. The short and long time windows are used to calculate the average amplitudes within these window spans. We explicitly use the uppercase “STA” and “LTA” as abbreviations for the short- and long-term averages. STA LTA is thus a ratio of the averaged amplitudes within these short and long time windows.
2.1 Algorithm description
The foundation of the multi-STA/LTA algorithm is the formation of a hybrid characteristic function (CF) for optimized event detection. In statistics, a CF often represents a probability distribution of maximum eigenvalues . When applied in this context in seismology, the CF is a nonlinear transform of a seismogram showing a probability distribution of likelihood for event detection. Physically based, it quantifies changes in the energy in a waveform's direction of displacement.
The multi-STA/LTA algorithm takes the given input parameters (Table ) and forms a hybrid CF through the following four steps:
-
Generate a set of short-time-window and long-time-window “sta–lta pairs”. The algorithm determines the number and spans of sta–lta pairs based on the input parameters: sta, lta, and , and .
-
Calculate the CF for each sta–lta pair using the recursive STA/LTA algorithm applied to an input waveform. The CF is the ratio of the absolute waveform amplitudes averaged within the short and long time windows.
-
Calculate a hybrid CF from the maximum values of each CF computed in Step 2 (i.e., maximum STA LTA ratios) using Eq. (1).
-
Trigger (or continue) an event while the hybrid CF value at a given point in time is above a trigger threshold and detrigger an event when the hybrid CF falls below a detrigger threshold, yielding an event list for the desired time period.
Larger values in a CF signify a greater likelihood of an event having occurred at that time; therefore, the maximum value of the CF at a given point in time should signify the highest likelihood of an event. The result is a hybrid CF that is tailored systematically to each event in a waveform. This contrasts the previously defined STA/LTA approach which uses one CF computed by one short and long time window. The multi-STA/LTA algorithm thus innovates on the recursive STA/LTA algorithm by merging multiple CFs, resulting in a hybrid CF.
The parameters of the hybrid CF used in the multi-STA/LTA algorithm, defined by the following equation, are selected to optimize the successful detection of events and to minimize false detections:
1 where the function is the CF for the multi-STA/LTA algorithm and the function is the CF for the recursive STA/LTA algorithm implemented per sta–lta pair. and are thresholds for which an event is triggered and detriggered, respectively, and is the number of sta–lta pairs, defined as the smallest integer satisfying the following equation: 2
As an example to demonstrate how the algorithm generates a set of sta–lta pairs (Step 1), using parameter values of sta 1 , lta 10 , 10, 10, and 2 as inputs to the multi-STA/LTA algorithm, the following pairs are generated: 3
Figure 1
Example to illustrate the differences in event detection for a recursive vs. a multi-STA/LTA approach. In the first panel (a) we provide a seismic waveform sourced from https://examples.obspy.org/ (last access: 14 July 2023) with a high-pass 1 filter. In (b), the CF for a single recursive sta–lta pair (sta 2.15 , lta 21.5 ), whereby one set of sta–lta pairs is used to calculate the STA LTA amplitude ratios, is provided as a point of comparison to (c), the hybrid CF (black) with the CFs for the four sets of sta–lta pairs in Eq. (3) that are generated by the multi-STA/LTA parameters sta 1 , lta 10 , 10, 10, and 2. Two reference lines for a typical trigger value of 3 and a detrigger value of 0.5 (dashed grey) are overlaid in (b) and (c); these values are picked to show the best comparison between event detections but are not subsequently applied to real data. In (d) the waveform is repeated from (a) with underlying horizontal bars showing the extents of each detected event per CF in (b) as they meet the respective trigger and detrigger thresholds. The multi-STA/LTA detected event (black) is at the maximum extent for the set of four recursive STA/LTA detected events.
[Figure omitted. See PDF]
The corresponding CFs (i.e., absolute amplitude ratios) will be calculated, for each pair in this set, and the maximum CF per time window will be used to form the hybrid CF (Steps 2 and 3). We illustrate the construction of the hybrid CF for this example parameter set applied to a waveform in Fig. .
2.2 Algorithm testing and optimizationWe use a Monte Carlo simulation to provide a set of waveforms for algorithm development. Each waveform thus generated includes randomized background noise and two events that vary in time of occurrence, duration, amplitude, and time between the events. The range of seismic signals that we intend to capture in our simulations includes long-period events with decaying amplitude, two events that are proximate in time with different amplitudes and durations, and low-amplitude events whose structure is barely detectable above background noise. A large population of simulated events are positioned at varying temporal separations with respect to each other in each simulated waveform. The set of simulated waveforms is not intended to accomplish the difficult task of replicating the exact nature of likely cryoseismic signals but rather to provide a working set of signals that present similar challenges to the STA/LTA algorithm, for example, events of different types being closely spaced in time.
We define two event classes and randomly sample for a uniform distribution over the variables of each class. The sample space for each class is shown in Table 2. Further details of the simulated waveforms including the equations for the two classes of simulated events, Eqs. (S1a) and (S1b) in the Supplement, and further discussion of the simulated seismic signal are given in Sect. S2.2a in the Supplement. Following the definitions of the event classes and sample space, we use statistical analyses to guide parameter choices for the algorithm.
Figure 2
A guide to parameter choice for the multi-STA/LTA algorithm: the marginal probability distributions of sta, lta, , , and . Each scatterplot reflects the fine grid of tested parameter values. Displayed are unconditioned values (blue circles); values conditioned by sta between 0.01 and 0.18 (green squares); values conditioned by the sta and lta at 100 (red triangles); and values conditioned by sta, lta, and the line 1.24 (black diamonds). The threshold for statistical significance is shown at 0.005 (dashed grey line). Further discussion of the fine-grid parameter search is provided in Sect. S2.2b.
[Figure omitted. See PDF]
We find the marginal probability distribution results for sta, lta, and , and (Fig. ). Finding ambiguity in the choices of and , we also look at the marginal probability distributions of the two parameters (Fig. ). These figures are provided in the main text to support the parameter recommendations. However, full discussion of the figures and statistical methods applied is given in Sect. S2.2b in the Supplement. Limitations to these methods, as they are applied, are examined in Sect. .
Figure 3
A further guide to parameter choices of and : the two-dimensional marginal probability distribution for and . Overlaid is the line of best fit for the points of highest success of event detection (red line: 1.24 ). A low probability signifies a rejection of the null hypothesis that events and noise are equally likely to be detected and, therefore, a higher success in event detection. The color bar is scaled by the maximum probability of rejecting the null hypothesis. Further discussion of the fine-grid parameter search is provided in Sect. S2.2b.
[Figure omitted. See PDF]
Table 2Sample space for the variables of each simulated event class. The values per variable are randomly sampled for a uniform distribution in space and are used to generate a simulated waveform containing two events as defined by Eqs. (S1a) and (S1b). Further discussion of the two event classes is provided in Sect. S2.2a.
| Variable | Sample space for Eq. (S1a) | Sample space for Eq. (S1b) |
|---|---|---|
| Amplitude () | 1–1000 | 1–1000 |
| Duration () | 1–100 | 1–100 |
| Period component () | 1–10 | 1–10 |
| Period component () | – | 10–100 |
| Decay () | 1–3 | 1–3 |
| Constant () | – | 1–1 |
We now apply the multi-STA/LTA algorithm to a real cryoseismology dataset – seismic recordings from the Whillans Ice Stream (WIS; also known as Ice Stream B) – with the aim of generating an event catalogue spanning the deployment period in the 2010–2011 austral summer.
Figure 4
Location of the Whillans Ice Stream (WIS) and the seismic stations used in this study. (a) Outline of Antarctica indicating the location of the Ross Ice Shelf and exposed bedrock (brown) of (for example) the Transantarctic Mountains from . (b) Location of the Whillans Ice Stream in the context of the Ross Ice Shelf (BB01: 84°1743.8066 S, 158°947.1636 W), with ice flow speed (). (c) The WIS temporary broadband stations deployed in austral summer 2010 and 2011. Shown are the stations used in this analysis (red shapes, labeled by station name) and the other deployed stations not used in this study (black crosses). Ice flow speed data are from . Map generated by the “agrid” Python module .
[Figure omitted. See PDF]
3.1 DataThe WIS is one of five major glaciers that discharge ice from the grounded West Antarctic Ice Sheet into the Ross Ice Shelf. It is a fast-flowing ice stream at 300 due to its well-lubricated, deformable subglacial bed ; however it is also decelerating at an estimated rate of 5.5 , resulting in a high deformation rate. It experiences large-scale stick-slip motion with tidal effects from the Ross Sea and Ross Ice Shelf. The WIS is therefore a locality with the potential for a wide range of cryoseismic events such as signals related to resonance in subglacial water-filled cracks and glacier earthquakes and tremors relating to the release of strain the ice–bed boundary
The WIS is a challenging case study for cryoseismic studies because its proximity to the Ross Ice Shelf (RIS) contributes to an especially noisy seismic wave field. Though the RIS front is distant from the WIS deployment (up to 600 ), indirect, external sources can still be detected on the WIS ; therefore, seismicity recorded on the RIS and WIS could have various potential generative mechanisms. As well as the stick-slip events noted above, signals are possible from teleseismic events , ocean swell and gravity waves , surface resonance , rift fracture , and flexure of the frozen surface . Environmental factors that are associated with detections of higher-frequency signals are tidal stresses, changes in air temperature or insolation, and wind speed .
Seismic sensors were deployed on the WIS in West Antarctica during the 2010–2011 and 2011–2012 austral summers . From the 49 stations deployed, we examine data from 14 stations (station names of format BBXX; Fig. , filled red shapes). The BBXX seismometers are all Nanometrics Trillium 120 s instruments with Reftek 130 D recorders using 200 sampling. Each station has continuous three-component data for the time period between 14 December 2010 and 31 January 2011. Excluded are stations BB02, BB05, and BB09 due to missing components and/or incomplete data for a significant proportion of the deployment.
A common practice of the recursive STA/LTA algorithm in earthquake detection is to use only the vertical component; however, we also wish to make use of the horizontal components. This is a practical solution in the case of an instrument problem on the vertical component and accounts for possible ray paths of seismic events to sensors. We utilize the root sum squared (Euclidean norm) of the north, east, and vertical () component amplitudes for each station; this allows weaker signals to be detected by the algorithm irrespective of the component. This absolute value (no implied physical meaning) is simply a means of triggering events. Conventional frequency filtering is not applied during pre-processing in order to maintain the full range of frequencies available to be captured by the multi-STA/LTA algorithm.
3.2 Compiling the master event catalogue for the WIS
We compile a cryoseismic event catalogue for the WIS using the multi-STA/LTA algorithm with optimized parameters as outlined above (Sect. 2.2). We use the waveform handling pipeline for automated analysis developed by , adding the multi-STA/LTA as a new option, with a view to subsequent signal reconnaissance using unsupervised learning
The two catalogues also report amplitude and energy, respectively, as metrics for quantitatively describing events. The amplitude is the maximum (peak) amplitude of a recorded event, and the energy is the integral of the amplitude squared with respect to time. Both metrics are in practice approximations because of the effect of the seismometer instrument-response function. In the reference catalogue, these values are determined by taking the average of the peak amplitude of the three seismometers (the exact number is a chosen variable) with the largest amplitudes. The trace catalogue lists these values by the station for which the event has been triggered. Reference event and trace (metadata) catalogues for the WIS region during the 2010–2011 austral summer include stations triggered per event, network time, duration, amplitude, and energy (
3.2.1 Assigning confidence and known seismicity labels to events
Events are first assigned high- or low-confidence labels based on the number of adjacent detecting stations (
Using the global seismic catalogue during the 2010–2011 austral summer, we find 68 events as potential teleseisms. We then label events according to how they compare to a local minimum in peak amplitude distribution in terms of arbitrary units () at 3.5 (, ): Teleseism I ( 3.5) and Teleseism II ( 3.5). We thus find 32 Teleseism I events and 36 Teleseism II events.
Figure 5
Automatically detected events shown as an overlay on manually identified events (black) to compare the performance of alternative algorithms. The axis shows the east component amplitude centered around zero, and the axis shows seconds from event arrival time. The first event (a) has been identified as a basal stick-slip event by . The second (b) has characteristics that suggest a tectonic teleseism (Teleseism I, main text); however it is possible that other modes of brittle deformation are contributing to the signal (investigated further in Fig. S4 in the Supplement). The third (c) also has characteristics that suggest a tectonic teleseism (Teleseism II, main text). Top row (purple overlay): detection by multi-STA/LTA, with recommended parameters (i.e., sta 0.03, lta 100, 18, 56, and 10). Second row (light-orange overlay): detection by the recursive algorithm with RECmin parameters (i.e., the minimum sta–lta pairs sampled by the multi-STA/LTA with recommended parameters: sta 0.03 and lta 100). Bottom row (light-blue overlay): detection by the recursive algorithm with RECmax parameters (i.e., the maximum sta–lta pairs sampled by the multi-STA/LTA with recommended parameters: sta 0.54 and lta 5600). The multi-STA/LTA algorithm combines advantages of the other algorithms, as it is able to capture the full length of extended events in common with RECmax.
[Figure omitted. See PDF]
Each event with a known seismicity label was subject to a visual review (stick-slip and teleseismic events are available as .pdf products organized by assigned labels in
We compare the multi-STA/LTA algorithm, implemented with the recommended sta–lta pairs, with two runs of the recursive STA/LTA algorithm for real data from the WIS. From this comparison, we aim to determine if only one sta–lta pair is contributing to the event catalogue. If so, it would signify that running multiple sta–lta pairs does not enhance the resulting event catalogue and a recursive approach would be sufficient.
Given the recommended parameters, we use the minimum and maximum sta–lta pairs from the multi-STA/LTA set as inputs to the recursive algorithm (Fig. , caption). We choose these two corner cases from the multi-STA/LTA parameters for sta–lta pairs as a sensible point of comparison between the two algorithms that can be easily illustrated. The minimum recursive sta–lta pair is referred to as RECmin, and the maximum as RECmax.
In order to compare algorithms, we identify three detected events that exhibit diverse characteristics (i.e., impulsive or emergent behavior, envelope descriptors, and durations), with the time frame of each event being set by the detected multi-STA/LTA event duration. We then highlight the events detected by the multi-STA/LTA, RECmin, and RECmax approaches within the indicated time frames (Fig. ). The frequency spectrum and other available information are also reviewed to aid the analysis of the event (Figs. S3–S5).
For the event shown in Fig. a that begins on 26 December 2010 at 18:07:00 UTC, we identify the source mechanism as basal stick-slip based on a previous study
Figure 6
Reference event catalogue feature distributions. Shown on the axes are duration ( ), total energy ( ), and peak amplitude ( ), while the axis shows the occurrences of events with the respective feature value. Detections from the three algorithms are MSL, i.e., multi-STA/LTA (low confidence, light-grey filled, stacked; high confidence, dark-grey filled); the recursive RECmin (light-orange outline); and the recursive RECmax (light-blue outline). The multi-STA/LTA algorithm combines advantages of the other algorithms, as it is able to match, and improve upon, the detections achieved by RECmin while capturing the full length of extended events in common with RECmax (see also Fig. , with parameter values given in caption). False detections are less of a concern using the current workflow than in conventional catalogue generation as events are appraised using the Part 2 unsupervised learning with both events and event-like noise potentially providing insight into glacier processes. See also the discussion of low-confidence events following patterns of high-confidence events in the main text.
[Figure omitted. See PDF]
RECmax is effective at detecting the initial arrivals of events and can capture the full length of long events (Fig. ). In contrast, RECmin generally splits up events that are known to be continuous into separate segments. As a synthesis of the advantages of both RECmin and RECmax, the multi-STA/LTA algorithm with the recommended parameters detects the full lengths of a variety of event types.
The populations of events that are yielded by the different algorithms are compared using the reference event catalogue features: duration, total energy, and peak amplitude (Fig. ). We use the term “occurrence” to avoid possible confusion between count frequency and waveform frequency. Event durations from the multi-STA/LTA catalogue show a near-symmetrical distribution in semi-log space, with an equivalent number of very short and very long events and a maximum occurrence at about 10 duration. RECmin, in contrast, cannot detect events greater than 300 , and RECmax skews towards longer events, likely missing the large subset of shorter, potentially real events detected by multi-STA/LTA. The total energy of events detected by both multi-STA/LTA and RECmin plots as a near-symmetric peak, but RECmax does not detect the lower-energy events. All three algorithms show detections for approximately 50 events at very large energies, greater than 10 ().
The distributions of the high- and low-confidence multi-STA/LTA events are similar for higher durations ( 1 , seconds), energies ( 5 , ), and amplitudes ( 3 , ). The distribution of low-confidence events where there are no high-confidence events highlights, logically, that we have greater uncertainty in the shorter and weaker detected events. However, the low-confidence event distributions also follow similar general trends to the high-confidence events, lending credibility to the potentially real source mechanisms for some events that are labeled low confidence.
The distribution of the peak amplitudes provides source mechanism information that would commonly be extracted from the magnitude of a tectonic event
The purpose of the event detection algorithm development and workflow is to facilitate a consistent approach to the generation and analysis of event catalogues for cryoseismology and similar research.
4.1 Limitations
The varied nature of cryoseismicity raises the question of how an event should be defined for inclusion in the catalogue. Two definitions were tested with regard to how an event is detected by an array. In the first definition, we included a trigger condition that defined a detection when three or more seismometers were triggered within 60 of the same arrival time. This biased the catalogue towards events with impulsive initial arrivals, resulting in events that we know to have a single source mechanism to be truncated prematurely or divided up into separate events. As stick-slip events are known to last between 20–30 , we tested (and make subsequent use of) a second definition that merges overlapping events from three or more seismometers together regardless of initial start time (Fig. S6). In all cases, the start times in the resulting catalogue indicate the time window within which the signal is observed and not the origin time of the event. No event locations are calculated or recorded in the catalogue, which is intended as a generalized reconnaissance tool. As a drawback to this approach, a small number of event groups might be catalogued under a single energetic reference event even though the source mechanisms could be different.
It is possible that events in other locations of interest for cryoseismology have event types with substantially different seismic signatures than those of the WIS (on which our simulated waveform population was based). As an example, stick-slip events can have a wide range of lengths and frequencies with documented durations from 0.1 to 1000 and frequencies from 0.01 to 1000
Use of the Euclidean norm, computed from the three-component seismic signal, has the advantage of enabling event detections irrespective of the component on which the signal arrives with the highest amplitude. However, insights yielded by separate P and S wave signals could be lost in this process, and we encourage analysis, following the reconnaissance enabled by the Part 1–Part 2 workflow, that makes use of all three components. Examples include array methods and beamforming, of relevance to both impulsive events and event-like noise bursts . Such array approaches have excellent potential for glacier studies as seismic sensor, low-power instrumentation and battery technology continue to evolve.
In this study we use the Monte Carlo approach to optimize the five key model parameters that have the strongest conditional interplay when applying the multi-STA/LTA method (sta, lta, and , and ) as previously described (Fig. 2). Secondary parameters, which will vary based on the study environment (i.e., background noise and seismic signal amplitudes), include the trigger and detrigger values. These values were set in this study following a brief, visual-based analysis as this was a straightforward process. Whilst any parameter choices could be optimized through the Monte Carlo analysis, the visualization and appraisal process needed for the trigger values could become unwieldy. In general, the parameters that are used should be recorded and supplied with the resulting catalogue.
Figure 7
The complete reference event catalogue for multi-STA/LTA detected events shown as bivariate plots for (a) all events and (b) high-confidence events only. The filled markers show all events (grey), stick-slip events (blue), Teleseism I events (yellow), and Teleseism II events (pink). In the panels showing total energy versus duration (bottom row), events are subdivided according to peak amplitude to show the distribution of smaller events 3.5 (light blue-grey). The threshold of 3.5 is determined from the local minimum in the amplitude–occurrence distribution in Fig. . Some events corresponding to amplitudes 3.5 are obscured by points corresponding to larger-amplitude events and/or known seismicity labels.
[Figure omitted. See PDF]
4.2 Event catalogue analysisWe now investigate bivariate relationships in the event catalogue produced using the multi-STA/LTA algorithm (all events, Fig. a; high-confidence events only, Fig. b), expanding on the univariate investigation (Sect. 3.3) and using the same event catalogue features: duration, total energy, and peak amplitude. This facilitates the exploration of event types and thus the possible source mechanisms that make up the catalogue. Here, the qualitative assessments of two-dimensional event types are provided to be descriptive; we further investigate event types in the companion paper to this work .
We choose the local minimum in the amplitude–occurrence distribution (Fig. ) as a rudimentary threshold for a split between two large groups of events: events with amplitudes 3.5 and events with amplitudes 3.5 As described in Sect. 3.2.1, to support catalogue reconnaissance, we have manually identified events of stick-slip origin and teleseismic events (divided as Teleseism I with amplitudes 3.5 and Teleseism II with amplitudes 3.5), confirmed using several verification methods elaborated in the
The general trend between peak amplitude and duration (Fig. , top) and energy and duration (Fig. , bottom) of events is consistent with the positive linear association expected from cryogenic sources
The Teleseism I events show similar distributions on the bivariate plots to the stick-slip events, with a wider range of durations (0.1–10 000 ). This spread emphasizes the diversity of earthquake sources during the WIS deployment, in terms of distance from WIS, magnitude, and depth. The Teleseism II events follow the Teleseism I event trends but into the domain of shorter and weaker signals. In the figures showing only high-confidence events, the Teleseism II events cluster in a small range of durations (10–100 ), amplitudes (3–3.5, ), and energies (6–8, ).
The events of higher amplitude ( 3.5) that are detected with long durations and high energies similar to the stick-slip events could inform studies of the WIS, as they likely pertain to local, active glacier processes. We infer that these events represent more than one glacier process because we can identify several clusters of events in these bivariate spaces. For example, there is a cluster of events with very high energies and long durations (Fig. , bottom). As another example in these two dimensions, there is a cluster of events with high energies and durations between 10–100 .
The events of lower energies, clustered above but adjacent to the 3.5 threshold, that occur for long durations (Fig. , bottom) suggest the presence of harmonic tremors in the catalogue. Harmonic tremors are observed from ice shelf processes (such as iceberg dynamics or ocean waves around ice shelves – , and , respectively) or subglacial water flow beneath an ice stream . Other events (lower amplitudes and energies, shorter durations) likely pertain to the noise events (Sect. 3.3). The probable source mechanisms of these event types are investigated in .
Figure 8
Reference event occurrence against downstream tidal height for the WIS. Events are grouped into rows for (a) all events, (b) events occurring on a falling tide, and (c) events occurring on a rising tide. Bars are shaded light blue-grey for low-confidence events and dark grey, stacked, for high-confidence events. From left to right, columns show all peak amplitudes, peak amplitudes 3.5, and peak amplitudes 3.5. Tidal heights () are determined for a downstream location (84°2020.3994 S, 166°00 W) from the CATS tidal model . This location is on the Ross Ice Shelf, 59 from station BB12 (i.e., the seismometer in the WIS array located furthest downstream). Teleseismic events are included with a view to streamlined future workflows but are not sufficient in number to mask the temporal patterns shown (Table S2).
[Figure omitted. See PDF]
4.2.1 Possible tidal controlIn view of the possible tidal control on the events of the WIS, we undertake a newly enabled overview analysis of this relationship, based on the catch-all identification of events in the catalogue (produced using the multi-STA/LTA algorithm). We acknowledge that the length of the deployment in this study is relatively short for such an analysis, but the comparison indicates what is probable for records covering longer time spans. The tidal heights that we use for comparison are estimated using the Circum-Antarctic Tidal Simulation
The separation of events by falling and rising tides demonstrates that seismicity patterns are moderately correlated to the periodic tidal cycles, mostly with little difference in tide direction (increasing or decreasing), shown by the similarities in Fig. a–c. The events with amplitudes 3.5 show a tendency towards positive tide heights. The high-tide tendency is corroborated in the case of stick-slip seismicity, which occurs preferentially at maximum power during high tide near the center of the stick-slip region and of the array
Figure 9
Reference event occurrences, split into daily totals, are shaded light blue-grey for low-confidence events and dark grey for high-confidence events (stacked). Daily occurrence is plotted against (a) downstream tidal height for the WIS (red, right axis) and (b) ice surface temperature at all relevant stations (dark-blue line is the daily average for the area, and the blue shading covers the minimum and maximum station daily temperatures, right axis). Tidal heights () are determined for a downstream location, as in the caption for Fig. . Ice surface temperatures (°C) are from the high-resolution (25 ) NOAA Climate Data Record (CDR) of the AVHRR Polar Pathfinder (APP) Cryosphere, Version 2, at each of the 14 BBXX station positions .
[Figure omitted. See PDF]
We further examine the context of the events by evaluating the relationships between tides and ice temperature variations and the timing of event occurrence (Fig. ). The ice temperature, retrieved for each of the 14 BBXX station positions over the deployment period, is a surface product sourced from the AVHRR Polar Pathfinder Cryosphere (Fig. caption). The neap and spring tide cycles correlate to some extent, with a lower and higher number of events per day, respectively. A possible causative mechanisms could be processes occurring within the Ross Ice Shelf in response to ocean gravity waves . In comparison, the ice temperature variations appear to be more weakly correlated to event occurrence. Even so, cooling through the months of available data may have a gradual influence on the seismic response of the WIS and the surrounding region
The systematic compilation of reference event and trace catalogues using the multi-STA/LTA algorithm newly enables the future application of a variety of seismic techniques to understand glacier dynamic and hydrological processes, as the event start time (signal arrival window) and other information are provided to the analyst. Further, the production of catch-all (near-comprehensive), reproducible event catalogues is a critical step towards standardized glacier monitoring as comparative studies between locations are enabled. The algorithm and workflow may enable a more complete analysis of diverse events from longer-duration networks. In this way, new seismic deployments with “in-ice” stations can draw on the experience gained in Greenland and Antarctica including large-scale seismic networks like the Greenland Ice Sheet Monitoring Network (GLISN) and the Polar Earth Observing Network (POLENET) . The multi-STA/LTA algorithm could be applied to these long-duration deployments to enable an expanded catalogue and optimize fill-in deployment planning. We intend the multi-STA/LTA algorithm to be used as an additional tool in the cryoseismology toolbox and endorse existing approaches such as template matching or array approaches if the intent of the cryoseismology study is to examine or locate a more specific event (glacier process) type
The event catalogue produced here includes a list of the seismic stations in the array which detected each event; the network time at detection; and the duration, amplitude, and energy. Complementing the reference catalogue, the trace (metadata) catalogue enables manual analysis of represented stations. The new catalogue will find utility in guiding conventional glacier seismology, taking the place of a lengthy manual reconnaissance of event types in most cases and also pointing to any temporal patterns in event and event-like noise occurrence.
Further, investigation of event types using a machine learning approach, which is being used increasingly , has been enabled and is the subject of a companion paper . One of the key outcomes of our current study is that the catalogue reveals the diverse character of events from the nearby ocean and ice shelf, in addition to the events within the ice stream. Of these, many events are ambient noise, but the fluctuating noise level means that they manifest as events. Therefore, using software such as that provided in , we aim to investigate the variety of event types using unsupervised learning based on the features computed from the seismic time series per event (equivalent to the feature sets described in ).
While the methods described have been developed and tested for a glacier environment, a similar workflow, including use of the multi-STA/LTA algorithm, has potential for application to other similar environments, such as volcanoes, landslides, and mining.
The semi-automated nature of the processing makes glacier monitoring using seismic methods increasingly feasible. Large outlet glaciers drain and buttress major ice sheets covering Greenland and Antarctica from the warming ocean. The contribution of these glaciers to sea level rise constitutes an increasing threat . The information from catch-all (near-comprehensive) event catalogues would enable the detection and further understanding of hidden processes such as brittle cracking and basal slip and provide improved temporal resolution of intermittent processes such as melt episodes and calving. In tandem with other information, such as that provided by satellite data, this provides a means to advance our understanding of glacier dynamics and the response of glaciers to forcings and change.
5 Conclusions
We present a novel seismic event detection algorithm (multi-STA/LTA) that successfully detects events that have low signal-to-noise ratios and/or are diverse with regard to maximum amplitude and event duration. Using a Monte Carlo simulation of test waveforms and subsequent parameter search, we demonstrated how the algorithm parameters can be optimized. The algorithm's utility in glacier seismology for generating a catch-all event catalogue has been illustrated through application to 14 stations from the Whillans Ice Stream 2010–2011 austral summer seismic deployment
We find that multi-STA/LTA is more adept than the conventional recursive approach at capturing diverse events that are characterized by a wide range of durations, amplitudes, and energies. In particular, the multi-STA/LTA approach detects events across a wide range of characteristic timescales, with durations varying by at least an order of magnitude, in contrast to implementations where the computation is based on a single set of such parameters.
The catalogue is appraised through assigning high confidence (approximately 35 %) and low confidence to events. We show that the low-confidence event distributions are similar to those of the high-confidence events in most cases. The significant proportion of low-confidence events for this catalogue highlights the challenges of glacial seismology in a noisy environment such as that of the Whillans Ice Stream and surrounding Ross Ice Shelf, where both local events and those external to the ice stream are potentially of interest. Many of the captured events are not immediately obvious to a visual check of the time series but show a shift in frequency content on closer analysis.
We demonstrate the utility of the catalogue through investigating aspects of event property distributions and links to possible signal generation mechanisms. We are able to begin analysis of the diverse event types, including stick-slip seismicity and teleseismic events, all produced from one heterogeneous catalogue. Events in the catalogue are visualized in terms of their duration, energy, and peak amplitudes. We find a partial association of seismicity with the tidal cycle, noting that a longer deployment would be preferable for such an analysis, and we consider 11 % of the catalogue to be stick-slip and teleseismic events (Sect. 3.2.1). We find a slight association with ice surface temperature, as an indicative example of one atmospheric observable. For both results, longer time series would be needed to support a statistical test for correlation; therefore we use the term “association” to indicate a qualitative assessment.
The new algorithm and workflow for systematic event detection have multi-faceted potential. For conventional seismological analysis, they will enable the reproducible generation of catch-all (near-comprehensive) event catalogues for cryoseismology and facilitate further manual analysis. They will also enable progress in the wider fields of environmental and geotechnical applications of seismology. Significantly a semi-automated approach to data analysis is enabled such that machine learning and other automated analyses may be used to enhance pattern detection and dataset exploration. Improving analysis capabilities, whether by conventional or semi-automated means, should prove to be a valuable step forward in analyzing the response of remote glaciers to global change.
Code and data availability
The codes that produce the discussed results are written in Python, open-access, and available for download (10.5281/zenodo.5499909); further information on the implementation and architecture of the software is accessible in . The codes used to produce figures and numerical results in this paper are available in Jupyter Notebook format (.ipynb), for generalized use (10.5281/zenodo.11062069; ). The
The supplement related to this article is available online at:
Author contributions
RBL developed software, carried out data analysis, and wrote the text; RJT developed software and provided guidance; AMR gave overall project direction and provided guidance; JPW advised on the dataset and provided guidance. All authors contributed to the refinement of the text.
Competing interests
The contact author has declared that none of the authors has any competing interests.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
This research was funded under Australian Research Council Discovery Project DP210100834, with additional support from DP190100418 and ARC's Special Research Initiatives: Antarctic Gateway Partnership, SR140300001, and the Australian Centre for Excellence in Antarctic Science, SR200100008. The software utilized for the analysis was based in the ObsPy Python project for seismic analysis . We also made use of for the ObsPy software library that implements the multi-STA/LTA algorithm and other analytical tools, as well as event catalogue production. Figure was generated by agrid Python module with assistance from Tobias Stål. The Tide Model Driver (TMD) toolbox (
Financial support
This research has been supported by the Australian Research Council (projects DP210100834, DP190100418) with additional support from ARC's Special Research Initiatives (Antarctic Gateway Partnership, SR140300001; Australian Centre for Excellence in Antarctic Science, SR200100008).
Review statement
This paper was edited by Evgeny A. Podolskiy and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Cryoseismology is a powerful toolset for progressing the understanding of the structure and dynamics of glaciers and ice sheets. It can enable the detection of hidden processes such as brittle fracture, basal sliding, transient hydrological processes, and calving. Addressing the challenge of detecting signals from many different processes, we present a novel approach for the semi-automated detection of events and event-like noise, which is well-suited for use as Part 1 of a workflow where unsupervised machine learning will be used as Part 2
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
1 School of Natural Sciences (Physics), University of Tasmania, Private Bag 37, Hobart, TAS 7001, Australia
2 Department of Geological Sciences, Central Washington University, Ellensburg, WA, USA





