1 Introduction
Several magnitude scales have been defined to characterize the size of an earthquake. We can, however, divide magnitude scales in two groups, with one including magnitudes based on the amplitudes and periods of different seismic phases measured on band-limited signals (e.g., the body and surface wave magnitudes; ) and the other including magnitude scales related to estimations of macroscopic physical parameters of the earthquake source. The latter comprise the moment (; ) and the energy (; ) magnitudes, which are based on seismic moment and radiated seismic energy , respectively. These two magnitude scales are somewhat complementary because, although both represent an estimation of earthquake-related energy, they are determined by different parts of the source spectrum. The seismic moment extrapolated from the low-frequency end and represents the release of elastic energy stored in the Earth's crust or mantle being proportional to the integrated slip across the fault surface. The radiated seismic energy describes the fraction of the total energy released being radiated as seismic waves across all frequencies; i.e., it depends on the earthquake dynamics such as rupture velocity but also stress drop. estimates have been shown to play an important role when used in conjunction with to better characterize the tsunami and shaking potential of an earthquake .
is routinely computed from long period signals of broadband recordings, and it has become a robust and reliable source parameter for large and moderate earthquakes worldwide . On the other hand, the computation of is hindered by the necessity of integrating the velocity power spectra over a wide frequency range, while using signals in a limited bandwidth and taking into account propagation effects at high frequencies.
Aiming at validating and testing the procedures for operational purposes, we present a seismic catalog of computed, following the methodology proposed by and for the rapid assessment of energy magnitude (i.e., without requiring additional source information other than the hypocentral location). The approach is based on the analysis of spectra computed for teleseismic vertical component P waveforms. Teleseismic P waves are commonly used to compute for global earthquakes, as their energy loss during propagation can be more reliably modeled compared to S waves. We further present a detailed analysis of the residuals in a reduced high-quality catalog for events with with respect to the available in the GEOFON (GEOFOrschungsNetz) catalog and the values computed by IRIS.
2 Energy magnitude computation
2.1 Single-station estimation
We implement the methodology proposed by and to compute . Teleseismic vertical component P waveforms (BHZ channels, where B stands for broadband, H for high-gain seismometer, and Z for vertical component) are analyzed in the distance range from to and for earthquakes shallower than 80 km. Standard teleseismic range usually starts at , but we use to allow closer stations to be used for rapid response purposes. Shorter distances, however, are difficult to include for global earthquakes, as regional effects are not well accounted for with a 1-D model. Propagation effects are accounted for by frequency-dependent amplitude decay functions that are computed numerically for the ak135Q model in the frequency range 0.012–1 Hz.
An estimate of radiated seismic energy is obtained for each single station from the integral of the power spectra of the vertical component P waveform, which is corrected for propagation effects :
1 where , , and are the P-wave velocity, S-wave velocity, and the density at the source, respectively. is the frequency, and Hz and Hz are the lower and upper limits of the considered spectral bandwidth. is the P-wave velocity spectrum. is the median value of Green's function spectrum for displacement, computed from multiple combinations of focal mechanisms and varying strike, dip, and rake over a regular grid .
We used analysis windows starting 10 s before the P arrival and with lengths of 90 s for , 120 s for , and 180 s for . The energy magnitude estimate for a single-event station pair is in turn computed as , with given in joules . The procedure provides estimates at each recording station that can be averaged to minimize path-specific deviations not accounted for by the theoretical model (e.g., directivity and focal mechanism effects and regional variations in attenuation).
2.2Open-source tool for computing
The above procedure is implemented in the package me-compute . The program uses stream2segment to download events, station metadata, and waveforms from International Federation of Digital Seismograph Networks (FDSN)-compliant repositories in a structured query language (SQL) database.
In our application, the download is configured to fetch events from the GEOFON event web service, selecting events with computed in the time span 2011–2023. Waveforms are download from EIDA and IRIS (
The final output consists of the following files:
-
a tabular file in a hierarchical data format (HDF), where each row represents the metadata and measurements, specifically also the station energy magnitude estimate, for a single waveform;
-
a tabular file in a comma-separated value (CSV) format aggregating the results of the previous file, where each row represents a seismic event, reporting the event data end metadata and including the estimate for the event;
-
an HTML (HyperText Markup Language) file visualizing the selected content reported in the CSV file, where the information for each event can be visualized on an interactive map; and
-
one file per processed event in the Quake Markup Language (QuakeML) format, where we also included the value.
We use me-compute to compute for earthquakes since 2011 in the GEOFON catalog. Table 1 summarizes the steps followed to compile the disseminated catalog. The catalog reports the single-waveform energy magnitude estimated at station for earthquake . The energy magnitude for each considered event is then computed as the median of over the set of recording stations, without considering station static corrections. The starting dataset D0 consists of more than 1 000 000 waveforms (channels BHZ) generated by 6963 earthquakes recorded by 7765 stations belonging to 246 different networks. Only recordings with an average signal-to-noise ratio (SNR) for the amplitude greater than 3 within the frequency range of interest are included in D0. Several integrity and quality checks are applied to remove outliers and faulty signals. Dataset D1 is obtained by analyzing the median residual at the network level, discarding 14 networks characterized by median residuals outside the 2.5 and 97.5 percentile range (Fig. a). Dataset D2 is then generated by analyzing the station median values and excluding 382 stations with residuals outside the 2.5–97.5 percentile range (Fig. b). Most of the networks and stations removed will have instrumental problems or faulty metadata regarding instrument responses, although in some cases stations with very strong site effects might also be excluded.
Figure 1
Median network residuals (circles) for dataset D0 (a) and median station residuals for dataset D1 (b). Red lines correspond to the 2.5 and 97.5 percentiles of the distributions. For each network (a) and station (b), the horizontal bars correspond to the interval median (circle) median absolute deviation (MAD). A few values falling outside the range considered for the horizontal axis are not shown.
[Figure omitted. See PDF]
An anomaly score is computed to further refine the dataset by flagging anomalous amplitudes using the software sdaas . The software, developed from the work of , is based on a machine learning algorithm specifically designed for outlier detection (Isolation Forest) which computes an anomaly score in , representing the degree of belief of a waveform to be an outlier. The score can be used to assign robustness weights or to define thresholds above which data can be discarded. After inspecting the distribution of the anomaly scores, we set the threshold to 0.62 for and to 0.80 for .
The spatial distribution of events and stations generating dataset D3 are shown in Fig. a, b. The corresponding residuals are shown in Fig. against distance and . The largest positive residuals correspond mostly to earthquakes with recorded at distances , where the implemented methodology is expected to generate biased station estimates due to the limitations in the analyzed bandwidth and low signal-to-noise ratio . The overall residual distribution is unbiased and does not show trends of the mean value with distance and magnitude.
Figure 2
Panels (a) and (b) show event and station locations for dataset D3 (Table 1), respectively. Panels (c) and (d) show event and station locations for dataset D6 (Table 1), respectively.
[Figure omitted. See PDF]
Table 1Datasets considered in this study. Datasets in bold are available in .
Dataset | Records | Networks | Stations | Events | Selections applied sequentially |
---|---|---|---|---|---|
D0 | 1 126 465 | 246 | 7765 | 6963 | |
D1 | 1 072 381 | 232 | 7617 | 6944 | Network selection (2.5–97.5 %) |
D2 | 1 034 833 | 228 | 7235 | 6880 | Station selection (2.5–97.5 %) |
D3 | 1 031 396 | 228 | 7234 | 6349 | Anomaly score (, for , resp.) |
D4 | 754 025 | 228 | 7228 | 1731 | |
D5 | 751 567 | 227 | 7135 | 1731 | No. of records per station |
D6 | 750 903 | 227 | 7135 | 1671 | No. of records per event |
Dg | 153 | Comparison between D6 and real time |
Figure 3
Energy magnitude residuals versus distance (a) and moment magnitude (b) for dataset D3. Blue dots indicate residuals also included in D6. The horizontal red lines bound the 90 % confidence interval of the residual distribution. The error bars indicate the mean standard deviation of the residuals computed over different distance ( wide) and magnitude (1 m.u. wide) intervals.
[Figure omitted. See PDF]
Therefore, we further limit the dataset by only considering events with and at least 10 single-station measurements; we further exclude stations with fewer than 10 recordings in total. We added a column in the disseminated D3 dataset to flag lines corresponding to D6. It consists of waveforms for 1671 earthquakes and 7135 stations. The event and station locations of D6 are shown in Fig. c and d.
4 Quality assessment via residual analysisWe perform residual analysis to validate the D6 catalog. The relationship between and is analyzed by performing the following mixed-effects regression :
2 where is the single-waveform energy magnitude estimate at station for earthquake , is the moment magnitude of earthquake , intercept and slope parameters define the median model, and and are terms that capture station-specific and earthquake-specific adjustments, respectively. accounts for the leftover effects (i.e., residuals that are specific to a particular path/waveform). The random effects , , and are zero mean normal distributions by construction. In particular, (inter-station residual) can represent site effects or instrumental gain corrections, with most of the latter probably removed by the outlier filtering stages described above. The interevent residual is an event-specific deviation from the expected for a given from the linear regression term. Finally, can be thought of as a noise term for individual measurements, which can be either related to path-specific heterogeneity in attenuation with respect to the 1-D reference model or the influence of ambient noise on the actual measurement.
The interevent and inter-station term distributions are shown in Fig. , which are described by standard deviations of and m.u., respectively; the standard deviation of the is m.u. When combining the interevent variability with the intra-event variability equal to , we obtain the total standard deviation , which represents the variability in the single-station residuals with respect to the average computed per event. It is worth noting that the values can be used as station corrections to compute the energy magnitude of new events. In this case, the inter-station contribution to the total variability is removed, and the expected variability in the distribution is reduced to . Finally, the linear regression model is defined by the coefficients m.u. and . Considering the simplicity of the linear model in Eq. () and the large dataset analyzed, the uncertainty in the median model (sometimes referred to as ; ) is very low, increasing from 0.007 for to 0.039 for .
Figure 4
Cumulative distribution functions for event (a), station (b), and leftover distributions (circles) determined according to the mixed-effects regression in Eq. () applied to dataset D6. Dotted lines correspond to standard deviations (a), (b), and (c). Horizontal red lines in panels (a) and (b) are the standard errors in the random effects. In panel (c), values of exceeding in absolute value are not shown.
[Figure omitted. See PDF]
Figure 5
(a) Distribution of the site-specific residuals (see Eq. ) and (b) magnified view over a portion of Europe. Numbers in panel (b) indicate the following locations: (1) Netherlands; (2) Harz highlands, Germany; (3) Switzerland; (4) Po plain, Italy; (5) Pyrenees mountain range; (6) Apennine Mountains; (7) East Anatolian Fault region; and (8) Moesian Platform.
[Figure omitted. See PDF]
We show the spatial distribution of in Fig. . Since is computed considering spectral values below 1 Hz and using teleseismic recordings for distances above , capture station-specific effects connected to large-scale geological and tectonic crustal features, as exemplified in Fig. b for stations located in Europe. Positive (i.e., larger than the median) are observed for stations located in basins like in the Po plain, in the Moesian region, in the Netherlands, and in the East Anatolian Fault region. Negative values (i.e., lower than the median) are observed for stations located in mountain ranges such as the Pyrenees, the Alps, or in Harz Highlands and also tectonically highly active regions that are as cratonic as the East African Rift. The station terms can represent both site amplification, e.g., for stations in sedimentary basins, and anomalously high or low attenuation in the crust and or mantle surrounding the station. The station-specific residuals are disseminated along with the catalog to allow the computation of for future earthquakes, while taking into account static magnitude corrections to reduce variability.
The spatial distribution of the interevent variability, , is shown in Fig. for the smallest and largest values.
Figure 6
Extreme values for event specific residuals for the versus mixed-effects model of Eq. (). Only values below the 10th percentile (panels b and d) and above the 90th percentile (panels a and c) of the distribution are shown (the percentiles are about ). In panels (a) and (b), earthquakes with hypocentral depths shallower than 30 km are selected. In panels (c) and (d), events deeper than 30 km are considered. The distribution of versus depth for all events is shown in panel (e).
[Figure omitted. See PDF]
Considering depths shallower than 30 km (Fig. a and b), continental Asia, the Philippines, and Indonesia, as well as the Aleutian Islands, show positive values. California, Mexico, central America, and the Mid-Atlantic Ridge are characterized mostly by negative values. Considering deeper events (Fig. c and d), Japan and the Philippines have mostly positive values, while Mexico and central America have mostly negative values. The event-specific residuals are also disseminated, along with the catalog to increase the usefulness of the product from the event point of view, to allow the user to perform further refinements.
Figure 7
(a) Residual distribution of Eq. () for three different receiving areas, showing only . (b) As in panel (a) but considering only the European receiving area. (c) As in panel (a) but considering only the receiving area in California. (d) As in panel (a) but considering only the receiving area in Australia. Circles indicate the earthquake locations.
[Figure omitted. See PDF]
Figure 8
versus scaling. Gray circles are the station estimates, and filled circles represent event values calculated as medians of all station estimates for that event. Color indicates how many stations contributed to each estimate. The best-fit line in green is derived from the mixed-effects regression (Eq. ), considering interevent standard deviation (red lines). The faint black line shows equality for reference.
[Figure omitted. See PDF]
Path-specific residuals are shown in Fig. for three selected receiving areas in Europe, California, and Australia. Since in the partition of the residuals the leftover distribution represents the component not related to systematic station and event effects, they are mostly connected to lateral variability in attenuation in the Earth's interior with respect to the used global 1-D model and amplitude variation related to P-wave radiation patterns for different focal mechanisms.
Finally, the versus scaling defined by the linear regression coefficients and of Eq. () is shown in Fig. .
4.1 Catalog validation: comparison with IRISThe energy magnitude computed in this study is compared to the values disseminated by IRIS through the SPUD service . The methodology implemented by IRIS is described by and based on the analysis of and . Similar to our approach, the energy flux is computed from the P-wave group (P pP sP) in the frequency domain. The single-station estimations are corrected for frequency-dependent anelastic attenuation effects and converted back to the energy radiated by the source by applying corrections for geometrical spreading, depth, and mechanism-dependent effects for P waves and considering a theoretical partition of the energy between P and S waves. The energy is computed considering the frequency range 0.014–2 Hz (broadband for ) or 0.5-02 Hz (high frequency for ), analyzing stations in the distance range 25–80°. The duration of the time window used for the computation is based on analysis of the cumulative high-frequency energy (0.5–2 Hz) as a function of time. The crossover time used to compute the energy flux is identified at the intersection between the near-constant increasing rate for short times and the relative flat asymptotic behavior for long durations. The SPUD service disseminates both the high-frequency and broadband estimates.
Two regression models are calibrated against the broadband and high-frequency estimates disseminated by IRIS through SPUD. The best-fit models, shown in Fig. , are and , with the standard deviation of the residuals equal to 0.234 and 0.175, respectively. For the magnitude range from 6 to 8, this results in biases of 0.06 m.u. for vs. and varying from 0.17 to m.u. for vs. ; i.e., our estimates are nearly unbiased relative to and tend to slightly overestimate at the lower end of the applicability range.
Figure 9
Comparison with energy magnitude disseminated by IRIS considering (a) and (b) (717 common events). The red line shows the linear regression fit, and the dotted lines show 1 standard deviation of the residuals. The blue line shows line of equality for reference.
[Figure omitted. See PDF]
4.2 Catalog validation: role of style of faultingThe faulting style is classified into normal, reverse, and strike slip categories, based on the plunge of the P, T, and N axes , as extracted from the GEOFON moment tensor solutions. There is a normal fault (NF) if the plunge () , a strike slip (SS) if the plunge () , and a thrust fault (TF) if the plunge . In the other cases, the earthquake is labeled with OF (other faulting styles). To investigate the role of the style of faulting (SOF), we separate the event term into a fixed offset for each SOF class and a perturbation term for each event. If we indicate the classes of the SOF grouping factor (corresponding to NF, SS, TF, and OF) with and the class of event with , then the equation for the extended mixed-effects model is
3 where are the terms characterizing the average effects of the different SOF and are accounting for interevent differences within each SOF class (nested random effects). The standard deviations of the , , , and distributions are , , , and , respectively, generating a total standard deviation . The SOF terms are (NF), (SS), (TF), and (OF) (Fig. ). The largest difference is between SS and NF, a total of 0.206 m.u. There is a systematic impact of the SOF on the intercept of the model, but the associated variability is smaller compared to the interevent variability (in other words, SOF effects are statistically significant, but the distributions of interevent terms separated according to faulting style are strongly overlapping).
Figure 10
versus categorized with SOF.
[Figure omitted. See PDF]
The SOF effects might arise due to physical differences (on average) between the different faulting types, e.g., due to systematically different stress drops, differences in the maturity of faults or typical environments (intra-plate vs. interplate), where different faulting types occur most often, or they might be artifacts due to the fact that the method used here does not account for radiation pattern effects, and the teleseismic arrivals utilized here preferentially sample certain parts of the focal sphere. Therefore, we also investigate the role of the SOF in the relationship between derived in this study and the and values disseminated by IRIS. We recall that the methodology implemented by IRIS accounts for radiation pattern effects, which are related to the SOF. For this analysis, the regression model is the following: 4 where is either or . Results shown in Fig. confirm that the largest intercept difference is between normal and strike slip events, and the differences in terms of magnitude units (m.u.) are also similar between the other SOF. This suggests that a large part of the SOF term is influenced by radiation pattern effects, and interpretations of these differences in terms of geodynamics or hazard potential should be done very cautiously.
Figure 11
versus and , as categorized with SOF.
[Figure omitted. See PDF]
5 Real-time module for SeisComPThe module, derived from me-compute has been integrated to the SeisComP package () and has been part of the GEOFON routine real-time processing since December 2021. The first event for which calculations are available and disseminated via the usual GEOFON services is https://geofon.gfz-potsdam.de/eqinfo/event.php?id=gfz2021xxzt (last access: March 2024), which occurred on 7 December 2021 10:28:00.3 UTC ( 5.7 and 5.5). The scmert add-on is available at
Figure 12
Screenshot of the SeisComP Origin Locator View (scolv) interactive tool for the 7.7 Republic of Türkiye earthquake that occurred on 6 February 2023, 01:17 UTC along the East Anatolian Fault. The obtained network magnitude value of is 7.8. Stations used are color-coded according to magnitude residuals (top-left frame). Stations in gray are excluded from the network magnitude, do not match the distance range definition, or were trimmed while computing the average magnitude because they are within the % range. The top-right scatterplot shows residuals by distance (in red those that contributed to actual network magnitude). The topography shown in the map is generated using the ETOPO1 global relief model .
[Figure omitted. See PDF]
The add-on has been configured in GEOFON to trigger the calculation for each origin created by the automatic processing with magnitude and to compute station magnitudes for all stations according to the definition of in the distance 20–98°. The scmert procedure is applied with the settings used by the GEOFON Earthquake Information Service, using stations available in real time from the GEOFON Extended Virtual Network (
The energy magnitude values from both modules are compared in Fig. . We used scmert with the same settings as the GEOFON earthquake monitoring service, including station selection and trimming of the distributions. The values are in good agreement, and the best-fit model is , with a standard deviation of 0.118. The average difference computed for magnitudes between 6 and 8 is .
All values for that have been calculated since the start of the routine processing with scmert can be accessed via the fdsnws-event web service running at GEOFON by specifying as the magnitude type (i.e.,
Figure 13
Comparison between computed in real time by GEOFON, with the scmert add-on for SeisComP ( axis) and offline estimation using me-compute ( axis), considering 153 common events.
[Figure omitted. See PDF]
6 Code and data availabilityCode used for computing the energy magnitude is available from
-
offline computations in me-compute
https://doi.org/10.5880/GFZ.2.6.2023.008 ; and -
real-time computations in SeisComP with scmert
https://doi.org/10.5880/GFZ.2.4.2020.003 .
7 Conclusive remarks
We computed the energy magnitude for 6349 events in the moment magnitude catalog disseminated by GEOFON. When combined with , allows a better characterization of the tsunami and shaking potential of an earthquake. The procedure used to compile the dataset, which includes 1 031 396 values for each recording station, is described in detail. Residuals are evaluated using a mixed-effects regression, which partitions the overall residuals into event-specific and station-specific contributions. These random effects are included in the distributed catalog, enabling the computation of for future events and using inter-station residuals as station corrections to reduce the uncertainty in . They also enable the assessment of energy magnitude adjustments for specific regions or faulting mechanisms, using interevent residuals and locating propagation anomalies with respect to the global model used to compute Green's functions using the leftover residuals. The methodology employed for computing is suitable for the rapid assessment of . Therefore, it has been implemented as a module for SeisComP, allowing the automatic computation of in real time and keeping the catalog up to date.
Author contributions
DB, AS, and DDG conceptualized the study. RZ developed the Python code used to compile the disseminated catalog. AH developed the add-on for SeisComP. DB developed the quality checks. PE, AH, and AS organized the publication of by GEOFON via web services. All authors participated to the finalization of the article.
Competing interests
The contact author has declared that none of the authors has any competing interests.
Disclaimer
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
We thank all network operators providing data via EIDA–ORFEUS and IRIS, as well as all real-time data providers contributing to the GEOFON virtual network. We thank Kirsten Elger for her help in preparing the dissemination of the products of this study. Finally, we appreciated the valuable feedback and suggestions provided by two anonymous reviewers and by the editor Andrea Rovida.
Financial support
he article processing charges for this open-access publication were covered by the Helmholtz Centre Potsdam – GFZ German Research Centre for Geosciences through the finacial support of the Horizon Europe project Geo-INQUIRE, funded by the European Commission (HORIZON-INFRA-2021-SERV-01, project no. 101058518).The article processing charges for this open-access publication were covered by the Helmholtz Centre Potsdam – GFZ German Research Centre for Geosciences.
Review statement
This paper was edited by Andrea Rovida and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
We present a seismic catalog (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details





1 German Research Centre for Geoscience GFZ, Potsdam, Germany
2 International Seismological Center ISC, Thatcham, UK
3 German Research Centre for Geoscience GFZ, Potsdam, Germany; Institute of Geociences, University of Potsdam, Potsdam, Germany
4 German Research Centre for Geoscience GFZ, Potsdam, Germany; Institute of Geological Sciences, Freie Universität Berlin, Berlin, Germany