1 Introduction
The oceans mitigate climate change by absorbing both atmospheric CO corresponding to a significant fraction of anthropogenic CO emissions (Friedlingstein et al., 2022; Gruber et al., 2019; Müller et al., 2023) and most of the excess heat in the Earth system caused by the enhanced greenhouse effect (Cheng et al., 2017; Cheng et al., 2020). The objective of GLODAP (Global Ocean Data Analysis Project,
Other chemical tracers are measured on many cruises included in GLODAP, such as dissolved organic carbon and nitrogen, as well as stable and radioactive isotope ratios. In many cases, a subset of these data is distributed as part of the GLODAP data product; however, such data have not been extensively quality-controlled or checked for measurement biases in this effort. For some of these variables, better sources of data exist; for example, the product by Jenkins et al. (2019) for helium isotope and tritium data or the dissolved organic matter product by Hansell et al. (2021). GLODAP also includes some common derived variables to facilitate interpretation, such as potential density anomalies and apparent oxygen utilization (AOU). A full list of variables included in the data product is provided in Table 1.
Table 1
Variables in the GLODAPv2.2023 comma-separated (csv) product files, their units, short and flag names, and corresponding names in the individual cruise exchange files. In the MATLAB product files that are also supplied, the term “G2” has been added to every variable name (e.g., G2cruise).
Variable | Units | Product file | WOCE flag | Second QC flag | WHP-exchange |
---|---|---|---|---|---|
name | name | name | name | ||
EXPOCODE | expocode | ||||
Digital objectidentifier | doi | ||||
Assigned sequentialcruise number | cruise | ||||
Basin identifier | region | ||||
Station | station | STNNBR | |||
Cast | cast | CASTNO | |||
Year | year | DATE | |||
Month | month | DATE | |||
Day | day | DATE | |||
Hour | hour | TIME | |||
Minute | minute | TIME | |||
Latitude | latitude | LATITUDE | |||
Longitude | longitude | LONGITUDE | |||
Bottom depth | m | bottomdepth | |||
Pressure of the deepest sample | dbar | maxsampdepth | DEPTH | ||
Niskin bottle number | bottle | BTLNBR | |||
Sampling pressure | dbar | pressure | CTDPRS | ||
Sampling depth | m | depth | |||
Temperature | °C | temperature | CTDTMP | ||
Potential temperature | °C | theta | |||
Salinity | salinity | salinityf | salinityqc | CTDSAL/SALNTY | |
Potential densityanomaly | kg m | sigma0 | (salinityf) | ||
Potential densityanomaly, ref 1000 dbar | kg m | sigma1 | (salinityf) | ||
Potential densityanomaly, ref 2000 dbar | kg m | sigma2 | (salinityf) | ||
Potential densityanomaly, ref 3000 dbar | kg m | sigma3 | (salinityf) | ||
Potential densityanomaly, ref 4000 dbar | kg m | sigma4 | (salinityf) | ||
Neutral densityanomaly | kg m | gamma | (salinityf) | ||
Oxygen | mol kg | oxygen | oxygenf | oxygenqc | CTDOXY/OXYGEN |
Continued.
Variable | Units | Product file | WOCE flag | Second QC flag | WHP-exchange |
---|---|---|---|---|---|
name | name | name | name | ||
Apparent oxygenutilization | mol kg | aou | aouf | ||
Nitrate | mol kg | nitrate | nitratef | nitrateqc | NITRAT |
Nitrite | mol kg | nitrite | nitritef | NITRIT | |
Silicate | mol kg | silicate | silicatef | silicateqc | SILCAT |
Phosphate | mol kg | phosphate | phosphatef | phosphateqc | PHSPHT |
TCO | mol kg | tco2 | tco2f | tco2qc | TCARBON |
TAlk | mol kg | talk | talkf | talkqc | ALKALI |
pH on total scale, 25 °C and 0 dbar of pressure | phts25p0 | phts25p0f | phtsqc | PH_TOT | |
pH on total scale, in situ temperature andpressure | phtsinsitutp | phtsinsitutpf | phtsqc | ||
CO at 20 °C and0 dbar of pressure | atm | fco2 | fco2f | FCO2/PCO2 | |
CO temperature | °C | co2temp | (fco2f) | FCO2_TMP/PCO2_TMP | |
CFC-11 | pmol kg | cfc11 | cfc11f | cfc11qc | CFC-11 |
pCFC-11 | ppt | pcfc11 | (cfc11f) | ||
CFC-12 | pmol kg | cfc12 | cfc12f | cfc12qc | CFC-12 |
pCFC-12 | ppt | pcfc12 | (cfc12f) | ||
CFC-113 | pmol kg | cfc113 | cfc113f | cfc113qc | CFC-113 |
pCFC-113 | ppt | pcfc113 | (cfc113f) | ||
CCl | pmol kg | ccl4 | ccl4f | ccl4qc | CCL4 |
pCCl | ppt | pccl4 | (ccl4f) | ||
SF | fmol kg | sf6 | sf6f | sf6qc | SF6 |
pSF6 | ppt | psf6 | (sf6f) | ||
C | ‰ | c13 | c13f | c13qc | DELC13 |
C | ‰ | c14 | c14f | DELC14 | |
C counting error | ‰ | c14err | C14ERR | ||
H | TU | h3 | h3f | TRITIUM | |
H counting error | TU | h3err | TRITER | ||
He | % | he3 | he3f | DELHE3 | |
He counting error | % | he3err | DELHER | ||
He | nmol kg | he | hef | HELIUM | |
He counting error | nmol kg | heerr | HELIER | ||
Ne | nmol kg | neon | neonf | NEON | |
Ne counting error | nmol kg | neonerr | NEONER |
Continued.
Variable | Units | Product file | WOCE flag | Second QC flag | WHP-exchange |
---|---|---|---|---|---|
name | name | name | name | ||
O | ‰ | o18 | o18f | DELO18 | |
Total organic carbon | mol L | toc | tocf | TOC | |
Dissolved organiccarbon | mol L | doc | docf | DOC | |
Dissolved organicnitrogen | mol L | don | donf | DON | |
Dissolved total nitrogen | mol L | tdn | tdnf | TDN | |
Chlorophyll | g kg | chla | chlaf | CHLORA |
The only derived variable assigned a separate WOCE flag is AOU, as it depends strongly on both temperature and oxygen (and less strongly on salinity). For the other derived variables, the applicable WOCE flag is given in parentheses. Secondary QC flags indicate whether data have been subjected to full secondary QC (1) or not (0), as described in Sect. 3. 1 is the Atlantic Ocean, 4 is the Arctic Mediterranean Sea (i.e., the Arctic Ocean plus the Nordic Seas), 8 is the Pacific Ocean, and 16 is the Indian Ocean. Included for clarity is 20 °C for all occurrences. Note that ppt stands for parts per trillion. Units have not been checked; some values are in micromoles per kilogram (for TOC, DOC, DON, and TDN) or microgram per liter (for chl ) are probable.
The oceanographic community largely adheres to principles and practices for ensuring open access to research data, such as the FAIR (Findable, Accessible, Interoperable, Reusable) initiative (Wilkinson et al., 2016), but the plethora of file formats and different levels of documentation, combined with the need to retrieve data on a per cruise basis from different access points, limits the realization of their full scientific potential. In addition, the manual data retrieval is time-consuming and prone to data-handling errors (Tanhua et al., 2021). For biogeochemical data, there is the added complexity of different levels of standardization and calibration, and even different units and scales are used for the same variable, such that the comparability between datasets is often poor. Standard operating procedures have been developed for some variables (Dickson et al., 2007; Hood et al., 2010; Becker et al., 2020), certified reference materials (CRMs) exist for seawater TCO and TAlk measurements (Dickson et al., 2003), and reference materials for nutrients in seawater (RMNS; certified based on International Organization for Standardization Guide 34; Aoyama et al., 2012; Ota et al., 2010) are also available. Despite all this, biases in data still exist. These can arise from poor sampling and preservation practices, calibration procedures, instrument design and calibration, and inaccurate calculations. The use of CRMs does not by itself ensure accurate measurements of seawater CO chemistry (Bockmon and Dickson, 2015), and the RMNS have only become available recently and are not universally used. For salinity and oxygen, the lack of calibration of the data from conductivity–temperature–depth (CTD)-profiler-mounted sensors is an additional and widespread problem, particularly for oxygen (Olsen et al., 2016). For halogenated transient tracers, uncertainties in standard gas composition, extracted water volume, and purge efficiency typically provide the largest sources of uncertainty. In addition to bias, occasional outliers occur. In rare cases poor precision – many multiples worse than that expected with current measurement techniques – can render a set of data to be of limited use. GLODAP deals with these issues by presenting the data in a uniform format, including any metadata either publicly available or submitted by the data originator, and by subjecting the data to rigorous primary and secondary quality control assessments, focusing on precision and consistency, respectively. The secondary quality control focuses on deep data in which natural variability is minimal. Adjustments are applied to the data to minimize cases of bias that could be confidently established relative to the measurement precision for the variables and cruises considered. Key metadata are provided in the header of each data file, and original unadjusted data along with full cruise reports submitted by the data providers (where available) are accessible through the GLODAPv2 cruise summary table hosted by the Ocean Carbon and Acidification Data System (OCADS) at the National Oceanographic and Atmospheric Administration (NOAA) National Centers for Environmental Information (NCEI) (
This most recent GLODAPv2.2023 data product builds on earlier synthesis efforts for biogeochemical data obtained from research cruises, namely GLODAPv1.1 (Key et al., 2004; Sabine et al., 2005), Carbon dioxide In the Atlantic Ocean (CARINA) (Key et al., 2010), PACIFic ocean Interior CArbon (PACIFICA) (Suzuki et al., 2013), and notably GLODAPv2 (Olsen et al., 2016). GLODAPv1.1 combined data from 115 cruises with biogeochemical measurements from the global ocean. The vast majority of these were the sections covered during the World Ocean Circulation Experiment and the Joint Global Ocean Flux Study (WOCE/JGOFS) in the 1990s, but data from important “historical” cruises were also included, such as from the Geochemical Ocean Sections Study (GEOSECS), Transient Traces in the Ocean (TTO), and South Atlantic Ventilation Experiment (SAVE). GLODAPv2, which forms the basis for the update presented here, was released in 2016 with data from 724 scientific cruises, including those from GLODAPv1.1, CARINA, and PACIFICA, as well as data from 168 additional cruises. GLODAPv2 not only combined all previous efforts, but it also created ocean-wide consistency across all cruise data through an inversion analysis. A particularly important source of additional data was the cruises executed within the framework of the “repeat hydrography” program (Talley et al., 2016) instigated in the early 2000s as part of the Climate and Ocean – Variability, Predictability, and Change (CLIVAR) program and which has been organized as the Global Ocean Ship-based Hydrographic Investigations Program (GO-SHIP) since 2007 (Sloyan et al., 2019). GLODAPv2 is updated regularly using the “living data process” of Earth System Science Data to document significant additions and modifications to the data product.
There are two types of GLODAP updates: full and intermediate. Full updates involve a reanalysis, notably crossover and inversion, of the entire dataset (both historical and new cruises) in which all data points are subject to potential adjustment. This was carried out for the creation of GLODAPv2. For intermediate updates, recently available data are added following quality control procedures to ensure their consistency with the cruises included in the latest GLODAP release. Except for obvious outliers and similar types of errors (Sect. 3.3.1), the data from previous releases are not changed or adjusted during intermediate updates. Note that the GLODAP mapped climatologies (Lauvset et al., 2016) are not updated for these intermediate products. A naming convention has been introduced to distinguish intermediate from full product updates. For the latter, the version number will change, while for the former, the year of release is appended. The exact version number and release year (if appended) of the product used should always be reported in studies, rather than making a generic reference to GLODAP.
Creating and interpreting inversions, as well as other checks of the entire dataset needed for full updates, are too demanding in terms of time and resources to be performed every year or every 2 years. The aim is to conduct a full analysis (i.e., including an inversion) again after the third GO-SHIP survey has been completed. This completion is currently scheduled for 2024. Work on GLODAPv3 commences in fall 2023, and we anticipate that GLODAPv3 will become available at the end of 2025. In the interim, the fifth intermediate update is presented here, which adds data from 23 cruises to the last update, GLODAPv2.2022 (Lauvset et al., 2022). Given the anticipated schedule towards producing GLODAPv3, this will be the final intermediate update of GLODAPv2.
2 Key features of the updateGLODAPv2.2023 contains data from 1108 cruises covering the global ocean from 1972 to 2021, compared to 1085 for the period 1972–2021 for the previous GLODAPv2.2022 (Lauvset et al., 2022). Information about the 23 cruises added to this version is provided in Table A1 in the Appendix. Cruise sampling locations are shown alongside those of GLODAPv2.2023 in Fig. 1, while the coverage in time is shown in Fig. 2. Not all cruises have data for all the abovementioned 13 core variables. For example, cruises with only seawater CO chemistry or transient tracer data are still included, even without accompanying nutrient data, due to their value with respect to the computation of carbon inventories. In a few cases, cruises without any of these properties are included because they do contain data for other carbon-related tracers, such as carbon isotopes, with the intention of ensuring their wider availability. The added cruises are from 2002 to 2021, with the majority being more recent than 2018. This update consists of the following:
-
five cruises in the northeastern North Atlantic from Marine Institute, Ireland;
-
three time series datasets in the Arctic Ocean;
-
a dataset from the Greenland–Scotland ridge, including multiple stations in the Irminger and Iceland seas over the years 2002–2006, and the Iceland standard section at 64°20 N;
-
four Japanese cruises in the northwestern Pacific Ocean;
-
one Chinese National Arctic Research Expedition (CHINARE) cruise from the Arctic (Chukchi Sea) in 2008;
-
an update of the Irminger Sea and Iceland Sea time series, with new data from 2014–2019;
-
one Line P cruise (2015);
-
three coastal cruises out of Iquique from the Coastal Ecosystem and Global Environmental Change lab (ECCA Lab) in Chile;
-
one Discovery cruise in the Southern Ocean;
-
A20 and A22 occupations from 2021; and
-
the 2018 Observatoire de la variabilité interannuelle à décennale en Atlantique Nord (OVIDE) cruise.
All new cruises in GLODAPv2.2023 include seawater CO chemistry data, and eight of these also include data on halogenated transient tracers.
Figure 1
Location of stations in (a) GLODAPv2.2022 and for (b) the new data added in this update.
[Figure omitted. See PDF]
Figure 2
Number of cruises per year in GLODAPv2, GLODAPv2.2022, and GLODAPv2.2023.
[Figure omitted. See PDF]
All new cruises were subjected to primary (Sect. 3.1) and secondary (Sect. 3.2) quality control (QC). These procedures are very similar to those used for GLODAPv2.2022 and previous versions, aiming to ensure the consistency of the data from the 23 new cruises with the previous release of the GLODAP data product (in this case, the GLODAPv2.2022 adjusted data product). As for GLODAPv2.2022, we also apply secondary QC routines to SF data.
For GLODAPv2.2021, we added a basin identifier to the product files, where 1 is the Atlantic Ocean, 4 the Arctic Mediterranean Sea (i.e., the Arctic Ocean plus the Nordic Seas), 8 the Pacific Ocean, and 16 the Indian Ocean. These regions are abbreviated to AO, AMS, PO, and IO, respectively, in the adjustment table. Data in the Mediterranean Sea, Caribbean Sea, and Gulf of Mexico are classified as belonging to the Atlantic Ocean (1). These basin identifiers were added to the product files, for the first time in GLODAPv2.2022, to make it easier for users to identify which ocean basin an individual cruise belongs to without having to use one of the four regional files. This setting remains in GLODAPv2.2023. Note that there is no overlap between the regional files or for our basin identifiers, and cruises in the Southern Ocean are placed in the basin where most of the data were collected. As in the previous version, GLODAPv2.2022, we include the DOI for each cruise in all product files with the aim of easing user access to the original data and metadata, as well as improving the visibility of data providers.
3 Methods3.1 Data assembly and primary quality control
Data from the 23 new cruises were submitted directly to us or retrieved from data centers, typically OCADS (
The individual cruise data files were converted to the WHP-exchange format, which is a comma-delimited ASCII (American Standard Code for Information Interchange) format for data from hydrographic cruises with different and specific versions for CTD and bottle data. GLODAP only includes WHP-exchange in the bottle format with data and CTD data at bottle trip depths. An overview of the significant points is given below, with full details provided at
Exchange file preparation required unit conversion in some cases, most frequently from concentrations expressed as milliliters per liter (mL L; oxygen) or micromoles per liter (mol L; nutrients) to substance contents expressed as micromoles per kilogram of seawater (mol kg). Procedures, as described in Jiang et al. (2022), were used for these conversions. The default conversion procedure for nutrients was to use seawater density at reported salinity, an assumed measurement temperature of 22 °C, and pressure of 1 atm. For oxygen, the factor 44.66 was used for the “milliliters of oxygen” to “micromoles of oxygen” conversion, while the density required for the “per liter” to “per kilogram” conversion was calculated from the reported salinity and draw temperatures whenever possible. However, potential density was used instead when draw temperature was not reported. The potential errors introduced by any of these procedures are insignificant. Missing numbers are indicated by .
Each data column (except temperature and pressure, which are assumed to be “good” if they exist) has an associated column of data flags (Joyce and Corry, 1994). For the original data exchange files, these flags conform to the WOCE definitions for water samples and are listed in Table 2. For the merged and adjusted product files these flags are simplified; questionable (WOCE flag 3) and bad (WOCE flag 4) data are removed, and their flags are set to 9. The same procedure is applied to data flagged 8 (very few such data exist); 1 (data not received) and 5 (data not reported) are also set to 9, while flags of 6 (mean of replicate measurements) and 7 (manual chromatographic peak measurement) are set to 2 if the data appear good. Also, in the merged product files, a flag of 0 is used to indicate a value that could be measured but is approximated. For salinity, oxygen, phosphate, nitrate, and silicate, the approximation is conducted using vertical interpolation; for seawater CO chemistry variables (TCO, TAlk, pH, and CO), the approximation is conducted using the calculation from two measured CO chemistry variables (Sect. 3.2.2). Importantly, the interpolation of CO chemistry variables is never performed, and thus a flag value of 0 has a unique interpretation.
Table 2
WOCE flags in GLODAPv2.2023 exchange format original data files (briefly; for full details, see Swift, 2010) and the simplified scheme used in the merged product files.
WOCE flag value | Interpretation | |
---|---|---|
Original data exchange files | Merged product files | |
0 | Flag not used | Interpolated or calculated value |
1 | Data not received | Flag not used |
2 | Acceptable | Acceptable |
3 | Questionable | Flag not used |
4 | Bad | Flag not used |
5 | Value not reported | Flag not used |
6 | Average of replicate | Flag not used |
7 | Manual chromatographic peak measurement | Flag not used |
8 | Irregular digital peak measurement | Flag not used |
9 | Sample not drawn | No data |
Flag set to 9 in product files. Data are not included in the GLODAPv2.2023 product files, and their flags are set to 9. Data are included, but the flag is set to 2.
If no WOCE flags were submitted with the data, then they were assigned by the GLODAP reference group (
The aim of the secondary QC was to identify and correct any significant biases in the data from the 23 new cruises relative to GLODAPv2.2022 while retaining any signal due to temporal changes. To this end, secondary QC in the form of consistency analyses was conducted to identify offsets in the data. All identified offsets were scrutinized by the GLODAP reference group through a series of teleconferences during May 2023 to decide the adjustments to be applied to reduce the apparent offset (if any). To guide this process, a set of initial minimum adjustment limits was prescribed (Table 3). These represent the minimum bias that can be confidently established relative to the measurement precision for the variables and cruises considered and are the same as those used for GLODAPv2.2022. In addition to the average magnitude of the offsets, factors such as the precision of the offsets, persistence towards the various cruises used in the comparison, regional dynamics, and the occurrence of time trends or other variations were considered. Thus, not all offsets larger than the initial minimum limits have been adjusted. A guiding principle for these considerations was to not apply an adjustment when in doubt. Conversely, in some cases, when data and offsets were very precise and the cruise had been conducted in a region where variability is expected to be small, adjustments lower than the minimum limits were applied. Any adjustment was applied uniformly to all values for a variable and cruise; i.e., an underlying assumption is that cruises suffer from either no or a single and constant measurement bias. Adjustments for salinity, TCO, TAlk, and pH are always additive, while adjustments for oxygen, nutrients, and the halogenated transient tracers are always multiplicative. Except where explicitly noted (Sect. 3.3.1 and Table A2 in the Appendix), adjustments were not modified for data previously included in GLODAPv2.2022.
Table 3
Initial minimum adjustment limits. These limits represent the minimum bias that can be confidently established relative to the measurement precision for the variables and cruises considered. Note that these limits are not uncertainties but rather a priori estimates of global inter-cruise consistency in the data product.
Variable | Minimum adjustment |
---|---|
Salinity | 0.005 |
Oxygen | 1 % |
Nutrients | 2 % |
TCO | 4 mol kg |
TAlk | 4 mol kg |
pH | 0.01 |
CFCs | 5 % |
Crossover comparisons were the primary source of information used to identify offsets for salinity, oxygen, nutrients, TCO, TAlk, and pH (Sect. 3.2.2). As in GLODAPv2.2022, GLODAPv2.2021 and GLODAPv2.2020, but in contrast to GLODAPv2 and GLODAPv2.2019, the evaluation of the internal consistency of the seawater CO chemistry variables was not used for the evaluation of pH (Sect. 3.2.3). As in the three previous updates (2020, 2021, and 2022) we made extensive use of two predictions from two empirical algorithms – CArbonate system And Nutrients concentration from hYdrological properties and Oxygen using a Neural network, version B (CANYON-B), and CONsisTency EstimatioN and amounT (CONTENT) (Bittig et al., 2018) – for the evaluation of offsets in nutrients and seawater CO chemistry data (Sect. 3.2.4). For previous versions, we have also used multiple linear regression analyses and deep water averages, broadly following Jutterström et al. (2010), for additional information for the secondary QC of salinity, oxygen, nutrients, TCO, and TAlk data. As in GLODAPv2.2022, we did not have to rely on the results of the multiple linear regression (MLR) analyses to make decisions about adjustments, and in general, we are increasingly moving towards only using CANYON-B and CONTENT estimates (Sect. 3.2.4), as additional information when the crossover analysis conducted is insufficient.
For the halogenated transient tracers, comparisons of surface saturation levels and the relationships among the tracers were used to assess the data consistency (Sect. 3.2.5). For salinity and oxygen, CTD and bottle values were merged into a “hybrid” variable prior to the consistency analyses (Sect. 3.2.1).
3.2.1 Merging of sensor and bottle dataSalinity and oxygen data can be obtained by analysis of water samples (bottle data) and/or directly from the CTD sensor pack. These two measurement types are merged and presented as a single variable in the product. The merging was conducted prior to the consistency checks, ensuring their internal calibration in the product. The merging procedures were only applied to the bottle data files, which commonly include values recorded by the CTD at the pressures where the water samples are collected. Whenever both CTD and bottle data were present in a data file, the merging step considered the deviation between the two and calibrated the CTD values if required and possible. Altogether, seven scenarios (Table 4) are possible for each of the CTD conductivity and oxygen (O) sensor properties individually, in which the fourth never occurred during our analyses but is included to maintain consistency with GLODAPv2. For salinity, 48 % of the 23 new cruises included both CTD and bottle data in the original cruise files, and for oxygen, 57 % of the new cruises where oxygen was included (14) had both CTD and bottle data (Table 6). For all these cruises, the two data types were found to be consistent for both oxygen and salinity. These new data have a higher proportion of cruises with both bottle and CTD measurements than GLODAPv2.2022 (39 % for both salinity and oxygen). For oxygen, the remaining cruises have only CTD data, while for salinity, 22 % have only CTD data and 30 % have only bottle data. Having both CTD and bottle values in the data files is highly preferred, as the information is valuable for quality control (bottle misfires, leaking Niskin bottles, and oxygen sensor drift are among the issues that can be revealed). The extent to which the bottle data (i.e., OXYGEN in the individual cruise exchange files) are mislabeled CTD data (i.e., should be CTDOXY) is uncertain. Regardless, all CTD and bottle data for salinity and oxygen were consistent and did not need any further calibration.
Table 4
Summary of salinity and oxygen calibration needs and actions. The number of cruises with each of the scenarios identified is also given.
Case | Description | Salinity | Oxygen |
---|---|---|---|
1 | No data are available: no action needed. | 0 | 9 |
2 | No bottle values are available: use CTD values. | 11 | 4 |
3 | No CTD values are available: use bottle values. | 1 | 2 |
4 | Too few data of both types are available for comparison, and % of the records have bottle values: use bottle values. | 0 | 0 |
5 | The CTD values do not deviate significantly from bottle values: replace missing bottle values with CTD values. | 11 | 8 |
6 | The CTD values deviate significantly from bottle values: calibrate CTD values using linear fit and replace missing bottle values with calibrated CTD values. | 0 | 0 |
7 | The CTD values deviate significantly from bottle values, and no good linear fit can be obtained for the cruise: use bottle values and discard CTD values. | 0 | 0 |
The crossover analyses were conducted with the MATLAB toolbox prepared by Lauvset and Tanhua (2015) and with GLODAPv2.2022 as the reference data product. The toolbox implements the “running-cluster” crossover analysis first described by Tanhua et al. (2010). This analysis compares data from two cruises on a station-by-station basis and calculates a weighted mean offset between the two and its weighted standard deviation. The weighting is based on the scatter in the data, such that data that have less scatter have a larger influence on the comparison than data with more scatter. Whether the scatter reflects the actual variability or data precision is irrelevant in this context, as increased scatter nevertheless decreases the confidence in the comparison. Stations are compared when they are within 2 arcdeg distance ( 200 km) of each other. To minimize the effects of natural variability, only deep data are used. Either the 1500 or 2000 dbar pressure surface was used as the upper bound, depending on the amount of available data, their variation at different depths, and the region in question. Which one to use was determined on a case-by-case basis by comparing crossovers with the two depth limits and using the one that provided the clearest and most robust information. In regions where deep mixing or convection occurs, such as the Nordic, Irminger, and Labrador seas, the upper bound was always placed at 2000 dbar; while winter mixing in the first two regions is normally not deeper than this (Brakstad et al., 2019; Fröb et al., 2016), convection beyond this limit has occasionally been observed in the Labrador Sea (Yashayaev and Loder, 2017). However, using an upper depth limit deeper than 2000 dbar will quickly give too few data for robust analysis. In addition, even below the deepest winter mixed layers, properties do change over the time periods considered (e.g., Falck and Olsen, 2010), so this limit does not guarantee steady conditions. In the Southern Ocean, deep convection beyond 2000 dbar seldom occurs, with an exception being the processes accompanying the formation of the Weddell Polynya in the 1970s (Gordon, 1978). Deep and bottom water formation usually occur along the Antarctic coasts, where relatively thin nascent dense water plumes flow down the continental slope. We avoid such cases, which are easily recognizable. To avoid removing persistent temporal trends, all crossover results are also evaluated as a function of time (see below).
As an example of crossover analysis, the crossover for alkalinity measured on the two cruises 35TH20180611, which is new to this version, and 45CE20170427, which was included in GLODAPv2.2019, is shown in Fig. 3. For alkalinity, the offset is determined as the difference, in accordance with the procedures followed for GLODAPv2. The mean weighted offset shows there is no difference between these cruises.
Figure 3
Example crossover figure for alkalinity for cruises 35TH20180611 (blue) and 45CE20170427 (red), as was generated during the crossover analysis. Panel (a) shows all station positions for the two cruises, and panel (b) shows the specific stations used for the crossover analysis. Panel (d) shows the data of alkalinity (mol kg) below the upper depth limit (in this case 2000 dbar) versus potential density anomaly referenced to 4000 dbar as points and the interpolated profiles as lines. Non-interpolated data either did not meet minimum depth separation requirements (Table 4 in Key et al., 2010) or are the deepest sampling depth. The interpolation does not extrapolate. Panel (e) shows the mean alkalinity difference profile (black; dots) with its standard deviation, as well as also the weighted mean offset (straight red lines) and weighted standard deviation. Summary statistics are provided in panel (c).
[Figure omitted. See PDF]
For each of the 23 new cruises, such a crossover comparison was conducted against all possible cruises in GLODAPv2.2021, i.e., all cruises that had stations closer than 2 arcdeg distance to any station for the cruise in question. The summary figure for alkalinity on 35TH20180611 is shown in Fig. 4. The alkalinity data measured on this cruise are on average 1.82 0.40 mol kg higher when compared to the data measured on nearby cruises included in GLODAPv2.2022. This is smaller than the initial minimum adjustment limit for alkalinity of 4 mol kg (Table 3) and as such does not lead to an adjustment of the data in the merged data product. Detailed analyses are performed for every new cruise, and every core variable, to detect potential temporal trends in the offsets. Figure 4 further illustrates that the alkalinity offset is nearly zero – also when comparing only with cruises since 2010 – and no thus trend is apparent. All other variables also show very high consistency; thus, no adjustment is given to any other variable on cruise 35TH20180611 in GLODAPv2.2022. This is supported by the CANYON-B and CONTENT results (Sect. 3.2.4). Note that adjustments, when applied, are typically round numbers (e.g., not for TCO and 0.005 not 0.0047 for pH) to avoid communicating that the ideal adjustments are accurately known.
Figure 4
Example summary figure for alkalinity crossovers for 35TH20180611 versus the cruises in GLODAPv2.2022 (with cruise EXPOCODE listed on the axis sorted according to the year the cruise was conducted). The black dots and vertical error bars show the weighted mean offset and standard deviation for each crossover (as a ratio). The weighted mean and standard deviation of all these offsets are shown in the red lines and are 1.82 0.40. The dashed black lines are the reference line for a mol kg offset.
[Figure omitted. See PDF]
3.2.3 pH scale conversion and quality controlAltogether 12 of the 23 new cruises included measured spectrophotometric pH data, and none required an adjustment (Sect. 4). All cruises reported pH data on the total scale and at 25 °C. For the single cruise reporting pH on the seawater scale, the data were converted following established routines (Olsen et al., 2020). For details on scale and temperature conversions in previous versions of GLODAPv2, we refer to Olsen et al. (2020). In contrast to quality control of pH data in GLODAPv2 (Olsen et al., 2016), the evaluation of the internal consistency of CO system variables has not been used for the secondary quality control of the pH data in the GLODAPv2 updates of 2020 and onwards. For the 12 new cruises with pH in GLODAPv2.2023, only crossover analysis was used and was supplemented by CONTENT and CANYON-B comparisons (Sect. 3.2.4). Recent literature has demonstrated that internal consistency evaluation procedures are subject to errors owing to an incomplete understanding of the thermodynamic constants, major ion contents, measurement biases, and potential contribution of organic compounds or other unknown protolytes to alkalinity. These complications lead to pH-dependent offsets in calculated pH compared with cruise spectrophotometric pH measurements (Álvarez et al., 2020; Carter et al., 2018; Fong and Dickson, 2019; Takeshita et al., 2020). The pH-dependent offsets may be interpreted as biases and generate false corrections (Álvarez et al., 2020; García-Ibáñez et al., 2022). The offsets are particularly strong at pH levels below 7.7, where calculated and measured pH values are different by between 0.01 and 0.02 on average. For the North Pacific, this is a problem, as pH values below 7.7 can occur at the depths used during the QC ( dbar for this region; Olsen et al., 2016). Since any correction, which may be an artifact, would be applied to the full profiles, we use a minimum adjustment of 0.02 for the North Pacific pH data in the merged product files. Elsewhere, the inconsistencies that may have arisen are smaller, since deep pH is typically higher than 7.7 (Lauvset et al., 2020), and at such levels, the difference between calculated and measured pH is less than 0.01 on average (Álvarez et al., 2020; Carter et al., 2018). Outside the North Pacific, we believe that the pH data are consistent to within 0.01. Avoiding CO chemistry internal consistency considerations for these intermediate products helps to reduce the problem, but since the reference dataset (as also used for the generation of the CANYON-B and CONTENT algorithms) may have these issues, a future full re-evaluation, envisioned for GLODAPv3, is needed to address the problem completely.
3.2.4 CANYON-B and CONTENT analyses
CANYON-B and CONTENT (Bittig et al., 2018) were used to support decisions regarding the application of adjustments (or not). CANYON-B is a neural network for estimating nutrients and seawater CO chemistry variables from temperature, salinity, and oxygen content. CONTENT additionally considers the consistency among the estimated CO chemistry variables to further refine them. These approaches were developed using the data included in the GLODAPv2 data product (i.e., the 2016 version without any more recent updates). Their advantage compared to crossover analyses for evaluating consistency among cruise data is that the effects of water mass changes on ocean properties are represented in the nonlinear relationships in the underlying neural network. For example, if elevated nutrient values measured on a cruise are not due to a measurement bias but rather the actual aging of the water masses that have been sampled and is as such accompanied by a decrease in oxygen content, then the measured values and the CANYON-B estimates are likely to be similar. Conversely, if the nutrient values are biased, then the measured values and CANYON-B predictions will be dissimilar.
Used in the correct way and with caution, this tool is a powerful supplement to the traditional crossover analyses which form the basis of our analyses. Specifically, we gave no weight to comparisons in which the crossover analyses had suggested that the salinity and/or O data were biased, as this would lead to error in the predicted values. We also considered the uncertainties in the CANYON-B and CONTENT estimates. These uncertainties are determined for each predicted value, and for each comparison, the ratio of the difference (between measured and predicted values) to the local uncertainty was used to gauge the comparability. As an example, the CANYON-B and CONTENT analyses of the data obtained for 35TH20180611 are presented in Fig. 5. The CANYON-B and CONTENT results confirmed the crossover comparisons for alkalinity discussed in Sect. 3.2.2, showing an inconsistency of only 1.3–1.4. For the other variables, the inconsistencies are low and agree with the crossover results (not shown here but results can be accessed through the adjustment table).
Figure 5
Example summary figure for CANYON-B and CONTENT analyses for 35TH20180611. Any data from regions where CONTENT and CANYON-B were not trained are excluded. The top row shows the nutrients and the bottom row the seawater CO chemistry variables. All are shown versus sampling pressure (dbar), and the unit is micromoles per kilogram (mol kg) for all except pH, which is on the total scale at in situ temperature and pressure. Black dots (which are to a large extent hidden by the predicted estimates) are the measured data, blue dots are CANYON-B estimates, and red dots are the CONTENT estimates. Each variable has two figure panels. The left shows the depth profile, while the right shows the absolute difference between measured and estimated values divided by the CANYON-B and CONTENT uncertainty estimate, which is determined for each estimated value. These values are used to gauge the comparability; a value below 1 indicates a good match, as it means that the difference between measured and estimated values is less than the uncertainty in the latter. The statistics in each panel are for all data deeper than 500 dbar, and N is the number of samples considered. A multiplicative adjustment and its interquartile range are given for the nutrients. For the seawater CO chemistry variables, the numbers in each panel are the median difference between the measured and predicted values for CANYON-B (upper) and CONTENT (lower). Both are given with their interquartile range.
[Figure omitted. See PDF]
Another advantage of the CANYON-B and CONTENT comparisons is that these procedures provide estimates at the level of individual data points; e.g., pH values are determined for every sampling location and depth where temperature, salinity, and O data are available. Cases of strong differences between measured and estimated values are always examined. This has helped us to identify primary QC issues for some cruises and variables; for example, there was a case of an inverted pH profile on cruise 32PO20130829 which was identified and amended in GLODAPv2.2020.
3.2.5Halogenated transient tracers and SF
For the halogenated transient tracers (CFC-11, CFC-12, CFC-113, and CCl; CFCs for short) an inspection of surface saturation levels and an evaluation of relationships between the tracers for each cruise were used to identify biases rather than crossover analyses. Crossover analysis is of limited value for these variables, given their transient nature and low contents at depth. As for GLODAPv2, the procedures were the same as those applied for CARINA (Jeansson et al., 2010; Steinfeldt et al., 2010).
Beginning with GLODAPv2.2022, we have performed secondary quality control for SF data, as this tracer is increasingly being measured and has proven a valuable addition to CFCs. The procedure is mainly based on comparisons with the quality-controlled CFC-12 data, which are available for all cruises with SF measurements. We compare the surface saturation of SF with that of CFC-12 and also consider the correlation between SF and CFC-12 in the ocean interior. Typically, this relation shows some scatter and does not follow a distinct curve (Fig. 6). However, for a given CFC-12 value, the SF content should fall into a certain range, and this range can be estimated by the transit time distribution (TTD; Hall et al., 2002) method. Note that we are not trying to adjust SF to perfectly correlate with CFC-12, as that would severely decrease the value of SF as an independent constraint on ocean circulation. We merely confirm that the SF content is within an allowable range, and adjustments are only applied if all lines of evidence suggest that it is warranted. In GLODAPv2.2023, as in GLODAPv2.2022, no adjustment smaller than 10 % has been applied.
Figure 6
Example of plots used as basis for the SF QC procedure. Shown are the results for cruises 096U20160426 (left) and 320620170703 (right). (a, e) CFC-12 versus pressure for the specific cruise (red), together with all data from the corresponding GLODAP region (Pacific in this case; gray). (b, f) Same as the upper row but for SF. (c, g) CFC-12 versus SF (red dots). Here the measured contents have been converted into atmospheric mixing ratios. Solid black lines show the atmospheric time history of CFC-12 versus that of SF. Dotted lines show CFC-12 versus SF derived from the TTD method for two different sets of TTD parameters. (d, h) CFC-12 vs. SF saturation for the surface layer ( dbar), where the numbers give the mean saturation.
[Figure omitted. See PDF]
As TTD, we use an inverse Gaussian function, which can be described by two parameters: the mean age () and the width () (Hall et al., 2002). Typically, the ratios of are chosen as a fixed parameter, and is varied. Here, we use a range of between 0 and 2000 years and two values for , namely 0.5 and 2. This range of TTD parameters reproduces a simultaneous observation of different tracers, like CFC-12 and SF, when calculating the tracer contents from the TTD and the atmospheric mixing ratio (Steinfeldt et al., 2009). Typically, for the same CFC-12 value derived from the TTD, the corresponding SF value increases with the ratio of the TTD, and it also increases with decreasing saturation (). As the range for the expected SF to CFC-12 relation, we use the TTD with / and as the lower boundary and the TTD with / and 80 % saturation as the upper boundary. In some cases, like deep-water formation or an ice-covered region, the tracer saturation might be lower, as the minimum of 65 % from Steinfeldt et al. (2009) indicates, but the majority of the data is actually located between our assumed lower and upper boundaries (see the results for cruise 096U20160426 in Fig. 6). A few exceptions are found for cruises in the Southern Ocean, as has already been shown in Stöven et al. (2015). Note that in 1996, a SF release experiment was performed in the Greenland Sea (Watson et al., 1999). This leads to a large excess of SF compared to CFC-12 in the Nordic Seas, which is clearly visible in our analyses and hampers the quality control of the SF data in this region.
3.3 Merged product generationThe merged product file for GLODAPv2.2023 was created by updating cruises and correcting known issues in the GLODAPv2.2022 merged file and then appending a merged and bias-corrected file containing the 23 new cruises – sorted according to EXPOCODE, station, and pressure – to this updated GLODAPv2.2022 file. GLODAP cruise numbers were assigned consecutively, starting from 5001, so they can be distinguished from the GLODAPv2.2022 cruises, which ended at 4096. The merging was otherwise performed following the procedures used for previous GLODAP versions (Olsen et al., 2019, 2020; Lauvset et al., 2021).
3.3.1 Updates and corrections for GLODAPv2.2022
For GLODAPv2.2023 we made some minor updates to cruises included in GLODAPv2.2022 (and earlier versions).
Table A2 in the Appendix shows a list of the cruises that have been updated, as well as what the update consists of. In addition, a small number of duplicate data points were removed, and we fixed some rounding errors which resulted in the combination of station, cast, and bottle numbers not being unique for several cruises.
4 Secondary quality control results and adjustments
The secondary QC has five possible outcomes which are summarized in Table 5, along with the corresponding codes that appear in the online adjustment table and that are also occasionally used as shorthand for decisions in the text below. Some cruises were not applicable for full secondary QC. Specifically, in some cases, data were too shallow or geographically too isolated for full and conclusive consistency analyses. In other cases, the results of these analyses were inconclusive, but we have no reason to believe that the data in question are of poor quality. A secondary QC flag has been included in the merged product files to enable their identification, with “0” used for variables and cruises not subjected to full secondary QC (corresponding to code in Table 5) and “1” for variables and cruises that were subjected to full secondary QC. The secondary QC flags are assigned per cruise and variable, not for individual data points, and are independent of – and included in addition to – the primary (WOCE) QC flag on individual measurements. For example, interpolated (salinity, oxygen, and nutrients) or calculated (TCO, TAlk, and pH) values, which have a primary QC flag of 0, may have a secondary QC flag of 1 if the measured data these values are based on have been subjected to full secondary QC. Conversely, individual data points may have a secondary QC flag of 0, even if their primary QC flag is 2 (good data).
Table 5
Possible outcomes of the secondary QC and their codes in the online adjustment table.
Secondary QC result | Code |
---|---|
The data are of good quality, are consistent with the rest of the dataset, and should not be adjusted. | |
The data are of good quality but are biased: adjust by adding (for salinity, TCO, TAlk, and pH) or by multiplying (for oxygen, nutrients, and CFCs) the adjustment value. | Adjustment value |
The data have not been quality controlled, are of uncertain quality, and are suspended until full secondary QC has been carried out. | |
The data are of poor quality and excluded from the data product. | |
The data appear of good quality, but their nature, being from shallow depths and coastal regions without crossovers or similar, prohibits full secondary QC. | |
No data exist for this variable for the cruise in question. |
The value of 0 is used for variables with additive adjustments (salinity, TCO, TAlk, and pH) and 1 for variables with multiplicative adjustments (for oxygen, nutrients, and CFCs). This is mathematically equivalent to “no adjustment” in both cases.
The secondary QC actions for the 13 core variables and the distribution of adjustments applied on the 23 new cruises are summarized in Table 6 and Fig. 7, respectively. No salinity, oxygen, TCO, pH, or phosphate data were adjusted. Out of the 22 cruises with TAlk, data from only 1 were adjusted. For nitrate, data from 2 out of 20 cruises were adjusted, and for silicate, 4 out of 17 cruises were adjusted. For the CFCs, no cruises required adjustment. Of the 23 new cruises in GLODAPv2.2022, 7 include SF, and 1 required an adjustment. Overall, the magnitudes of the various adjustments applied are small, and the tendency observed during the production of the three previous updates remains, namely that the majority of recent cruises are consistent with earlier releases of the GLODAP data product. However, the new crossover and inversion analysis of all pH data in the northwestern Pacific that was planned following the release of GLODAPv2.2020 has not yet been performed. Such an analysis is planned for the next full update of GLODAP, i.e., GLODAPv3. Therefore, the conclusion from GLODAPv2.2020 remains that some caution should be exercised if looking at trends in ocean pH in the northwestern Pacific using GLODAPv2.2023 or earlier versions.
Table 6Summary of secondary QC results for the 23 new cruises (number of cruises per result and per variable).
Sal. | Oxy. | NO | Si | PO | TCO | TAlk | pH | CFC-11 | CFC-12 | CFC-113 | CCl | SF | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
With data | 23 | 14 | 20 | 17 | 20 | 22 | 22 | 12 | 7 | 8 | 5 | 0 | 7 |
No data | 0 | 9 | 3 | 6 | 3 | 1 | 1 | 11 | 16 | 15 | 18 | 96 | 16 |
Unadjusted | 12 | 10 | 8 | 6 | 10 | 12 | 12 | 8 | 7 | 7 | 5 | 0 | 6 |
Adjusted | 0 | 0 | 2 | 4 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
11 | 4 | 10 | 7 | 10 | 10 | 9 | 4 | 0 | 1 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
The data are included in the data product file as captured, with a secondary QC flag of 1. The adjusted data are included in the data product file with a secondary QC flag of 1. Data appear to be of good quality but have not been subjected to full secondary QC. They are included in data product with a secondary QC flag of 0. Data are of uncertain quality and suspended until full secondary QC has been carried out; they are excluded from the data product. Data are of poor quality and excluded from the data product.
Figure 7
Distribution of applied adjustments for each core variable that received secondary QC, in micromoles per kilogram (mol kg) for TCO and TAlk and unitless for salinity and pH (but multiplied by 1000 in both cases so a common axis can be used), while for the other properties adjustments are given in percent ((adjustment ratio ) ). Gray areas depict the initial minimum adjustment limits. The figure includes numbers for data subjected to secondary quality control only. Note also that the -axis scale is set to render the number of adjustments visible, so the bar showing zero offset (the 0 bar) for each variable is cut off (see Table 6 for these numbers).
[Figure omitted. See PDF]
For the nutrients, adjustments were applied to maintain consistency with data included in GLODAPv2.2022 and earlier versions. An alternative goal for the adjustments would be maintaining consistency with data from cruises that employed reference materials (RMNS) to ensure the accuracy of nutrient analyses. Such a strategy was adopted by Aoyama (2020) for preparation of the Global Nutrients Dataset 2013 (GND13) and is being considered for GLODAP as well. However, as this would require a re-evaluation of the entire dataset, this will not occur until the next full update of GLODAP. For now, we note the overall agreement between the adjustments applied in these two efforts (Aoyama, 2020) and that most disagreements appear to be related to cases where no adjustments were applied in GLODAP.
The improvement in data consistency resulting from the secondary QC process is evaluated by comparing the weighted mean of the absolute offsets for all crossovers before and after the adjustments have been applied. This “consistency improvement” for core variables is presented in Table 7. The data for CFCs were omitted from these analyses for previously discussed reasons (Sect. 3.2.5). Globally, the improvement is modest. Considering the initial data quality, this result was expected. However, this does not imply that the data were initially consistent everywhere. Rather, for some regions and variables, there can be substantial improvements when the adjustments are applied.
Table 7Improvements resulting from quality control of the 23 new cruises per basin and for the global dataset. The values in the table are the weighted mean of the absolute offset of unadjusted and adjusted data versus GLODAPv2.2022. The total number of valid crossovers in the global ocean for the variable in question is . The values in this table represent the inter-cruise consistency in the GLODAPv2.2023 product.
Arctic | Atlantic | Indian | Pacific | Global | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Unadj. | Adj. | Unadj. | Adj. | Unadj. | Adj. | Unadj. | Adj. | Unadj. | Adj. | (global) | ||||||
Sal. () | 2.7 | 2.7 | 4.0 | 4.0 | NA | NA | 1.4 | 1.4 | 2.0 | 2.0 | 462 | |||||
Oxy (%) | 1.1 | 1.1 | 0.8 | 0.8 | NA | NA | 0.5 | 0.5 | 0.6 | 0.6 | 393 | |||||
NO (%) | 1.4 | 1.4 | 1.5 | 1.5 | NA | NA | 1.4 | 0.5 | 1.4 | 0.8 | 207 | |||||
Si (%) | 8.1 | 8.1 | 5.1 | 5.1 | NA | NA | 1.2 | 0.6 | 1.5 | 1.0 | 357 | |||||
PO (%) | 3.7 | 3.7 | 1.9 | 1.9 | NA | NA | 0.9 | 0.9 | 1.0 | 1.0 | 348 | |||||
TCO (mol kg) | 2.8 | 2.8 | 5.8 | 5.8 | NA | NA | 1.8 | 1.8 | 2.8 | 2.8 | 176 | |||||
TAlk (mol kg) | 4.7 | 4.7 | 2.5 | 2.5 | NA | NA | 2.9 | 2.0 | 2.9 | 2.3 | 202 | |||||
pH () | 8.8 | 8.8 | 9.7 | 9.7 | NA | NA | 6.5 | 6.3 | 8.2 | 8.2 | 142 |
NA stands for not available.
The various iterations of GLODAP provide insight into initial data quality covering more than 4 decades. Figure 8 summarizes the applied absolute adjustment magnitude per decade. These distributions are broadly unchanged compared to GLODAPv2.2022 (Fig. 8 in Lauvset et al., 2022). Most TCO and TAlk data from the 1970s needed an adjustment, but this fraction steadily declines until only a small percentage is adjusted in recent years. This is encouraging and demonstrates the value of standardizing sampling and measurement practices (Dickson et al., 2007), the widespread use of CRMs (Dickson et al., 2003), and instrument automation. The pH adjustment frequency also has a downward trend; however, issues remain with the pH adjustments, and this is a topic for future development in GLODAP, with the support from the Ocean Carbon & Biogeochemistry (OCB) Ocean Carbonate System Intercomparison Forum (OCSIF,
Table listing the number of data points in GLODAPv2.2023, as well as the number of data with various combinations of variables.
Variables | No. of records |
---|---|
All core (salinity, oxygen, nitrate, silicate, phosphate, TCO, TAlk, pH, CFC-11, CFC-12, CFC-113, CCl, and SF) | 174 |
All core, except SF | 2029 |
Salinity, oxygen, nitrate, silicate, phosphate, CFC-11, CFC-12, CFC-113, CCl, and SF plus two of TCO, TAlk, and pH | 636 |
Salinity, oxygen, nitrate, silicate, phosphate, TCO, TAlk, and pH | 172 554 |
CFC-11, CFC-12, CFC-113, CCl, and SF | 926 |
At least one transient tracer species or SF | 433 932 |
SF | 103 791 |
Two out of the three CO chemistry core variables (TCO,TAlk, and pH) | 460 274 |
Measured CO | 35 387 |
Salinity, oxygen, nitrate, silicate, and phosphate | 873 641 |
Salinity and oxygen | 1 182 940 |
No salinity | 28 777 |
Total in GLODAPv2.2023 | 1 402 829 |
Figure 8
Magnitude of applied adjustments relative to minimum adjustment limits (Table 3) per decade for the 1108 cruises included in GLODAPv2.2023.
[Figure omitted. See PDF]
Figure 9
Locations of stations included in the (a) Arctic, (b) Atlantic, (c) Indian, and (d) Pacific ocean product files for the complete GLODAPv2.2023 dataset.
[Figure omitted. See PDF]
5 Data availabilityThe GLODAPv2.2023 merged and adjusted data product is archived at the OCADS of NOAA NCEI (
All material produced during the secondary QC is available via the online GLODAP adjustment table hosted by GEOMAR, Kiel, Germany, at
The original cruise files, with updated flags determined during additional primary GLODAP QC, are available through the GLODAPv2.2023 cruise summary table (CST) hosted by OCADS at
While GLODAPv2.2023 is made available without any restrictions, users of the data should adhere to the fair data use principles; for investigations that rely on a particular (set of) cruise(s), recognize the contribution of GLODAP data contributors by at least citing both the cruise DOI and any articles where the data are described and, preferably, contacting the principal investigators to explore opportunities for collaboration and co-authorship. To this end, DOIs are provided in the product files, together with relevant articles and principal investigator names in the cruise summary table. Contacting principal investigators comes with the additional benefit that the principal investigators often possess expert insight into the data and/or specific region under investigation. This can improve scientific quality and promote data sharing.
This paper should be cited in any scientific publications that result from usage of the product. Citations provide the most efficient means to track the data use, which is important for attracting funding to enable the preparation of future updates.
6 Summary
GLODAPv2.2023 is an update of GLODAPv2.2022. Data from 23 new cruises have been added to supplement the earlier release. GLODAP now includes 48 years, 1972–2021, of global interior ocean biogeochemical data from 1108 cruises. The total number of data records is 1 402 829 (Table 8). Records with measurements for all 13 core variables (salinity, oxygen, nitrate, silicate, phosphate, TCO, TAlk, pH, CFC-11, CFC-12, CFC-113, CCl, and SF) are very rare (174), and requiring only two out of the three core seawater CO chemistry variables, in addition to all the other core variables, is still very rare with only 636 records (Table 8). A major limiting factor to having all core variables together is the simultaneous availability of data for all four transient tracer species and SF. In GLODAPv2.2023 there are 103 791 records with SF data and 433 932 records with at least one transient tracer or SF. A total of 2 % (28 777) of all data records do not have salinity. There are several reasons for this, with the main one being the inability to vertically interpolate due to a separation between the measured samples that is too large. Other reasons for missing salinity include the salinity not being reported and missing depth or pressure, which leads to all other variables being set to but not the removal of that line in the data product.
As for previous versions, there is a bias toward summertime in the data in both hemispheres; most data are collected during April through November in the Northern Hemisphere, while most data are collected during November through April in the Southern Hemisphere (Fig. 10). These tendencies are strongest for the Arctic and Antarctic regions and reflect the harsh conditions during winter months which make fieldwork difficult. The upper 100 m is the best-sampled part of the global ocean, both in terms of the number (Fig. 11a) and density (Fig. 11b) of observations. The number of observations steadily declines with depth. In part, this is caused by the reduction in ocean volume towards greater depths. Below 1000 m, the density of observations stabilizes and even increases between 5000 and 6000 m; the latter is a zone where the volume of each depth surface decreases sharply (Weatherall et al., 2015). In the deep trenches, i.e., areas deeper than 6000 m, both the number and density of observations are low.
Figure 10
Distribution of data in GLODAPv2.2023 in (a) December–February, (b) March–May, (c) June–August, and (d) September–November, as well as (e) the number of observations for each month in four latitude bands.
[Figure omitted. See PDF]
Figure 11
Number (a) and density (b) of observations in 100 m depth layers. The latter was calculated by dividing the number of observations in each layer by its global volume calculated from ETOPO2 (National Geophysical Data Center, 2006). For example, in the layer between 0 and 100 m, there are on average 0.0075 observations per cubic kilometer. One observation is one water sampling point and has data for several variables.
[Figure omitted. See PDF]
Except for salinity and oxygen, the core data were collected exclusively through chemical analyses of collected water samples. The data of the 13 core variables were subjected to primary quality control to identify questionable or bad data points (outliers) and secondary quality control to identify systematic measurement biases. The data are provided in two ways: as a set of individual exchange-formatted original cruise data files with assigned WOCE flags and as globally and regionally merged data product files with adjustments applied to the data according to the outcome of the consistency analyses. Importantly, no adjustments were applied to the data in the individual cruise files while primary QC changes were applied.
The consistency analyses were conducted by comparing the data from the 23 new cruises to the previous data product GLODAPv2.2022. Adjustments were only applied when the offsets were believed to reflect biases relative to the earlier data product release related to measurement calibration and/or data-handling practices and not to natural variability or anthropogenic trends. The adjustment table at
Secondary QC flags are included for the 13 core variables in the product files. These flags indicate whether (1) or not (0) the data successfully received secondary QC. A secondary QC flag of 0 does not by itself imply that the data are of lower quality than those with a flag of 1. It means these data have not been as thoroughly checked. For C, the QC results by Becker et al. (2016) for the North Atlantic were applied, and a secondary QC flag was therefore added to this variable.
The primary WOCE QC flags in the product files are simplified (e.g., all questionable and bad data were removed). For salinity, oxygen, and the nutrients, any data flagged 0 are interpolated rather than measured. For TCO, TAlk, pH, and CO, any data flags of 0 indicate that the values were calculated from two other measured seawater CO variables. Finally, while questionable (WOCE flag 3) and bad (WOCE flag 4) data have been excluded from the product files, some may have gone unnoticed through our analyses. Users of the data product are encouraged to report on any data that appear suspicious.
Based on the initial minimum adjustment limits and the improvement in the consistency resulting from the adjustments (Table 7), the data subjected to consistency analyses are believed to be consistent with to better than 0.005 in salinity, 1 % in oxygen, 2 % in nitrate, 2 % in silicate, 2 % in phosphate, 4 mol kg in TCO, 4 mol kg in TAlk, and 5 % for the halogenated transient tracers and SF. For pH, the consistency among all data is estimated as 0.01–0.02, depending on the region. As mentioned above, the included CO data have not been subjected to quality control; therefore no consistency estimate is given for this variable. This should be conducted in future efforts.
Appendix A Supplementary tables Table A1Cruises included in GLODAPv2.2023 that did not appear in GLODAPv2.2022. Complete information on each cruise, such as variables included and chief scientist and principal investigator names, is provided in the cruise summary table at
No. | EXPOCODE | Region | Alias | Start | End | Ship |
---|---|---|---|---|---|---|
5001 | 18DD20150210 | Atlantic Ocean | Line P | 20150210 | 20150224 | John P. Tully |
5002 | 20HZ20151205 | Pacific Ocean | LowpHOX 1_1 | 20151205 | 20151209 | Cabo de Hornos |
5003 | 20HZ20151227 | Pacific Ocean | LowpHOX 1_2 | 20151127 | 20151128 | Cabo de Hornos |
5004 | 20HZ20180203 | Pacific Ocean | LowpHOX 2 | 20180203 | 20180206 | Cabo de Hornos |
5005 | 325020210316 | Atlantic Ocean | A20 | 20210316 | 20210416 | Thomas G. Thompson |
5006 | 325020210420 | Atlantic Ocean | A22 | 20210420 | 20210516 | Thomas G. Thompson |
5007 | 35TH20180611 | Atlantic Ocean | OVIDE 2018, GO_SHIP A25 | 20180611 | 20180714 | Thalassa |
5008 | 45CE20170206 | Atlantic Ocean | CE17002 | 20170206 | 20170211 | Celtic Explorer |
5009 | 45CE20180210 | Atlantic Ocean | CE18001 | 20180210 | 20180217 | Celtic Explorer |
5010 | 45CV20180128 | Atlantic Ocean | CV18001 | 20180128 | 20180206 | Celtic Voyager |
5011 | 45CV20190109 | Atlantic Ocean | CV19001 | 20190110 | 20190119 | Celtic Voyager |
5012 | 45CE20190524 | Atlantic Ocean | CV19005 | 20190524 | 20190605 | Celtic Explorer |
5013 | 46BS20020215 | Atlantic Ocean | IcelandSea; IrmingerSea | 20020215 | 20060529 | Arni Fridriksson, Bjarni Saemundsson |
5014 | 49NZ20210713 | Pacific Ocean | MR21-04, P01 | 20210713 | 20210826 | Mirai |
5015 | 49UP20210728 | Pacific Ocean | GO-SHIP P03, RF21-06 | 20210728 | 20210728 | Ryofu Maru III |
5016 | 49UP20210827 | Pacific Ocean | GO-SHIP P03, RF21-07 | 20210827 | 20210827 | Ryofu Maru III |
5017 | 49UP20210920 | Pacific Ocean | GO-SHIP P03, RF21-08 | 20210920 | 20210920 | Ryofu Maru III |
5018 | 58JH20110113 | Atlantic Ocean | 20110113 | 20170605 | Johan Hjort | |
5019 | 58JH20110121 | Atlantic Ocean | 20110121 | 20170402 | Johan Hjort | |
5020 | 58HJ20140110 | Arctic Ocean | 79N | 20140110 | 20140110 | Helmer Hansen |
5021 | 74EQ20191202 | Pacific Ocean | DY111 | 20191202 | 20200109 | Discovery |
5022 | 76XL20080730 | Arctic Ocean | CHINARE2008 | 20080730 | 20080911 | Xuelong |
5023 | 77DN20210725 | Arctic Ocean | SAS-Oden2021 | 20210725 | 20210920 | Oden |
List of cruises included in GLODAPv2.2022 which have been updated as part of GLODAPv2.2023. Complete information on each cruise, such as variables included and chief scientist and principal investigator names, is provided in the cruise summary table at
No. | EXPOCODE | Region | Alias | Update | Adjustment |
---|---|---|---|---|---|
721 | IcelandSea | Atlantic | LN6 | Added new data from 2014 until 2019; added CFC data for entire dataset | |
720 | IrmingerSea | Atlantic | FX9 | Added new data from 2014 until 2019 | |
1040 | 33HQ20150809 | Arctic | ARC01 | Added delC13 and delC14 data | |
34 | 06MT19921227 | Atlantic | 06MT22_5 | Oxygen data | |
1002 | 06AQ20120107 | Atlantic | ANTXXVIII-3 | SF6 data | – |
1109 | 74EQ20151206 | Atlantic | A05.2015 | Remove TCO adjustment and recalculate pH and CO | |
3 | 06AQ19870704 | Arctic | ARK IV/3 | Downward adjustment silicate | 0.96 |
695 | 74DI20040404 | Atlantic | A05 | Removed bad CO data and TCO calculated from those |
Note on former version
A former version of this article was published on 16 December 2022 and is available at 10.5194/essd-14-5543-2022.
Author contributions
SKL and TT led the team that produced this update. AK compiled the original data files and maintains the GLODAPv2 web pages at NCEI/OCADS. NL conducted the primary and secondary QC analyses. HCB conducted the CANYON-B and CONTENT analyses. CS manages the adjustment table e-infrastructure. TT, RS, and EJ performed the secondary QC on all transient tracers. All authors contributed to the interpretation of the secondary QC results and made decisions on whether to apply adjustments. NL, SKL, HCB, JDM, and MPH performed final checks of the data product. Many authors conducted ancillary QC analyses. SKL and NL updated the living data compilation with contributions from all authors.
Competing interests
At least one of the (co-)authors is a member of the editorial board of Earth System Science Data. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.
Disclaimer
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
GLODAPv2.2023 would not have been possible without the effort of the many scientists who secured funding, dedicated time to collect data, and shared the data that are included. Chief scientists on the various cruises and principal investigators for specific variables are listed in the online cruise summary table. The author team also want to thank the large GLODAP user community for useful input and notifications about potential issues in the data products. Such input is invaluable and helps ensure that GLODAP maintains its high quality and consistency over time. The author team thanks all members of the GLODAP reference group who were unable to participate in the product development this year. GLODAP is a largely unfunded volunteer effort, and we acknowledge that it is sometimes not possible to prioritize this work. GLODAP is, regardless, a team effort, and the entire group contributes to the continued success of GLODAP. This work is CICOES and PMEL contribution nos. 2023-1321 and 5563, respectively. This activity has been supported by the International Ocean Carbon Coordination Project (IOCCP), and the author team acknowledges funding to IOCCP provided by a grant to SCOR from the U.S. National Science Foundation (grant no. OCE-2140395).
Financial support
Nico Lange has been funded by EU Horizon 2020 through the EuroSea action (grant no. 862626). Siv K. Lauvset has received internal strategic funding from NORCE Climate. Are Olsen, Nico Lange, and Siv K. Lauvset have received funding from the EU Horizon Europe project OceanICU (grant no. 101083922). Leticia Cotrim da Cunha has been supported by the Prociencia 2022–2024 grant from Universidade do Estado do Rio de Janeiro (UERJ, Brazil) and the PQ2 309708/2021-4 grant from the National Council for Scientific and Technological Development (CNPq, Brazil). Marta Álvarez has been supported by an Instituto Español de Oceanografía (IEO) RADPROF project. Peter J. Brown has received partial funding from the UK Climate Linked Atlantic Sector Science (CLASS) NERC National Capability Long-term Single Centre Science Programme (grant no. NE/R015953/1). Anton Velo and Fiz F. Perez have been supported by the BOCATS2 (grant no. PID2019-104279GB-C21) project funded by MCIN/AEI/10.13039/501100011033 and the Horizon Europe project EuroGO-SHIP (grant no. 101094690). Brendan R. Carter has received funding from the Global Ocean Monitoring and Observing (GOMO) program of the National Oceanic and Atmospheric Administration (NOAA) through contributions (project no. 100007298) via the Cooperative Institute for Climate, Ocean, and Ecosystem Studies (CIOCES) under a NOAA Cooperative Agreement (grant no. NA20OAR4320271; contribution no. 2022–2012). Mario Hoppema has received funding from the EU Horizon 2020 Action SO-CHIC (grant no. 821001). Adam Ulfsbo has been supported by the Swedish Research Council Formas (grant no. 2018-01398). Jens Daniel Müller has received support from the European Union's Horizion 2020 research and innovation program for project 4C (grant no. 821003). Alex Kozyr has been supported by NOAA (grant no. NA19NES4320002; Cooperative Institute for Satellite Earth System Studies – CISESS) at the University of Maryland/ESSIC. Ryan J. Woosley has been supported by NSF grants (grant nos. OCE-1923312 and OCE-2148468). Masao Ishii has been supported by the Environment Research and Technology Development Fund of the Environmental Restoration and Conservation Agency of Japan (grant no. JPMEERF21S20810). GLODAP also acknowledges funding from the Initiative and Networking Fund of the Helmholtz Association through the project “Digital Earth” (grant no. ZT-0025).
Review statement
This paper was edited by François G. Schmitt and reviewed by Sebastien Petton and one anonymous referee.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The Global Ocean Data Analysis Project (GLODAP) is a synthesis effort providing regular compilations of surface to bottom ocean biogeochemical bottle data, with an emphasis on seawater inorganic carbon chemistry and related variables determined through chemical analysis of seawater samples. GLODAPv2.2023 is an update of the previous version, GLODAPv2.2022 (Lauvset et al., 2022). The major changes are as follows: data from 23 new cruises were added. In addition, a number of changes were made to the data included in GLODAPv2.2022. GLODAPv2.2023 includes measurements from more than 1.4 million water samples from the global oceans collected on 1108 cruises. The data for the now 13 GLODAP core variables (salinity, oxygen, nitrate, silicate, phosphate, dissolved inorganic carbon, total alkalinity, pH, chlorofluorocarbon-11 (CFC-11), CFC-12, CFC-113, CCl
The original data, their documentation, and DOI codes are available at the Ocean Carbon and Acidification Data System of NOAA National Centers for Environmental Information (NCEI), which also provides access to the merged data product. This is provided as a single global file and as four regional ones – the Arctic, Atlantic, Indian, and Pacific oceans – under
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



















1 NORCE Norwegian Research Centre, Bjerknes Centre for Climate Research, Bergen, Norway
2 GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
3 Leibniz Institute for Baltic Sea Research Warnemünde, Rostock, Germany
4 Geophysical Institute, University of Bergen and Bjerknes Centre for Climate Research, Bergen, Norway
5 NOAA National Centers for Environmental Information, Silver Spring, MD, USA
6 Instituto Español de Oceanografía, IEO-CSIC, A Coruña, Spain
7 Department of Fisheries and Oceans, Bedford Institute of Oceanography, Dartmouth, NS, Canada
8 National Oceanography Centre, Southampton, UK
9 Cooperative Institute for Climate Ocean and Ecosystem Studies, University of Washington, Seattle, WA, USA; Pacific Marine Environmental Laboratory, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
10 PPG-Oceanografia, Faculdade de Oceanografia, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil
11 Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, Bremerhaven, Germany
12 Department of Ocean Systems (OCS), NIOZ Royal Netherlands Institute for Sea Research, Texel, the Netherlands
13 Meteorological Research Institute, Japan Meteorological Agency, Tsukuba, Japan
14 Research Institute for Global Change, Japan Agency for Marine–Earth Science and Technology, Yokosuka, Japan
15 Environmental Physics, Institute of Biogeochemistry and Pollutant Dynamics, ETH Zürich, Zurich, Switzerland
16 Instituto de Investigaciónes Mariñas, IIM – CSIC, Vigo, Spain
17 Institute of Environmental Physics, University of Bremen, Bremen, Germany
18 Marine Information Research Center, Japan Hydrographic Association, Tokyo, Japan
19 Department of Marine Sciences, University of Gothenburg, Gothenburg, Sweden
20 Center for Global Change Science, Massachusetts Institute for Technology, Cambridge, MA, USA
21 Atmospheric and Oceanic Sciences, Princeton University, Princeton, NJ 08540, USA