Introduction
During the global SARS-CoV-2 outbreak starting in late 2019, several epidemiological measures have been used to inform the general public and policy makers. Occasionally, politicians used epidemiological measures to justify interventions such as the obligation to wear face masks or even lockdown. In Germany, for instance, thresholds regarding incidence rates have been implemented into national laws as necessary conditions for relaxation of lockdown, e.g., school-openings or release of dusk-to-dawn curfews [1].
The most prominent epidemiological measure is the cumulative number of people tested positive, which we will call cumulative case-count (CCC). The CCC has been frequently reported on a daily basis on country level (e.g., by the COVID-19 Dashboard of the Johns Hopkins University [2]). To allow cross-country comparisons, CCC or the numbers of people tested positive often have been adjusted for population size, e.g. in the COVID-19 situation reports of the WHO [3]. Some media reported doubling times and effective reproduction numbers (Reff). For instance, in a nationally widely perceived press conference Germany’s Chancellor Dr. Angela Merkel declared the political goal to reach doubling greater than ten days [4].
Whether someone is reported as an incident case, is based on the results of the diagnostic procedures carried out. Apart from anamnesis, frequently used diagnostic tests are nasopharyngeal swab tests followed by reverse transcriptase polymerase chain reaction (RT-PCR) to detect viral RNA. If the swab is done properly, RT-PCR has a moderate to high diagnostic sensitivity and a very high specificity for SARS-CoV-2 detection [5]. At least in theory, all incident cases can be identified correctly if enough screening and diagnostic efforts are spent. In practice, of course, only limited resources are available and it is likely that incident cases have been missed. Restrictions in applying diagnostic procedures are, e.g., limitations in the number of available test kits and personnel or the local policy of running diagnostic tests. Local and even national eligibility criteria for tests have been changed during the time of the epidemic. Together with the fact that several people with the associated COVID disease show only asymptomatic or mild courses and presumably have not been considered for diagnosis of an infection at any time, the varying testing frequency leads to the hypothesis that the actual disease was at least partially underestimated [6]. We call an infectious subject who in principle qualifies as a case but has not been detected, because she or he has not been (properly) diagnosed, an undetected case.
A useful measure for assessing case-detection is the proportion of all incident cases that were identified, i.e. the number of detected cases divided by the number of all cases (detected plus undetected cases). This proportion is referred to as the case detection ratio (CDR) [7].
The aim of this analysis is to determine the bias of the epidemiological measures used to inform the public and the decision makers due to incomplete case-detection. We conducted a simulation study using the epidemiological parameters of SARS-CoV-2 and mimic incomplete case-detection by assuming scenarios about the CDR. Next, the epidemiological measures are estimated in the presence of incomplete case-detection, and subsequently, the estimations are compared to the true values underlying the simulation.
Materials and methods
We use the infection-age-structured SIR model [8] to simulate the spread of SARS-CoV-2 in a hypothetical population. In the infection-age-structured SIR as well as in the conventional SIR model, the population is partitioned into three states, the susceptible state, the infected and the removed state. The initial letters of the three states give the model’s name `SIR´ (Fig 1). The infection-age-structured SIR model is a generalization of the SEIR model, which in turn is a generalization of the frequently used SIR model. Compared to the simple SIR, the SEIR comprises an additional latency state (termed ’E’ for exposed) when individuals are infected but cannot yet infect others. In this sense, the infection-age-structured SIR model is the most general model which contains SEIR as well as SIR as special cases [8]. We have chosen the infection-aged structured SIR to allow flexibility in the temporal dynamics of transmissibility (for details we refer to the S1 File).
[Figure omitted. See PDF.]
The population under consideration is divided into three states susceptible (prone to be infected), infected and removed (recovered or dead).
The removed state comprises people recovered and deceased from the infected state [9]. As usual in infectious disease epidemiology, changes in numbers of people in the disease states are modeled quantitatively by differential equations, which are presented in the S1 File and can be omitted in favor of keeping the focus on the epidemiological measures used to describe the epidemic. The key part of the model is the transmission rate β, which describes the transmission of the virus from an infectious to a susceptible individual at a given time (see the left arrow in Fig 1).
The infection-age-structured SIR model is a minor modification of the SIR model, where the transmission rate β depends on two time-scales: the infection age τ, i.e., the time since virus transmission from an infectious to a susceptible individual, and calendar time t. The latter is often called period in epidemiological contexts. The dependency on the infection age τ reflects the evidence that the risk of SARS-CoV-2 transmission depends on the time since infection. Parameters of the infection-age-structured SIR model are chosen according to the best knowledge of the SARS-CoV-2 virus. For introductory texts about the SIR model, we refer the reader to [8, 9] and the references in the S1 File.
Governments of many countries decided to invoke one or more lockdowns, which led us to the idea of simulating three consecutive periods of the epidemic: a phase of increasing number of infections from t = 0 to t = 25 (days), a phase of implementation of a (strict) lockdown (from t = 25 to t = 30) followed by a phase post-lockdown when the pandemic is controlled (from t = 30 to t = 60). Start of implementation of lockdown happens at t = 25 (days). We do not make assumptions which specific public health interventions lead to the reduction in transmission e.g. contact tracing, mask wearing, stay-at-home orders, quarantine policies or a combination of these. In this work, we intend to have a realistic dynamic range in the disease transmission rate β imposed by public health interventions, irrespective of how these are obtained in detail.
The five day period following the start of the lockdown was chosen as wash-in phase. In real populations, public health interventions usually require some time before taking full effect. After this wash-in period, we assumed that the effect of the lockdown remains unaltered until the end of the simulation. To assess if our findings are robust with respect to the length of the wash-in period and the assumption that effect of the lockdown remains unaltered, we performed sensitivity analyses where we released these assumptions. The results of these sensitivity analyses are presented in Part 1 of the S2 File. Fig 2 shows the marginal distributions of the transmission rate β. The left and right part present the dependency on the calendar time t and the infection-age τ. Note the steep drop in the transmission rate β in the left part of Fig 2 resulting from the lockdown.
[Figure omitted. See PDF.]
The left panel shows the time dependent part βt of the transmission rate β during three phases of the pandemic: before lockdown at t = 25 (days), installation of lockdown (from t = 25 to t = 30, between vertical dotted lines) and control of the disease (t > 30). The right panel shows the component βτ of the transmission rate β as a function of the infection age τ. More details can be found in Section 4 of the S1 File.
All technical details about the model are given in the S1 File. The source code for use in the free Statistical Software R, version 3.6.3 (The R Foundation for Statistical Computing) is available in the Zenodo open public repository [10].
In the case of the novel coronavirus SARS-CoV-2, new infectious cases are often reported daily [2]. If the number of newly incident cases reported on day t is denoted by Ft(o) (the superscript ’o’ indicates observed), the case-detection ratio (CDR) is the proportion of these reported cases in relation to the actual (true) but unknown number of newly incident cases Ft [7]:(1)
If we neglect false positive findings (which is reasonable in highly specific tests), the CDR is a proportion ranging from 0 to 100%. In the case of complete detection of all incident infectious cases on day t, the CDRt would equal 100% on that day.
We use different scenarios about the temporal variations of the CDR and analyze the biases imposed to the epidemiological measures 1) CCC, 2) incidence rate, 3) Reff and 4) doubling time.
1. The frequently used CCC reported on day t simply adds the number of incident cases until day t:(2)
Instead of the sum of true (but unknown) incident cases, the observed CCCt(o) only includes the observed number of incident cases Ft(o).
2. If the daily incidence rates are reported, Ft and Ft(o) are usually referred to the overall size of the population of the corresponding region, e.g. in the WHO’s weekly epidemiological updates on COVID-19 [3]. In this manuscript, we refer to the incidence rate "per person per day".
3. The effective reproduction number Reff is the average number of secondary infectious cases that one primary case infects during their infectious period [11]. Reff can be estimated from the number of incident cases Ft. A variety of definitions of the effective reproduction number exist and we refer to the instantaneous reproduction number [12].
4. "Doubling times" have often been reported during the temporal course of the pandemic. Originally, the doubling time at some point in time (t) is defined as the time span Δ = Δ(t) until the number of infectious cases in the population doubles [9]. As the number of infections is not easily accessible, sometimes the "doubling time" refers to the time the incidence [13] or "cumulative incidence doubles" [14]. Thus, we use the defining condition CCCt+Δ. = 2 CCCt.
The hypothetical scenarios for the CDR during the pandemic are shown in Fig 3.
[Figure omitted. See PDF.]
CDRs during the pandemic in the four simulated scenarios A to D.
In scenarios A and B we assume a low and moderate CDR without temporal trends. These scenarios have been chosen to see if the absolute level of the CDR makes a difference in the four outcome measures. We have not added a scenario with (close to) perfect case detection, because in this case all four epidemiological indices will perform well. In addition, especially in the early days of the pandemic when testing resources have been sparse, a scenario with close high case detection seems unrealistic. Scenarios C and D simulate a CDR that changes over the period of 60 days including a trend reversal at some time. Scenario C mimics the situation when testing capacities are limited at the early phase of a pandemic and are increased after a while, for instance, by investing in resources for testing programs. Scenario D can partly be seen as the opposite of scenario C. After a phase of high CDR in scenario D, testing capacities are increasingly limited for instance by shortage of test supplies. The phase of decreasing CDR is followed by a phase of rising CDR, e.g. by having overcome shortage of test supplies.
After running the simulation, we mimic incomplete case-detection by applying the scenarios A to D about the CDR, which will lead to incomplete case-detection. Then, the four epidemiological measures are estimated in the presence of incomplete case-detection. These estimates are compared to the true values underlying the simulation during the simulated time period of 60 days. Bias is expressed in terms of relative errors, i.e., 100% (T − E)/T, where T and E denote the true value from the simulation and the estimated value from the observed data in the presence of incomplete case-detection, respectively. The numerical values of the relative errors are assessed at days 15, 30, 45 and 60. In summary, the workflow for running the simulation and making the comparisons is shown in Fig 4.
[Figure omitted. See PDF.]
Work flow for running the simulation and assessing the bias from incomplete case-detection.
Results
Fig 5 shows the biases over time in terms of relative errors of the four epidemiological indices from day 0 to day 60 (end of the simulation). A small distance to the horizontal line at value 0 (solid line in each of the four panels) is advantageous. The farther away from the horizontal line at value 0, the higher is the relative error in absolute terms. We see that Reff and the doubling times outperform the CCC and the incidence rate.
[Figure omitted. See PDF.]
Bias (relative error) of the four epidemiological measures in the four simulated scenarios A to D (indicated by the line type): Cumulative case count (top left), incidence rate (top right), effective reproduction number (bottom left) and doubling time (bottom right). A desirable zero bias is indicated by a horizontal solid line at value 0.
Table 1 shows the associated biases in terms of the relative errors at days 15, 30, 45 and 60 of the pandemic. In the table, we observe that in case of the CCC and the incidence rate, none of the relative errors are below 10% in magnitude. In fact, relative errors of CCC and incidence rate can reach up to 90% in magnitude. In all examined scenarios, the effective reproduction number and the doubling time outperform the CCC and the incidence rate. In absolute values, the effective reproduction number is the most unbiased measure while the most frequently used CCC is the least accurate measure in the tested settings.
[Figure omitted. See PDF.]
Discussion
In this work we compared four epidemiological measures that have been used to inform the general public about the SARS-CoV-2 outbreak. Partly, the epidemiological measures entered the public discussion and have been used as a basis for political decisions. For example, national laws in Germany permit school-openings after lockdown if incidence rates are below a certain threshold [1]. Changes of the epidemiological measures have been used to justify interventions such as face masking, travel restrictions and lockdown. Hence, we would expect that the measures are robust with respect to the problem of incomplete case-detection. We found that particularly two measures have not proven to be robust: the frequently used cumulative case counts (CCC) and the incidence rates. The effective reproduction number Reff and the doubling times are less prone to imperfect case detection. Although all four measures are interrelated by using the (daily) number of reported cases Ft(o). CCC and incidence rate use Ft(o) in an absolute way, either by summing up (CCC) or by relating them to the population (incidence rate). The effective reproduction number Reff and the doubling time use Ft(o) in a relative way. Fraser’s method to estimate Reff is based on a quotient (Equation (11) in S1 File), where the case detection ratio (CDR) cancels out in the numerator and denominator. The doubling time is estimated by the slope of the graph of log(CCC) over time (see Section 3.4 in the S1 File). Estimating the slope in a logarithmic curve is robust with respect to multiplicative changes (because taking the logarithm of a product shifts the curve up- or downwards but does not change the slope). Thus, Reff and doubling time are less prone to effects of imperfect case detection.
With a view to these limitations, for future information of the general public, policy makers and national legislators, we recommend to report and place an emphasis on the relative measures effective reproduction number and doubling time in combination with other reported measures such as e.g., hospitalization or intensive care admittance rates. Although we recommend the use of the effective reproduction number and the doubling time, we acknowledge that in the current implementation these measures are estimable no earlier than day 10 after the outbreak, which we think is a minor drawback compared to the possibly high amount of bias by incomplete case-detection in the remaining measures.
The reason for using the infection-age-structured SIR model in our simulation is that this model comprises the SEIR and SIR models as special cases.
Our simulation has a time horizon of 60 days, which is shorter than any real wave of the SARS-CoV-2 pandemic and possibly shorter than a real lockdown. The rationale of this choice was to demonstrate a realistic dynamic range of a wave of the pandemic. Of course, the wave pattern with a phase of a large number of transmissions followed by a phase of lower number of transmissions arises more than once. However, the conclusions of these simulations requires only a realistic range of transmissions, high and low disease activity, no matter what the reasons for these phases with different disease dynamics are (seasonal variations, mask mandates, stay-at-home orders etc). Modeling of different waves, different variants of concern, inclusion of vaccination strategies, possible re-infections, migration, and different fatality rates in different age groups are beyond the scope of this work. We note that in later waves of the pandemic the percentage of asymptomatic cases are likely to be higher than in earlier waves. In terms of the case detection, this means that later waves have a lower CDR than earlier ones, which strikes the importance of reporting the effective reproduction number in successive COVID-19 waves.
Since we are mainly interested in epidemiological measures that assess disease dynamics, we have chosen a relatively simple model without including known risk factors for severe COVID-related complications, such as e.g., age, prevalent co-morbidities or low socio-economic position. Hence, our work should not be interpreted for evaluation of specific interventions or in any prognostic setting. The main focus of the paper are gradual changes in CDRs due to changes in awareness, changes in test policies, availability of test kits etc. Usually, these changes take effect over several days, especially in larger populations. Fluctuations on a daily basis are beyond the scope of our analyses and in extreme cases may lead to very unreliable estimates in any of the four considered epidemiological measures.
Furthermore, we do not discuss practical problems arising with reporting incident cases such as reporting delays, missing data about symptom onset [15] or that lower rates of case-detection are more likely in asymptomatic cases of COVID-19 than in severe cases. However, all four compared epidemiological measures are affected by these problems and our comparison is fair in that respect.
Conclusions
With a view to future reporting about this or other epidemics, we recommend to use of the effective reproduction number for informing the general public and policy makers. In informing the general public about SARS-CoV-2, we frequently find comparisons between countries. For example, the Corona dashboard of the John Hopkins University simultaneously presents the cumulative case-count for many different counties on a world map [2]. In case that national test strategies differ from one country to another, the possibly biased cumulative case-count or incidence rates leads to erroneous conclusions in cross-country comparisons.
Supporting information
S1 File. Mathematical background.
Formulas and description of the infection-age-structured model and the methods of calculating the four epidemiological figures.
https://doi.org/10.1371/journal.pone.0276311.s001
S2 File. Additional sensitivity analyses.
Effects of changes in the temporal pattern of the pandemic and population size.
https://doi.org/10.1371/journal.pone.0276311.s002
(DOC)
Citation: Brinks R, Küchenhoff H, Timm J, Kurth T, Hoyer A (2022) Epidemiological measures for assessing the dynamics of the SARS-CoV-2-outbreak: Simulation study about bias by incomplete case-detection. PLoS ONE 17(10): e0276311. https://doi.org/10.1371/journal.pone.0276311
About the Authors:
Ralph Brinks
Roles: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing
E-mail: [email protected]
Affiliation: Chair for Medical Biometry and Epidemiology, Faculty of Health/School of Medicine, Witten/Herdecke University, Witten, Germany
https://orcid.org/0000-0003-0961-6592
Helmut Küchenhoff
Roles: Methodology, Writing – review & editing
Affiliation: Department of Statistics, Ludwig-Maximilians-University Munich, Munich, Germany
Jörg Timm
Roles: Methodology, Writing – review & editing
Affiliation: Medical Faculty, Institute of Virology, University Hospital Düsseldorf, Düsseldorf, Germany
Tobias Kurth
Roles: Methodology, Writing – review & editing
Affiliation: Institute of Public Health, Charité—Universitätsmedizin Berlin, Berlin, Germany
Annika Hoyer
Roles: Conceptualization, Methodology, Writing – review & editing
Affiliation: Biostatistics and Medical Biometry, Medical School OWL, Bielefeld University, Bielefeld, Germany
1. German law for prevention and mitigation of infectious diseases §28a: Protective measures against the spread of the corona-virus disease-2019 (COVID-19) [Gesetz zur Verhütung und Bekämpfung von Infektionskrankheiten beim Menschen § 28a Besondere Schutzmaßnahmen zur Verhinderung der Verbreitung der Coronavirus-Krankheit-2019 (COVID-19)]
2. Johns Hopkins University of Medicine, Coronavirus Resource Center (CRC) by the Center for Systems and Engineering (CSSE), https://coronavirus.jhu.edu/map.html, accessed on May 11th, 2021
3. World Health Organization, Coronavirus disease (COVID-19) situation reports, https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports, accessed May 11th, 2021
4. Merkel A. Transcript of press conference about the corona crisis. 2020 Apr 1 [Cited 2021 May 11]. Available from: https://www.bundesregierung.de/breg-de/suche/telefon-pressekonferenz-von-bundeskanzlerin-merkel-ministerpraesident-soeder-und-dem-ersten-buergermeister-tschentscher-nach-der-schaltkonferenz-mit-den-regierungschefinnen-und-regierungschefs-der-laender-1738578
5. Watson J, Whiting PF, Brush JE. Interpreting a covid-19 test result BMJ 2020; 369:m1808, pmid:32398230
6. Lau H, Khosrawipour T, Kocbach P, Ichii H, Bania J, Khosrawipour V. Evaluating the massive underreporting and undertesting of COVID-19 cases in multiple global epicenters. Pulmonology 2021;27(2):110–115. pmid:32540223
7. Borgdorff MW. New measurable indicator for tuberculosis case detection. Emer Infect Dis 2004;10(9): 1523, pmid:15498151
8. Inaba H, Age-structured Population Dynamics in Demography and Epidemiology, Singapore: Springer Nature, 2017
9. Vynnycky E, White R. An Introduction to Infectious Disease Modelling. Oxford: Oxford University Press, 2010
10. Brinks R. Source code to Brinks et al (2021): Epidemiological measures during the SARS-CoV-2-outbreak, Zenodo Repository 2021, https://doi.org/10.5281/zenodo.4750942
11. Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epi 2004; 178(9): 1505–12
12. Fraser C. Estimating Individual and Household Reproduction Numbers in an Emerging Epidemic. PLoS ONE 2007;2(8): e758, pmid:17712406
13. Mitra A, Pakhare AP, Roy A, Joshi A. Impact of COVID-19 epidemic curtailment strategies in selected Indian states: An analysis by reproduction number and doubling time with incidence modelling. PLOS ONE 2020; 15(9): e0239026, pmid:32936811
14. Muniz-Rodriguez K, Chowell G, Cheung C-H, Jia D, Lai P-Y, Lee Y, et al. Doubling time of the COVID-19 epidemic by province, China. Emerg Infect Dis. 2020, pmid:32330410
15. Günther F, Bender A, Katz K, Küchenhoff H, Höhle M. Nowcasting the COVID-19 pandemic in Bavaria. Biom J 2021;63(3):490–502. pmid:33258177
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 Brinks et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
During the SARS-CoV-2 outbreak, several epidemiological measures, such as cumulative case-counts (CCC), incidence rates, effective reproduction numbers (Reff) and doubling times, have been used to inform the general public and to justify interventions such as lockdown. It has been very likely that not all infectious people have been identified during the course of the epidemic, which lead to incomplete case-detection. We compare CCC, incidence rates, Reff and doubling times in the presence of incomplete case-detection. For this, an infection-age-structured SIR model is used to simulate a SARS-CoV-2 outbreak followed by a lockdown in a hypothetical population. Different scenarios about temporal variations in case-detection are applied to the four measures during outbreak and lockdown. The biases resulting from incomplete case-detection on the four measures are compared in terms of relative errors. CCC is most prone to bias by incomplete case-detection in all of our settings. Reff is the least biased measure. The possibly biased CCC may lead to erroneous conclusions in cross-country comparisons. With a view to future reporting about this or other epidemics, we recommend including and placing an emphasis on Reff in those epidemiological measures used for informing the general public and policy makers.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer