CS and MB are joint first authors.
Strengths and limitations of this study
This study offers the first evaluation of COVID-19 test performance with consideration of the heterogeneity of RT-PCR primers.
We compare the performance of manufacturer-based, laboratory/approval data to the performance of the same test kits in a real-world setting in the early phase of the pandemic.
We perform a qualitative analysis of RT-PCR assays using 85 studies (21 530 patients), and a quantitative meta-analysis of RT-PCR versus serum antibody assays in a subset of 30 publications (10 355 patients).
Much of the information in the early pandemic was reported from China, and often from non-peer reviewed, preprint sources.
Data measuring duration of the infection was not available in majority of included studies.
Introduction
In December 2019, there was a cluster of unexplained pneumonia cases in Wuhan, China, and a novel coronavirus was identified as the causative agent.1 The virus was named as SARS-CoV-2, and the disease as COVID-19.2 The clinical spectrum ranges from asymptomatic forms to acute respiratory failure and multiorgan dysfunction syndrome, coagulopathy and death.3 4 On 11 March 2020, the WHO described the spread of these infections as a pandemic, which persists as a global crisis. Robust diagnostic tests are required to mitigate the spread of this virus and thereby to minimise the impact of COVID-19 on the health, economy and social well-being of mankind.
The standard diagnosis of COVID-19 is based on clinical and radiologic evidence and viral genome detection by RT-PCR in respiratory samples.5 Gene-specific primers are used in the RT-PCR assays; structural genes include envelope (E), nucleocapsid (N) and/or spike (S)-genes; non-structural genes include RNA-dependent RNA polymerase (RdRp) or open reading frame1ab (ORF1ab)6 7 Some studies used only a single-gene-specific primer, and others used multiple-gene primers. Since studies published in the early phase of the pandemic reported a 3%–41% range of false negativity by RT-PCR, a repeat RT-PCR testing was suggested.8 9 Furthermore, false negativity was attributed to either mutations in the regions to which the primers bind or to sampling and laboratory practices, including collection, transportation and handling.10 Timing of sample collection with respect to the course of infection and the sample type also influence test results.11 Alternatively, the diagnosis can be made by detection of antigens (E, N or S) and/or antibodies (IgM or IgG or both) in blood samples.12 However, these tests have the potential for false positives owing to cross-reactivity with other human coronaviruses.13 14 Due to the unprecedented public health emergency, the U.S. Food and Drug Administration (FDA) authorised, on 1 June 2020, EUA requests for more than 15 diagnostic and serologic tests. Though serology testing can detect the false positives of RT-PCR tests in clinically suspected patients, its value in COVID-19 diagnosis as a complementary assay in the mitigation of the pandemic is not well defined. However, given the complexities in COVID-19 testing, there is a need for a review of performance for tests commonly used.
In this systematic review and meta-analysis, we examine testing for the diagnosis of COVID-19 in the early pandemic and evaluate the sensitivity and specificity of serological tests relative to RT-PCR tests. Our objectives were to assess the uniformity of primer usage in RT-PCR assays and evaluate whether primers used in gold-standard RT-PCR tests affect the validity of serological tests. Furthermore, we compared the performance of serological tests/platforms in approval contrived/laboratory versus real-world data.
Methods
Literature search
This research was accomplished according to standards outlined in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement.15 To find relevant studies, international databases, including PubMed, Embase, MedRiXv, BioRiXv and Google Scholar, were searched for articles published until 31 May 2020. The following search terms were used (selected using English MeSH keywords and Emtree terms): [SARS-CoV-2 AND diagnosis] OR [2019-nCoV AND diagnosis]” OR [“COVID-19 AND diagnosis] and [SARS-CoV-2 AND RT-PCR] OR, [2019-nCoV AND RT-PCR]” OR [“COVID-19 AND RT-PCR] and [SARS-CoV-2 AND serology] OR [2019-nCoV AND serology]” OR ["COVID-19 AND serology]. Additional searches were performed for references listed in the included studies.
Eligibility criteria
Relevant articles that reported diagnostic information for infected patients were included in the analysis. Preprint articles with non-peer review were considered for inclusion. Articles were excluded if appropriate information was not reported or if they were in the Chinese language. Population sample sizes of <5 participants were not included; reviews and editorials were not considered. For meta-analysis and approval versus real-world performance, studies that reported percent sensitivity/specificity without including patient numbers were also excluded.
Data extraction and report quality evaluation
Two authors (CS and VL) screened and evaluated the literature independently. Discrepancies were resolved by consensus after evaluation by a third author (MB). The following were extracted for review and meta-analysis: journal name, authors, period of publication (end of May 2020), location of study, total number of patients, tissue of origin for samples tested, whether samples were from upper or lower respiratory tract (or both), primers for RT-PCR, platforms for serology tests and antibodies tested for serology. Counts of true positives, false negatives, true negatives and false positives were used in the meta-analysis.
An author (MB) extracted and analysed the approved testing kit performance data from the following sources: FDA EUA Authorized Serology Test Performance,16 the Foundation for Innovative New Diagnostics SARS-CoV-2 diagnostic pipeline17 and package inserts provided on company websites for each product. Real-world sample testing data from kits in meta-analyses were compared against the performance of the same kits, or platforms, reported in approval documentation. Variables abstracted were study authors/test developer, name of test, test platform, and true positives, false negatives, true negatives, and false positives for each antibody or antibody combination measured (IgM, IgG, IgA, combined and Pan-Ig). Risk of bias within individual studies of meta-analysis was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool for assessment of diagnostic studies.18 QUADAS-2 has been developed specifically for evaluating bias in the meta-analyses of diagnostic test accuracy.
Patient and public involvement
Since we performed a meta-analysis and systematic review, it was not appropriate or possible to involve patients or the public in the design, conduct, reporting or dissemination plans of our research.
Statistical analysis
Statistical analyses were performed with R V.6.3.2 (12 December 2019).19 The package ‘meta’ was used for meta-analyses.20 Random effects models were used to measure sensitivity and specificity of outcomes across studies. Subgroup analysis was performed to evaluate the effect of assay, RT-PCR primer type and setting (laboratory vs real world) on serum test performance. Heterogeneity across studies and subgroups was evaluated using Cochrane’s Q statistic, and residual heterogeneity was quantified as a percentage with the I2 statistic. An I2 measure of 0% shows no observed heterogeneity, with increasing values from 0%–100% indicating higher levels of heterogeneity.21 An assumption of homogeneity was rejected for p values<0.1. The evaluation of publication bias was not possible using FDA and EU reported approval data.
Results
Search results and population characteristics
Our search generated 112 publications with potential relevance to the performance of COVID-19 diagnostic tests. After excluding duplicate publications, manuscripts that did not report numbers of patients used for sensitivity/specificity calculations and studies with a sample size of <5 patients, 85 studies were selected for qualitative synthesis of RT-PCR primer usage. From this set, a subset of 30 publications were selected for the quantitative meta-analysis of serologic versus RT-PCR diagnostic testing for COVID-19 (online supplemental table S1). Ancillary analysis compared the performance of these 30 real-world studies to that reported in laboratory approval data from 47 diagnostic serum-based tests. In all, our qualitative synthesis of RT-PCR studies included 85 studies and 21 530 patients. From this synthesis, a group of 30 studies with 10 355 patients from 5 regions of the world were selected for meta-analysis and comparison to performance from laboratory approval data (online supplemental figure S1).
Uniformity of primer usage in RT-PCR diagnostic tests
We reviewed use of single primer of structural genes as compared with use of both structural and non-structural gene primers in 56 population-based studies with 9872 participants. Overall, high proportions of studies employed both structural and non-structural gene primers in RT-PCR testing (55% (31 in 56) in studies and 56% (5484 in 9872) of total participants). Additionally, 29 studies (11 658 patients) did not report RT-PCR primer data. Single markers were most frequently tested in China and North American studies (table 1). In general, the most tested samples were from the upper respiratory tract, regardless of primer status. Sample source and location in the respiratory tract were not reported for 8%–20% of patients, and this was more common for studies using single-gene primer.
Table 1Characteristics of the studies included in qualitative analysis
Total studies | Total pop. | Structural gene primers | Structural and non-structural gene primers | Non-structural gene primers | Not reported | |||||
N studies | N pop. | N studies | N pop. | N studies | N pop. | N studies | N pop. | |||
Total | 85 | 21 530 | 22 | 4265 | 31 | 5484 | 3 | 123 | 29 | 11 658 |
Location | ||||||||||
Asia (excluding China) | 6 | 378 | 2 | 53 | 4 | 325 | ||||
China | 28 | 12 187 | 8 | 1802 | 17 | 3047 | 3 | 123 | 24 | 7215 |
Europe | 12 | 5757 | 4 | 528 | 8 | 993 | 4 | 4236 | ||
North America | 10 | 3001 | 8 | 1882 | 2 | 1119 | ||||
Global | 207 | 1 | 207 | |||||||
Primers | ||||||||||
N—single | 11 | 2016 | 11 | 2016 | ||||||
E—single | 4 | 759 | 4 | 759 | ||||||
S—single | 1 | 412 | 1 | 412 | ||||||
N, E | 2 | 226 | 2 | 226 | ||||||
S, N | 4 | 852 | 4 | 852 | ||||||
ORF1Ab, single | 2 | 59 | 2 | 59 | ||||||
RdRp, single | 1 | 64 | 1 | 64 | ||||||
E+ORF1Ab | 2 | 1119 | 2 | 1119 | ||||||
E+RdRp | 2 | 259 | 2 | 259 | ||||||
M, E | 1 | 48 | 1 | 48 | ||||||
N+ORF1Ab | 14 | 2703 | 14 | 2703 | ||||||
N+E+RdRp | 4 | 333 | 4 | 333 | ||||||
S, N, E, RdRp, ORF1ab | 1 | 13 | 1 | 13 | ||||||
N, E, ORF1ab | 1 | 33 | 1 | 33 | ||||||
N, RNAse P | 1 | 190 | 1 | 190 | ||||||
S, N, RdRp, ORF1ab, E, M | 1 | 52 | 1 | 52 | ||||||
N, S, RdRp | 1 | 273 | 1 | 273 | ||||||
N, E, S, RdRp | 2 | 349 | 2 | 349 | ||||||
S, ORF1Ab | 1 | 112 | 1 | 112 | ||||||
Sample source | ||||||||||
Upper respiratory | 23 | 6748 | 3 | 575 | 9 | 2633 | 1 | 64 | 10 | 3476 |
Upper and lower respiratory | 1 | 52 | 1 | 52 | ||||||
Upper respiratory+other* | 9 | 751 | 3 | 368 | 2 | 44 | 1 | 38 | 3 | 301 |
Lower respiratory+other* | 1 | 273 | 1 | 273 | ||||||
Upper respiratory+serum | 20 | 6407 | 7 | 1473 | 9 | 1432 | 4 | 3502 | ||
Upper respiratory+serum+other* | 4 | 941 | 2 | 840 | 1 | 80 | 1 | 21 | ||
Upper and lower respiratory+other* | 4 | 678 | 1 | 280 | 3 | 398 | ||||
Upper and lower respiratory+serum+other* | 2 | 518 | 1 | 132 | 1 | 386 | ||||
Serum | 18 | 2376 | 6 | 729 | 4 | 440 | 8 | 1207 | ||
Other* | 1 | 199 | 1 | 199 | ||||||
Not reported | 2 | 2587 | 2 | 2587 |
*Other=bronchioalveolar lavage, feces, urine, neonatal, amniotic fluid and breast milk.
E, envelope; N, nucleocapsid; N pop, patient population; ORF1ab, open reading frame1ab; RdRp, RNA-dependent RNA polymerase; S, spike.
Meta-analysis: RT-PCR versus serum antibody testing
In general, patient sera were tested for IgM and IgG antibodies. China was the region with the highest frequency of antibody testing, and lateral flow immunoassay (LFIA) and chemiluminescent immunoassay (CLIA) testing platforms were most often used. Of the 45 studies included in the qualitative synthesis, 30 manuscripts reported both serum antibody testing and RT-PCR testing for the same patients. Key characteristics of this population include: China as the regional location for research; lack of reporting of RT-PCR primer information for ~33% (10/30) of all studies; most studies used IgM and IgG serum-based antibody tests; and LFIA, CLIA and ELISA platforms were common across studies (table 2).
Table 2Characteristics of studies included in quantitative meta-analysis
N studies | N pop. | |
Total | 30 | 10 355 |
Location | ||
Asia (excluding China) | 2 | 261 |
China | 19 | 6375 |
Europe | 7 | 2900 |
North America | 2 | 819 |
PCR primers | ||
Structural | ||
N, single | 5 | 1084 |
E, single | 1 | 49 |
N, E | 1 | 201 |
N, S | 2 | 408 |
Structural and non-structural | ||
with ORF1Ab | 8 | 1115 |
with RdRp | 2 | 186 |
N, RNAse P | 1 | 190 |
not reported | 10 | 7122 |
Ab tested | ||
IgG | 2 | 220 |
IgM+IgG | 25 | 7828 |
IgA+IgG+IgM | 1 | 208 |
IgA+IgG | 1 | 37 |
not reported | 1 | 2062 |
Serum Ab | ||
CLIA | 8 | 3705 |
ELISA | 8 | 1908 |
LFIA | 10 | 3800 |
CLIA+ELISA | 2 | 548 |
LFIA+ELISA | 1 | 80 |
Not reported | 1 | 314 |
CLIA, chemiluminescent immunoassay; E, envelope; LFIA, lateral flow immunoassay; N, nucleocapsid; ORF1ab, open reading frame1ab; RdRp, RNA-dependent RNA polymerase; S, spike.
We used the IgM±IgG serum antibody test since it was most commonly used across studies. Of 1957 participants (pooled sensitivity 0.81, 95% CI 0.66 to 0.90) with a positive RT-PCR COVID-19 result, 1585 were also detected as positive with serum antibody tests. Of 3581 true negatives in RT-PCR, 3509 negatives were also found by serum antibody testing (pooled specificity 0.98, 95% CI 0.94 to 0.99). For both models, heterogeneity between studies was significant (p<0.01 for both, I2=97% and I2=98% for sensitivity and specificity, respectively).
Subanalyses of differences based on the testing platform found that sensitivity between groups differed (p<0.0001), with CLIA tests performing best (0.99, 95% CI 0.97 to 0.99); ELISA as next best (0.89, 95% CI 0.82 to 0.93); and LFIA as having the poorest sensitivity (0.67, 95% CI 0.50 to 0.81). LFIA test sensitivity also showed heterogeneity between studies (p<0.01, I2 95%). For IgM/IgG tests, specificity did not differ significantly by platform (p=0.06). However, a performance trend followed sensitivity, with LFIA underperforming (figure 1 and online supplemental figure S2).
Figure 1. Comparison of performance (sensitivity and specificity) of serology tests (IgM/IgG) based on assay platforms. CLIA, chemiluminescent immunoassay; LFIA, lateral flow immunoassay.
Serum antibody testing performance: approval data versus real-world data
For manufacturer-based, laboratory approval data, IgM±IgG testing detected COVID-19 positivity for 1045 of 1068 RT-PCR-determined ‘true’ positive patients (sensitivity 0.98, 95% CI 0.92 to 1.0). In the same group, serum testing correctly identified 1928 of 1967 (specificity 0.98, 95% CI 0.95 to 0.099) true negatives by RT-PCR. For both models (sensitivity and specificity), there was evidence of heterogeneity (p<0.01 for both and I2=93% and I2=94% for sensitivity and specificity, respectively).
We evaluated IgM±IgG serum test performance in subgroup analyses comparing laboratory approval performance data to real-world performance in study data. In manufacturer data presented for approval, serum antibody testing detected 1047 of 1068 ‘true positive’ cases of COVID-19 (sensitivity 0.98, 95% CI 0.92 to 1.0). Real-world use of serum IgM±IgG testing was evident for 2450 of 3025 participants diagnosed with COVID-19 by RT-PCR (sensitivity 0.81, 95% CI 0.66 to 0.90). For both groups, there was heterogeneity between studies (p<0.01 for both, I2=93% and I2=97% for approval and real-world specificity, respectively) (figure 2 and online supplemental figure S2). In addition, the overall sensitivity between approval and real-world testing groups differed significantly (Q=8.37, p=0.004). An analysis of specificity by the same subgroups found no significant difference between laboratory approval and real-world data. Laboratory data identified 1928 of 1967 participants with true COVID-19 negative status (specificity 0.98, 95% CI 0.95 to 0.99). Real-world data found 5437 of 5548 true negatives (specificity 0.98, 95% CI, 0.96 to 0.99) (analysis not shown).
Figure 2. Comparison of sensitivity of laboratory setting versus real-world setting of RT-PCR and serology (IgM/IgG) kits.
Since, in IgM±IgG tests, there were differences in sensitivity between platforms, we evaluated the effect of approval-based data versus real-world data by the type of platform. In an analysis stratified for ELISA, CLIA and LFIA, there was no significant difference in specificity between approval and real-world data (data not shown). However, for ELISA tests, real-world capacity to detect true positives was lower than in laboratory-based analyses. In real-world studies, the sensitivity of ELISA was 0.89 (95% CI, 0.82 to 0.93), different from laboratory sensitivity for the same platform (0.94, 95% CI 0.91 to 0.96, Q=4.74, p=0.03). The LFIA platform also showed a trend of lower real-world sensitivity (0.67, 95% CI, 0.50 to 0.81) compared with laboratory approval sensitivity (0.99, 95% CI 0.90 to 0.99, Q=8.56, p 0.003). Laboratory/real-world groups for CLIA platforms were too small to be tested reliably (1 and 2 groups, respectively).
Serum antibody testing performance: effect of primer choice on test validity
To evaluate the reliability of RT-PCR as a gold standard for serum-based test performance, we tested the consequences of using structural and non-structural primers in RT-PCR reference testing of serum. Analyses were divided into three subgroups based on antibody targets: IgM, IgG and IgG±IgM combined. In IgM and combined IgG±IgM testing, the primer choice had no significant influence on sensitivity or specificity. However, for IgG antibody tests, use of both a structural and a non-structural gene-specific primers in RT-PCR resulted in reduced sensitivity for serum testing (figure 3 and online supplemental figure S2), Q=6.17, p=0.013). Furthermore, although not statistically significant, the sensitivity of both IgM and IgG±IgM combined data sets was lower when using a referent RT-PCR test with both primer types.
Figure 3. The effect of primer choice in RT-PCR referent on sensitivity of tests based on serum IgG.
Discussion
Because of the highly infectious nature of COVID-19, a prompt, accurate and early diagnosis is necessary to deal with the ongoing pandemic, for such diagnoses can help reduce the spread of infection and its associated risk for mortality. Currently, the COVID-19 diagnosis is generally based on RT-PCR assays.8 Alternative methods such as antigen-based and antibody-based serology tests, although available, have uncertain value. The current systematic review and meta-analysis addresses the challenges encountered in the diagnosis of COVID-19 by various methods. It also analyses differences between the FDA-approved EUA data and real-world data. There is worldwide non-uniformity in the performance of RT-PCR, including the number and types of primers and reagents used for COVID-19 diagnosis, which raise questions about its generalised applicability. Similarly, the studies based on serological tests showed diagnostic inaccuracies owing to individual differences in mounting an immune response as well as dependency on the time duration after the onset of symptoms. Overall, the sensitivity between RT-PCR and serology tests was 0.81 (95% CI, 0.66 to 0.90), and specificity was 0.98 (95% CI, 0.94 to 0.99). Among the various platforms for serodiagnosis, the highest sensitivity was exhibited by ELISA, followed by CLIA and LFIA. Furthermore, use of primers (structural, non-structural or both) had a variable effect on sensitivity based on antibody targets. Sensitivity was significantly higher for IgG serology tests using structural primer-only RT-PCR tests as a referent. Serology tests had higher sensitivity for approval-based data than for real-world reporting. This difference was significant for ELISA-based platforms, and a non-significant trend towards inflated approval-based sensitivity was evident for both CLIA and LFIA platforms. These observations highlight the inconsistencies/challenges in the COVID-19 diagnosis by RT-PCR, which is the current gold standard, as well as in serologic testing.
For RT-PCR assays, the targets in SARS-CoV-2 include structural genes such as E, N and S, and nonstructural genes, including that for RdRp or ORF1ab.22 In the early phase of the pandemic, some studies used a two-step diagnosis that included an initial screening phase using structural genes followed by a confirmatory phase using nonstructural genes.6 7 23 The test is considered positive when both structural and non-structural markers are positive.24 25 However, currently both types of primers are used simultaneously to diagnose COVID-19. The viral load or copy number of the viral genome is expressed as a Ct value, which when <37 is indicative of a positive test, and a value of ≥40 is considered negative. A Ct value between >37 and<40 requires repetition of RT-PCR analysis to confirm the diagnosis.24 However, the Ct value range varies widely according to assays and laboratory practices. A COVID-19-RdRp/Hel assay has a higher sensitivity than a conventional RdRp-P2 assay irrespective of the type of sample.26 Overall, higher proportions of studies (58%) employed both structural and non-structural gene primers in RT-PCR testing. Single markers were used in some Chinese and North American studies. These findings are indicative of non-uniformity in the RT-PCR methodology. We note that half of the positive, symptomatic patients became negative by the second week, when they became asymptomatic. In contrast, the asymptomatic, positive patients became negative 2 days after hospital admission, indicating the importance of a temporal factor in COVID-19 diagnosis by RT-PCR.27 28
Published in the early phase of the pandemic, 11 of 85 studies had clinically suspected COVID-19 patients. In these studies, the average test positivity by RT-PCR, regardless of the sample source, was 44% (online supplemental table S1), and test sensitivity was influenced by sample source (upper vs lower respiratory vs other samples), issues related to testing performance, and delay after onset of symptoms.29 In the early phase of the COVID-19 pandemic, for studies evaluating suspected COVID-19 cases, the total positive RT-PCR for throat swabs was in the range of 30%–60% at initial presentation.8 30 One study reported a yield of 72%–93% positive cases for lower respiratory samples (bronchioalveolar lavage and sputum) as compared with 32%–63% positivity for upper respiratory samples (oral and nasopharyngeal swabs) and 29% for stool samples.29 Hence, a negative COVID-19 test based only on an upper respiratory sample at a single timepoint is questionable. For most studies, the testing sample was from the upper respiratory tract, regardless of primer type used. However, the sample source was not reported for 8%–20% of patients, which was more common for studies using only structural gene primers. For stool samples testing positive for COVID-19, 66.7% also tested positive on pharyngeal swabs. Of the stool samples, 64.3% remained positive after pharyngeal clearance of the virus.31 In contrast, none of the patients showed a positive test on upper respiratory samples after the anal swabs tested negative.31 These findings raise concerns about whether patients with negative respiratory swabs are truly virus-free, and sampling of additional body sites is needed. As determined by various studies, the performance of the RT-PCR depends on usage of comparable protocols, including primers and reagents.32 Additionally, it is unknown whether the currently used RT-PCR primers detect all SARS-Cov-2 strains.
The specific immune response to SARS-CoV-2 can be measured by serological testing. Several rapid serological tests, including point-of-care tests, are being developed. Even though some of these tests have been approved by the FDA through EUA, their accuracy needs to be validated.33 A minimum of 1–2 weeks after the onset of infection is needed for seroconversion. Hence, antibody testing is of no value in the early phase of infection. Additionally, its value is limited by its cross-reactivity with other coronaviruses.34 35 The initial RT-PCR positivity during the early stages (<15 days) of SARS-CoV-2 infection declines to 66.7% in the later phase (15–39 days), during this period, the antibody test can supplement RT-PCR in the diagnosis of COVID-19.34 35 Additionally, serology testing becomes valuable for clinically suspected and RT-PCR negative (false-negative) individuals.
This research has limitations. Due to the dynamic reporting of COVID-19 testing research and inconsistencies in reporting of predictive variables across studies, bias in sampling may have some effect on our results. Patient flow analysis suggests that lack of consistent RT-PCR reference standard given to patients in the same study, as well as the unclear reporting of patient selection methods could contribute to bias in these results (online supplemental figure S3). In addition, the observed heterogeneity between studies in the meta-analysis suggests that we must consider the possibility that the differences in results may be due to chance. Lastly, it is questionable to compare two separate testing methods of RT-PCR and seroprevalence in sensitivity/specificity analysis. In particular, given the relationship between time since diagnosis and accuracy of serology testing, a contributor to the observed differences in performance is time. Furthermore, because of each diagnostic kit having differing cut points for positive/negative, threshold effect as a source of heterogeneity in sensitivity and specificity cannot be ruled out.
The effective containment of COVID-19 involves accurate diagnoses and isolation of SARS-CoV-2-infected persons. Robustness of the assays/platforms is determined by variability of the samples, primers and reagents used. Serological tests alone are of value only during the latter times of infection; however, they complement RT-PCR when used in conjunction and minimise false-negative RT-PCR results. Additionally, some of the approved serological assays/platforms, particularly those developed using contrived/laboratory data, perform poorly when applied to real-world samples. We are currently in a new phase of the pandemic, and there is a need for a reliable/robust diagnostic test to mitigate the spread of COVID-19.
Our analyses of studies published in the early phase of the pandemic have highlighted issues related to COVID-19 diagnosis that need to be addressed as follows: (1) The high mutational rate exhibited by the SARS-CoV-2 virus may lead to the generation of new strains. Therefore, like for influenza virus, the existing diagnostic kits need to be modified constantly to optimise the detection of new strains; (2) Though RT-PCR diagnosis of COVID-19 is the gold standard, its combination with a serologic test may increase the accuracy of SARS-CoV-2 detection; (3) Approval agencies must account for the type of data (contrived vs real world) presented by diagnostic kit developer; (4) Although agencies employed EUA processes for the approval of diagnostic kits, there is a need to monitor their performance and assess their robustness in real-world samples, to permit continued use of these kits; and (5) Standardised assay protocols need to be developed and continually updated to mitigate the COVID-19 pandemic.
We thank Dr. Donald Hill for his editorial assistance.
Data availability statement
Data are available upon reasonable request. All data used in this research was generated from publicly available published manuscripts.
Ethics statements
Patient consent for publication
Not applicable.
Ethics approval
Not applicable.
CS and MB contributed equally.
Contributors CS, MB and UM made substantial contributions to the conception and design of the work. CS, MB and VL contributed to the acquisition of the study data. Data were analysed by CS, MB and UM. MB was responsible for the statistical analysis. CS, MB, SML, SV, GJN and UM contributed to the interpretation of the data, and to drafting and revising of the manuscript. CS, MB and VL have accessed and verified the data. UM serves as the guarantor of overall content. All authors agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work. All authors have read and approved the manuscript.
Funding This study was supported, in part, by grant 5U54CA118948 and institutional/departmental impact funds (UM), and by CA047888, the UAB Cancer Prevention and Control Training Program (MB).
Competing interests None declared.
Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
1 Chan JF-W, Yuan S, Kok K-H, et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet 2020; 395: 514–23. doi:10.1016/S0140-6736(20)30154-9 http://www.ncbi.nlm.nih.gov/pubmed/31986261
2 Naming the coronavirus disease (COVID-19) and the virus that causes it, 2020. Available: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it. [Accessed 10 Jun 2020 ].
3 Huang C, Wang Y, Li X, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020; 395: 497–506. doi:10.1016/S0140-6736(20)30183-5 http://www.ncbi.nlm.nih.gov/pubmed/31986264
4 Rothan HA, Byrareddy SN. The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak. J Autoimmun 2020; 109: 102433. doi:10.1016/j.jaut.2020.102433 http://www.ncbi.nlm.nih.gov/pubmed/32113704
5 Cheng MP, Papenburg J, Desjardins M, et al. Diagnostic testing for severe acute respiratory syndrome-related coronavirus 2: a narrative review. Ann Intern Med 2020; 172: 726–34. doi:10.7326/M20-1301 http://www.ncbi.nlm.nih.gov/pubmed/32282894
6 Chu DKW, Pan Y, Cheng SMS, et al. Molecular diagnosis of a novel coronavirus (2019-nCoV) causing an outbreak of pneumonia. Clin Chem 2020; 66: 549–55. doi:10.1093/clinchem/hvaa029 http://www.ncbi.nlm.nih.gov/pubmed/32031583
7 Corman VM, Landt O, Kaiser M, et al. Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR. Euro Surveill 2020; 25. doi:10.2807/1560-7917.ES.2020.25.3.2000045
8 Ai T, Yang Z, Hou H, et al. Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology 2020; 296: E32–40. doi:10.1148/radiol.2020200642 http://www.ncbi.nlm.nih.gov/pubmed/32101510
9 Xie X, Zhong Z, Zhao W, et al. Chest CT for typical coronavirus disease 2019 (COVID-19) pneumonia: relationship to negative RT-PCR testing. Radiology 2020; 296: E41–5. doi:10.1148/radiol.2020200343 http://www.ncbi.nlm.nih.gov/pubmed/32049601
10 Tahamtan A, Ardebili A. Real-Time RT-PCR in COVID-19 detection: issues affecting the results. Expert Rev Mol Diagn 2020; 20: 453–4. doi:10.1080/14737159.2020.1757437 http://www.ncbi.nlm.nih.gov/pubmed/32297805
11 Kim JY, Ko JH, Kim Y, et al. Viral load kinetics of SARS-CoV-2 infection in first two patients in Korea. J Korean Med Sci 2020; 35: e86. doi:10.3346/jkms.2020.35.e86 http://www.ncbi.nlm.nih.gov/pubmed/32080991
12 Guo L, Ren L, Yang S, et al. Profiling early humoral response to diagnose novel coronavirus disease (COVID-19). Clin Infect Dis 2020; 71: 778–85. doi:10.1093/cid/ciaa310 http://www.ncbi.nlm.nih.gov/pubmed/32198501
13 To KK-W, Tsang OT-Y, Leung W-S, et al. Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect Dis 2020; 20: 565–74. doi:10.1016/S1473-3099(20)30196-1 http://www.ncbi.nlm.nih.gov/pubmed/32213337
14 Wang W, Xu Y, Gao R, et al. Detection of SARS-CoV-2 in different types of clinical specimens. JAMA 2020; 2: 1843–4. doi:10.1001/jama.2020.3786
15 Moher D, Liberati A, Tetzlaff J, et al. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Int J Surg 2010; 8: 336–41. doi:10.1016/j.ijsu.2010.02.007 http://www.ncbi.nlm.nih.gov/pubmed/20171303
16 EUA authorized serology test performance, 2020. Available: https://www.fda.gov/medical-devices/coronavirus-disease-2019-covid-19-emergency-use-authorizations-medical-devices/eua-authorized-serology-test-performance [Accessed 18 Jun 2020 ].
17 SARS-CoV-2 diagnostic pipelineSARS-CoV-2 diagnostic pipeline. Foundation for innovative new diagnostics, 2020. Available: https://www.finddx.org/covid-19/pipeline/ [Accessed 5 Jul 2020 ].
18 Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011; 155: 529–36. doi:10.7326/0003-4819-155-8-201110180-00009 http://www.ncbi.nlm.nih.gov/pubmed/22007046
19 R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing, 2019.
20 Balduzzi S, Rücker G, Schwarzer G. How to perform a meta-analysis with R: a practical tutorial. Evid Based Ment Health 2019; 22: 153–60. doi:10.1136/ebmental-2019-300117 http://www.ncbi.nlm.nih.gov/pubmed/31563865
21 Higgins JPT, Thompson SG, Deeks JJ, et al. Measuring inconsistency in meta-analyses. BMJ 2003; 327: 557–60. doi:10.1136/bmj.327.7414.557 http://www.ncbi.nlm.nih.gov/pubmed/12958120
22 Yin C. Genotyping coronavirus SARS-CoV-2: methods and implications. Genomics 2020; 112: 3588–96. doi:10.1016/j.ygeno.2020.04.016 http://www.ncbi.nlm.nih.gov/pubmed/32353474
23 Detection of 2019 novel coronavirus (2019-nCoV) in suspected human cases by RT-PCR who, 2019. Available: https://www.who.int/docs/default-source/coronaviruse/peiris-protocol-16-1-20.pdf?sfvrsn=af1aac73_4
24 Liu R, Han H, Liu F, et al. Positive rate of RT-PCR detection of SARS-CoV-2 infection in 4880 cases from one hospital in Wuhan, China, from Jan to Feb 2020. Clin Chim Acta 2020; 505: 172–5. doi:10.1016/j.cca.2020.03.009 http://www.ncbi.nlm.nih.gov/pubmed/32156607
25 Li X, Geng M, Peng Y, et al. Molecular immune pathogenesis and diagnosis of COVID-19. J Pharm Anal 2020; 10: 102–8. doi:10.1016/j.jpha.2020.03.001 http://www.ncbi.nlm.nih.gov/pubmed/32282863
26 Chan JF-W, Yip CC-Y, To KK-W, et al. Improved Molecular Diagnosis of COVID-19 by the Novel, Highly Sensitive and Specific COVID-19-RdRp/Hel Real-Time Reverse Transcription-PCR Assay Validated In Vitro and with Clinical Specimens. J Clin Microbiol 2020; 58. doi: doi:10.1128/JCM.00310-20. [Epub ahead of print: 23 04 2020 ]. http://www.ncbi.nlm.nih.gov/pubmed/32132196
27 Sethuraman N, Jeremiah SS, Ryo A. Interpreting diagnostic tests for SARS-CoV-2. JAMA 2020; 323: 2249–51. doi:10.1001/jama.2020.8259 http://www.ncbi.nlm.nih.gov/pubmed/32374370
28 Chen J, Qi T, Liu L, et al. Clinical progression of patients with COVID-19 in Shanghai, China. J Infect 2020; 80: e1–6. doi:10.1016/j.jinf.2020.03.004 http://www.ncbi.nlm.nih.gov/pubmed/32171869
29 Wang W, Xu Y, Gao R, et al. Detection of SARS-CoV-2 in different types of clinical specimens. JAMA 2020; 2: 1843–4. doi:10.1001/jama.2020.3786 http://www.ncbi.nlm.nih.gov/pubmed/32159775
30 Yang Y, Yang M, Shen C. Evaluating the accuracy of different respiratory specimens in the laboratory diagnosis and monitoring the viral shedding of 2019-nCoV infections. medRxiv 2020: 02.11.20021493.
31 Chen Y, Chen L, Deng Q, et al. The presence of SARS-CoV-2 RNA in the feces of COVID-19 patients. J Med Virol 2020; 92: 833–40. doi:10.1002/jmv.25825 http://www.ncbi.nlm.nih.gov/pubmed/32243607
32 NHS. Guidance and standard operating procedure: COVID-19 virus testing in NHS laboratories, 2020. Available: https://www.england.nhs.uk/coronavirus/wp-content/uploads/sites/52/2020/03/guidance-and-sop-covid-19-virus-testing-in-nhs-laboratories-v1.pdf [Accessed 2 Jun 2020 ].
33 Maxmen A. The researchers taking a gamble with antibody tests for coronavirus. Nature 2020. doi:10.1038/d41586-020-01163-5
34 Zhao J, Yuan Q, Wang H, et al. Antibody responses to SARS-CoV-2 in patients with novel coronavirus disease 2019. Clin Infect Dis 2020; 71: 2027–34. doi:10.1093/cid/ciaa344 http://www.ncbi.nlm.nih.gov/pubmed/32221519
35 WHO. Laboratory testing for coronavirus disease 2019 (COVID-19) in suspected human cases: interim guidance. WHO, 2020.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. http://creativecommons.org/licenses/by-nc/4.0/ This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ . Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
Accurate detection of SARS-CoV-2 is necessary to mitigate the COVID-19 pandemic. However, the test reagents and assay platforms are varied and may not be sufficiently robust to diagnose COVID-19.
Methods
We reviewed 85 studies (21 530 patients), published from five regions of the world, to highlight issues involved in the diagnosis of COVID-19 in the early phase of the pandemic. All relevant articles, published up to 31 May 2020, in PubMed, BioRiXv, MedRiXv and Google Scholar, were included. We evaluated the qualitative (9749 patients) and quantitative (10 355 patients) performance of RT-PCR and serologic diagnostic tests for real-world samples, and assessed the concordance (5538 patients) between test performance in meta-analyses. Synthesis of results was done using random effects modelling and bias was evaluated according to QUADAS-2 guidelines.
Results
The RT-PCR tests exhibited heterogeneity in the primers and reagents used. Of 1957 positive RT-PCR COVID-19 participants, 1585 had positive serum antibody (IgM±IgG) tests (sensitivity 0.81, 95% CI 0.66 to 0.90). While 3509 of 3581 participants RT-PCR negative for COVID-19 were found negative by serology testing (specificity 0.98, 95% CI 0.94 to 0.99). The chemiluminescent immunoassay exhibited the highest sensitivity, followed by ELISA and lateral flow immunoassays. Serology tests had higher sensitivity and specificity for laboratory approval than for real-world reporting data.
Discussion
The robustness of the assays/platforms is influenced by variability in sampling and reagents. Serological testing complements and may minimise false negative RT-PCR results. Lack of standardised assay protocols in the early phase of pandemic might have contributed to the spread of COVID-19.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 Pathology, MNR Medical College and Hospital, Sangareddy, Telangana, India
2 Department of Pathology at UAB, The University of Alabama at Birmingham School of Medicine, Birmingham, Alabama, USA
3 Sri Guru Ram Das Institute of Medical Sciences and Research, Amritsar, Punjab, India
4 Department of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
5 Pathology, The University of Alabama at Birmingham School of Medicine, Birmingham, Alabama, USA