Skip to main content

Evaluation of serological lateral flow assays for severe acute respiratory syndrome coronavirus-2

A Correction to this article was published on 01 July 2021

This article has been updated



COVID-19 has resulted in significant morbidity and mortality worldwide. Lateral flow assays can detect anti-Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) antibodies to monitor transmission. However, standardized evaluation of their accuracy and tools to aid in interpreting results are needed.


We evaluated 20 IgG and IgM assays selected from available tests in April 2020. We evaluated the assays’ performance using 56 pre-pandemic negative and 56 SARS-CoV-2-positive plasma samples, collected 10–40 days after symptom onset, confirmed by a molecular test and analyzed by an ultra-sensitive immunoassay. Finally, we developed a user-friendly web app to extrapolate the positive predictive values based on their accuracy and local prevalence.


Combined IgG + IgM sensitivities ranged from 33.9 to 94.6%, while combined specificities ranged from 92.6 to 100%. The highest sensitivities were detected in Lumiquick for IgG (98.2%), BioHit for both IgM (96.4%), and combined IgG + IgM sensitivity (94.6%). Furthermore, 11 LFAs and 8 LFAs showed perfect specificity for IgG and IgM, respectively, with 15 LFAs showing perfect combined IgG + IgM specificity. Lumiquick had the lowest estimated limit-of-detection (LOD) (0.1 μg/mL), followed by a similar LOD of 1.5 μg/mL for CareHealth, Cellex, KHB, and Vivachek.


We provide a public resource of the accuracy of select lateral flow assays with potential for home testing. The cost-effectiveness, scalable manufacturing process, and suitability for self-testing makes LFAs an attractive option for monitoring disease prevalence and assessing vaccine responsiveness. Our web tool provides an easy-to-use interface to demonstrate the impact of prevalence and test accuracy on the positive predictive values.

Peer Review reports


Coronavirus disease 2019 (COVID-19), caused by infection with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was declared a global pandemic by on March 11th, 2020 [1], with a second wave of the pandemic well underway [1]. However, accurate estimates of transmission rely on accurate and widely distributed population immunosurveillance tools to measure SARS-CoV-2 infection in diverse community settings. Among SARS-CoV-2-infected individuals, 40–45% are estimated to remain asymptomatic [2], suggesting that prevalence is likely underestimated [3]. Therefore, detecting prior exposure to SARS-CoV-2 as opposed to other viral infections or other coronavirus strains is crucial [4].

There are different types of clinical SARS-CoV-2 tests. Diagnostic testing relies on reverse-transcriptase quantitative polymerase chain reaction (RT-qPCR) and antigen-based immunodiagnostics to detect active infection [5]. Conversely, serological tests are useful for monitoring population prevalence and prior exposure by measuring antibodies against SARS-CoV-2 [6,7,8]. These include enzyme-linked immunosorbent assays (ELISAs), chemiluminescence assays, and lateral flow assays (LFAs) [5, 9, 10]. LFAs are attractive for home testing and population surveillance, since they are affordable, scalable, rely on easily accessible specimens such as fingerstick whole blood and give a result readout within minutes [7]. Since multiple vaccines received emergency use authorization [11], serological assays could be used to determine whether vaccines elicit a detectable and durable immune response [12,13,14,15,16]. Hence, easy-to-use LFAs will have important applications in the upcoming phases of the pandemic. Since the onset of the COVID-19 epidemic, multiple studies evaluated the accuracy of serological tests [9, 17,18,19,20,21]. Many of these tests received Emergency Use Authorization (EUA) through the Food and Drug Administration (FDA) [22].

Despite the utility of SARS-CoV-2 antibody tests, misinterpretation of results is very likely [23]. A negative serological test result does not preclude prior infection since seroconversion occurs 9–11 and 18–20 days after symptoms onset for IgM and IgG antibodies, respectively [10, 24]. Conversely, positive results do not indicate active infection [23]. Furthermore, the prevalence of SARS-CoV-2 is highly variable [1, 3], and known to directly impact the predictive value of a test result. A higher prevalence increases the likelihood that a positive test result indicates a real infection (i.e. higher positive predictive value) [25], but will also decrease the negative predictive value, resulting in more false negative results [25]. Therefore, accessible tools to assist the public with interpreting results based on test accuracy and different prevalence scenarios are critical [23, 26].

In April 2020, the Mass General and Brigham Center for COVID Innovation direct-to-consumer working group scanned available serological assays and selected 20 lateral flow assays, based on reported assay characteristics and supply chain availability [27]. The LFAs were evaluated by blinded operators using the same samples to standardize the evaluation of their accuracy. Additionally, we developed a user-friendly web-tool to assist the end user to interpret their results. This study provides both the evaluation data to serve as a public resource to guide implementation of LFAs, and the tool to aid the interpretation of home testing results.


Sample procurement

We procured 56 SARS-CoV-2-positive, 46 pre-pandemic SARS-CoV-2-negative, and 10 SARS-CoV-2-negative HIV-positive EDTA plasma samples. SARS-CoV-2 positive EDTA plasma samples were obtained from clinical discards banked within 24–72 h of collection at the Crimson Core of the MGB Biobank, which was composed of hospitalized symptomatic patients. All samples had positive SARS-CoV-2 PCR results using an EUA approved test at Brigham and Women’s Hospital (Panther Fusion SARS-CoV-2 assay, Hologic, Inc., San Diego, CA or Xpert Xpress SARS-CoV-2, Cepheid, Inc., Sunnyvale, CA) or the Clinical Research Sequencing Platform at the Broad Institute of MIT and Harvard (in house Laboratory Developed Test) 10–40 days prior to sample collection. The participants’ charts were reviewed by study staff to identify samples collected ≥10 days after onset of symptoms and to exclude immunosuppressed participants, after which samples were anonymized and stripped of protected health information. Pre-pandemic negative control samples were randomly selected from healthy participants with a Charlson Age-Comorbidity Index [28] score ≤ 2 and EDTA plasma banked in the MGB Biobank between Jan 1-Dec 1, 2019 from inpatients. HIV-positive control samples were obtained from EDTA plasma samples banked prior to January 2020 in a study on neuropathic pain in HIV. All HIV-positive participants were on antiretroviral therapy. For 8 out of the 10 HIV-positive samples, viral load quantification was available and showed 256 copies/ml or less, and 5 showed loads either undetectable or under 20 copies/ml. All samples were collected from consented individuals. Sample and data collection conformed to Good Clinical Practice guidelines and Declaration of Helsinki. The study was approved under the Massachusetts General Brigham (MGB) Institutional Review Board (protocol no. 2020P001204).

Lateral flow assays (LFAs)

Twenty commercial IgG/IgM lateral flow assays (LFAs) from 18 manufacturers were evaluated (Supplementary Table 1). LFAs were analyzed by blinded operators according to manufacturer instructions for use (IFU), with the exception of using micropipettes instead of manufacturer-provided droppers to minimize technical variability. Samples were thawed on ice, randomized, and brought to room temperature. Kit components were also brought to room temperature. The IFU-specified volumes of sample and buffer were added to the cassette. Specified sample volumes varied for different LFAs but were typically in the 5-20 μL range. The cassettes were run at room temperature on a flat surface and results read immediately after the time interval defined in the IFU (typically ranging from 10 to 15 min). Each cassette was independently scored by two blinded raters as either “positive,” “negative,” or “invalid”. Ratings were designated according to the interpretation guidelines outlined in each individual IFU. Each cassette was photographed under four standardized illumination conditions and viewpoints for future analysis.

Reproducibility testing

For inter-operator reproducibility analysis, separate pools of EDTA plasma were obtained from > 30 pre-pandemic healthy individuals (negative pool) and > 30 convalescent participants collected after symptom resolution at the Massachusetts General Hospital (MGH) respiratory illness clinic (positive pool). Convalescent samples for the positive pool were confirmed to be positive using the COBAS SARS-CoV-2 PCR test (Roche Diagnostics, Indianapolis, IN) at MGH. A total of 20 replicates per pool were run by two independent pairs of blinded operators, alternating between positive and negative pools (10 replicates per pair). Reproducibility was calculated according to agreement between operator ratings as well as concordance of readout with sample pool COVID status.

Sensitivity and specificity testing

Our cohort of 112 EDTA plasma samples was used across all 20 LFAs to evaluate performance: sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV). Samples were sub-aliquoted throughout the analysis to minimize freeze-thaw cycles. Binary presence/absence classifications were used, and discordant calls were resolved by a third operator inspecting photographs taken of the relevant LFA.

LFA usability

In addition to initial screening [27], each LFA kit was assessed for consumer usability based on complexity of kit materials, sample requirements, and IFU clarity. Supplied kit components were documented for completeness and examined for ease-of-use. IFU protocols were rated on a scale from 0 to 14 according to a predefined rubric (Supplementary Table 2) by three independent raters. Usability evaluations are shown in Supplementary Table 3. Sample input requirements for each LFA are in Supplementary Table 1.

Ultrasensitive Simoa serology assays

Plasma samples were diluted 4000-fold, and the total IgG and IgM levels against the SARS-CoV-2 spike protein were measured using a custom Single Molecule Array (Simoa) assay as described [29] on an automated HD-X Analyzer (Quanterix, Billerica, MA, USA), to provide a quantitative reference for anti-spike antibody titers in the plasma samples. Normalized mean Average Enzymes per Bead (AEB) levels were calculated using a standard set of calibrators produced by serially diluting a large volume of plasma from seroconverted individuals. Antibody concentrations were estimated using a calibration curve of recombinant anti-SARS-CoV-2 antibodies [30].

Analysis and Webapp

Detailed methods are in the supplementary.


Study population

We obtained plasma samples from 56 pre-pandemic patients, including 10 HIV+, and 56 symptomatic inpatients in March and April 2020. HIV-negative samples were matched for sex and age. The overall study population included 25.9% Blacks, 4.5% Latinx, 9.8% Asians, and 43.8% Non-Hispanic Whites (Table 1). COVID+ samples were from individuals between 10 and 40 days post symptom onset, with 60.7% samples taken between 2 and 4 weeks. Of all the COVID+ participants, 7 (12.5%) were symptomatic outpatients and 49 (87.5%) were hospitalized. Among those hospitalized, 25 (51%) required intensive care unit (ICU) treatment and 4 (8.2%) were deceased at the time of chart review (Supplementary Table 4). Additional mortalities were possible after chart review since some participants were in critical condition in the ICU.

Table 1 Demographic information of all individuals whose plasma samples were used for this study. Individuals are broken out by COVID+/− and HIV+/− status. Black includes one mixed African American in the COVID-HIV+ group

LFA performance: reproducibility

Four independent operators working in teams of two on separate days applied each pool 10 times to each LFA, with processing as dictated by the instructions for use (IFU). API version1 LFAs were not assessed for reproducibility due to limited cassette availability and the high volume required. Of the remaining 19 LFAs, three (BTNX, Camtech, and Carehealth) had 100% consistent correct outcomes across both isotypes, with an additional three (BioHit, Zhuhai Livzon, and Phamatech) having no incorrect or inconsistent outcomes with one or two invalid tests (Fig. 1). IgG was the more reproducible isotype. The majority of incorrect consistent calls came from operators calling a COVID+ sample IgM negative (Fig. 1).

Fig. 1

LFA reproducibility. Colors represent LFA outcome performed across two days (A & B) by two independent pairs of operators on a COVID positive (COVID+) pooled sample and a COVID negative (COVID-) pooled sample. On each day ten technical replicates of each pool were performed. Blue represents replicates where both operators that day agreed on the outcome, and the outcome was correctly called positive or negative. Red represents occasions where both operators agreed, but the outcome was incorrectly called. Orange represents replicates where the operators did not agree on the LFA outcome. Light yellow represents an invalid test where control bands did not meet criteria for a valid test. IgG was the most reproducible isotype for all LFAs except Oranoxis, and consistent false negatives were common occurrences for IgM for the majority of LFAs

LFA performance: practicalities of use

We assessed the LFAs according to this rubric (Supplementary Table 2) by three independent raters and assigned a composite score on a scale of 0–14 (Supplementary Table 3). Five LFAs (BioHit, InTec, Lumiquick, Phamatech, and U2U) received full scores for IFU clarity. LFAs frequently lost points for imprecise instructions regarding time between adding sample and having sufficient sample flow to read the results. Important kit usability criteria, such as whether the included pipette droppers show clear volume markings, were also recorded (Supplementary Table 3). The IFU ambiguity led to administration of too much or little sample for a valid test. While these IFUs may not yet be intended for the general public, it will be important to clarify the instructions moving forward and include droppers to minimize potential for sample volume errors.

LFA performance: sensitivity and specificity

To focus on LFA specificity, given the likely use of these tests in low-prevalence settings, disagreement between operators was interpreted as a negative call. Across all but three of the LFAs (Biohit, BTNX, and Vivacheck), sensitivity was higher for IgG than IgM. Sensitivity for IgG ranged from 98.2% (Lumiquick) down to 72.7% (Oranoxis), and for IgM from 96.4% (BioHit) to 23.2% (Oxo and U2U) (Table 2). LFA specificity was much higher for both isotypes, with 11 LFAs having a specificity of 100% for IgG (API, API v2, BTNX, Camtech, Genobio, Oranoxis, Phamatech, Ray Biotech, Ray Biotech v2, U2U, and Zhuhai Livzon), and 8 LFAs having a specificity of 100% for IgM (API, CareHealth, Cellex, Lumiquick, Oranoxis, Ray Biotech v2, U2U, and Zhuhai Livzon) (Table 2). Under the assumption that the likelihood of two randomly occurring false positives for any one individual is low, an IgG/IgM composite score (averaged operator scores, see Supplementary Methods) was produced to maximize test specificity. Using this composite score, all LFAs except BioHit, Cellex, Edinburgh, InTec, and Vivachek achieved a specificity of 100%. This result underscores the potential of considering the outcome in both isotypes to minimize false positives, although it is more likely that a single isotype will be used in clinical testing.

Table 2 LFA performance in terms of sensitivity and specificity

We created heatmaps to visualize individual sample outcomes across all LFAs to assess whether we systematically detected the same miscalls across multiple LFAs (Fig. 2). False negatives (blue squares in the COVID+ panel) amongst COVID+ patients were somewhat reproducible, with three COVID+ samples called negative in both isotypes by all, or all but one LFA. These miscalls were not clearly explained by known demographics (age, sex) or clinical variables (disease severity, weeks post symptom onset) (Fig. 2, bottom panel). To investigate whether these miscalls were related to low titers of anti-SARS-CoV-2 antibodies from participants with a suppressed immune response, all 112 samples were analyzed for anti-spike IgG and IgM antibodies using a custom quantitative Simoa assay [29]. The three samples that were called negative across almost all LFAs, which were also 2–3 weeks post-symptoms, had the lowest levels of anti-spike antibodies in COVID+ samples for both IgG and IgM, suggesting these participants had a slower or suppressed immune response to SARS-CoV-2 infection.

Fig. 2

Per-individual LFA performance. Colors represent scores computed for IgG (top panel) and IgM (lower panel), where dark red = + 1 (operators all agree a band is present) and dark blue = −1 (operators all agree there is no band) with intermediate colors (pink and light blue) representing varying degrees of operator disagreement. Grey represents invalid runs. Samples are ordered within the COVID-, COVID-HIV+, and COVID+ groups in order of decreasing average score across all LFAs for both antibodies. Clinical variables include age (< 50 yrs.: light blue, 50-69 yrs.: intermediate blue, ≥70 yrs.: dark blue), sex (blue: male, pink: female), disease severity (hospitalized: light green, ICU: orange, ICU + respirator: red, deceased: black), and weeks post symptom-onset (dark green: 1–2 weeks, lightest green: ≥5 weeks)

Unlike false negatives, most false positives (Fig. 2, red squares in COVID- panel) amongst COVID- individuals appear largely uncorrelated between LFAs. However, two samples showed IgG false positives across multiple LFAs, which may suggest long lasting antibodies from exposure to coronaviruses other than SARS-CoV-2. Nonetheless, this observation is not reflected in the antibody levels from the Simoa analysis, which showed barely detectable anti-spike antibodies in these two samples.

Defining the limit of detection for qualitative LFAs

The custom Simoa anti-spike IgG and IgM antibody assays use a standard curve to determine standardized antibody concentration in each sample [29]. The results obtained from this assay can therefore be used to estimate a limit of detection (LOD) for each of the qualitative LFAs. The cumulative number of false negative LFA calls in COVID+ samples (Fig. 3, y-axis) were computed as a function of decreasing antibody concentrations (Fig. 3, x-axis) separately for IgG and IgM. We define the LOD for each LFA/antibody as the concentration at which ≥95% of the COVID+ samples are called unambiguously positive (Supplementary Table 5). Using this definition, for IgG all LFAs (except Genobio, Oranoxis, OZO, Ray Biotech, and U2U) have an LOD within the linear range of the SIMOA assay (1–10,000 μg/mL). Lumiquick has the lowest LOD at 0.1 μg/mL, which was extrapolated by dilution to be within the linear range of the SIMOA standards, and CareHealth, Cellex, KHB, and Vivachek all have an LOD of 1.5 μg/mL (Supplementary Table 5). Considering the generally lower sensitivity observed with the IgM assays (Fig. 2), IgM assays consistently have higher LODs, with 9 exceeding 1000μg/mL. BioHit has the lowest IgM LOD at 0.6 μg/mL, and API v2, BTNX, CareHealth, and Vivachek all have LODs under 10 μg/mL.

Fig. 3

Determining the limit of detection of qualitative LFAs. Samples were ranked from highest concentration of anti-spike antibody (determined by Simoa) on the right, to lowest concentration (x-axis). As the sample concentration decreases to the left, a cumulative count of false negatives is shown on the y-axis. IgG is shown on the left, IgM on the right. Note the difference in magnitude of the y-axes between IgG and IgM; these LFAs are generally more sensitive to IgG than IgM

Interpreting test positivity with low prevalence in the general population

Low prevalence places a high burden on specificity [25]. Given the high proportion of true negative individuals in the population being studied, prevalence increases the ratio of false positive to true positive test outcomes [31]. Positive predictive values (PPV) correspond to the likelihood that a positive test result reflect true positivity as measured by a gold standard PCR result. We computed PPVs as a function of the fraction of the population infected with (and assumed to have produced antibodies to) SARS-CoV-2 (Fig. 4). Here we see that even with the conservative interpretation of LFA outcome (requiring the majority of operators to see a band to call a sample positive for either IgG or IgM), when the population seroprevalence is ~ 5% the PPV for these assays spans a large range (from ~ 30 to 100%). As the population prevalence increases, the burden on specificity is decreased, and at 50% prevalence the PPV of all LFAs is above 87.5%. Posterior PPV can be improved for most LFAs by requiring both IgG and IgM to be read as positive in order to count an individual as positive (Fig. 4, right panels).

Fig. 4

Effect of a changing disease prevalence (x axis) on the population-level PPV of each test for scores derived from IgG (left), IgM (middle), and the average score of both antibodies (right). As disease prevalence increases, the high burden on specificity of LFAs is reduced. Color coding: Grey: no improvement; Light blue: rescues one of the two antibodies; Dark blue: rescues both of the antibodies; Orange: performs worse than one of the antibodies

To visualize the effect of changing population prevalence on the PPV, we created an interactive webapp (; Fig. 5), which allows the user to extrapolate the likelihood that they do in fact have SARS-CoV-2 antibodies if they have a positive LFA result, given the infection prevalence and test accuracy. The app includes benchmark performance of all 20 LFAs evaluated here, as well as those reported in Whitman et al. [10] (filtered to remove samples taken under 10 days post symptom-onset). In an effort to further generalize the utility of this tool, we allow the user to explore the effects of assay performance under difference prevalence scenarios, and to input the reported prevalence data from specific geographic locations within the US [1], based on national, state, and county records.

Fig. 5

Example image of the interactive webapp hosted at Using the sliders, one is able to visualize the effect of changing the disease prevalence (and the test performance) on the resulting probability that, given a positive LFA test, the individual does in fact have SARS-CoV-2 antibodies. The figure shows an example of positive predictive value using the US national prevalence of 5% (on November 27th, 2020) and the performance characteristics of the InTec LFA


In this study, we report a standardized cross-evaluation of LFAs on the same pre-pandemic SARS-CoV-2-negative and PCR-confirmed SARS-CoV-2-positive samples, and rate their reproducibility, usability, and performance characteristics. Overall, the LFAs showed a higher propensity for false negative than false positive readings. Results are public: We use the Simoa technology [29] to measure the concentrations of anti-spike protein IgG and IgM antibodies and extrapolate the assay limit of detection. We also established a web tool to aid users in understanding the likelihood they have anti-SARS-CoV-2 antibodies given a positive test result. This resource of performance characteristics of several LFAs and a tool for result interpretation, can both be used for immunosurveillance and future home testing applications.

LFAs are tractable tools to estimate community seroprevalence, especially with anticipated seasonal fluctuations in the transmission dynamics of SARS-CoV-2 and other viruses that cause the common cold, which confound the symptomatologic diagnosis of COVID-19 [32]. As new waves of the COVID-19 pandemic resurge around the globe [1], and with commencing vaccinations against SARS-CoV-2 infections [12,13,14, 16], there is a renewed interest in serological tests to detect anti-SARS-CoV-2 antibodies [20, 33]. Affordable LFAs offer an attractive option for monitoring the presence and longevity of anti-SARS-CoV-2 antibodies, and determining population-level herd immunity [34]. LFAs also obviate the need for complex laboratories to process the samples [35]. As the pandemic expands to previously unexposed communities, it is critical to use simple tools to monitor exposure dynamics and seroconversion in SARS-CoV-2-exposed individuals, as well as vaccine-induced immunity [16]. We tested a mixture of LFAs targeting SARS-CoV-2 nucleocapsid, spike proteins, or both. Moderna’s mRNA-1273 and Pfizer-BioNTech COVID-19 vaccines encode SARS-CoV-2 spike proteins to induce anti-spike antibodies [14, 36]. Therefore, LFAs targeting the spike and nucleocapsid proteins of SARS-CoV-2 could be used to differentiate vaccine- and infection-induced antibody responses, respectively.

Rigorous evaluation of these LFAs by manufacturer-independent parties is important. The US FDA independently reviews medical products before commercialization. The FDA used Emergency Use Authorization (EUA) authority to accelerate the implementation of diagnostic products during the pandemic. Commercial manufacturers were required to submit a completed EUA request [22]. Unfortunately, the rush to market introduced many tests that did not meet typical US or international standards [37]. Therefore, the FDA and international regulatory agencies continue to update guidelines for authorization of new serological tests. Our evaluation plan mirrored the FDA guidelines for evaluating serological tests [22]. We included 10 HIV-positive samples to test whether they have higher false positive results in SARS-CoV-2-negative samples [38], and did not detect higher false positive results.

The mere detection of IgG or IgM responses does not guarantee that neutralizing antibodies are present at protective titers [7, 39]. The study demographics suggest a slight over-representation of African Americans among cases, as reported [40]. However, the sample size was underpowered to formally determine the effect of race on test performance. In our analysis, IgM detectability was less sensitive and reproducible than IgGs across multiple LFAs, possibly due to both lower IgM titres, and lower limits of detection for the IgM LFAs. Waning antibody responses have been reported in some SARS-CoV-2-infected individuals [41,42,43,44]. Furthermore, reported cases of re-infection with SARS-CoV-2 suggest that prior exposure, and even seroconversion, do not universally protect against SARS-CoV-2 infection [44, 45]. This could result from low antibody titers as shown in an immunocompromised patient [46], low durability of infection-induced antibodies [42,43,44], or low neutralizing potential of SARS-CoV-2 infection-induced antibodies in some individuals [34]. IgG and IgM antibodies may also target irrelevant epitopes outside the spike and receptor binding domains, and consequently be less efficient at intercepting infection by the virus [47]. The WHO cautions against interpreting presence of antibodies, even neutralizing ones, as lower risk of re-infection and transmission [48]. The presence of antibodies could be however used for rapid immunosurveillance to monitor extent of population transmission, particularly in asymptomatic but SARS-CoV-2-exposed individuals [3, 33].

One major concern about the deployment of these tests is the misinterpretation of positive results [7, 39]. As more tests move towards FDA approval of home use, clear scientific communication about the result interpretation becomes more crucial [23, 39]. A positive serological result does not necessarily mean active infection [23, 31, 49]. Although combined use of molecular and seroconversion results can be used to confirm the diagnosis of symptomatic and hospitalized individuals [35], a positive serological test in the absence of symptoms dissociates the presence of the antibodies from the time of infection [44]. Additionally, it is important to understand the implication of false positive and false negative results, particularly in the context of a low to mid-prevalence disease such as COVID-19 [25]. Low prevalence decreases the negative predictive value of a test, but increases the false positive rates [25]. A false positive serological test result may prematurely instill confidence that one has immunity against SARS-CoV-2 infection, thus resulting in behavioral changes that increase risk of transmission [50]. Hence, the probability that a person without antibodies will test negative on a serological test is more important than test sensitivity [48,49,50].

Our study presents a few limitations. Although we successfully benchmarked the performance of the LFAs to a quantitative assay [29], we did not determine the neutralizing potential of these antibodies. Secondly, samples were acquired when PCR testing was restricted to severely ill patients. For epidemiological studies and population surveillance, it will be important to evaluate assay performance on asymptomatic individuals.


Our study provides a public resource to aid researchers, healthcare providers, public health professionals, and industries impacted by the pandemic such as airlines, in choosing the appropriate serological LFAs for their intended use cases.

Availability of data and materials

Evaluation data are available at:

Web application is available at:

Change history



Average Enzymes per Beads


Coronavirus Disease of 2019


Ethylenediaminetetraacetic acid


Enzyme-linked immunosorbent assay


Emergency Use Authorization


Food and Drug Administration


Human Immunodeficiency Virus


Intensive Care Units




Immunoglobulin G


Immunoglobulin M


Lateral Flow Assay


Limit of Detection


Mass General Brigham


Mass General Hospital


Negative Predictive Value


Positive Predictive Value


Reverse Transcriptase Quantitative Polymerase Chain Reaction


Severe Acute Respiratory Syndrome Coronavirus-2


Single Molecule Array


  1. 1.

    Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis. 2020;20(5):533–4.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Oran DP, Topol EJ. Prevalence of asymptomatic SARS-CoV-2 infection : a narrative review. Ann Intern Med. 2020;173(5):362–7.

    Article  Google Scholar 

  3. 3.

    Havers FP, Reed C, Lim T, et al. Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States, March 23-May 12, 2020. [published online ahead of print, 2020 Jul 21]. JAMA Intern Med. 2020.

  4. 4.

    Ehrlich H, Boneva D, Elkbuli A. The intersection of viral illnesses: A seasonal influenza epidemic amidst the COVID-19 pandemic. Ann Med Surg (Lond). 2020;60:41–3.

    Article  Google Scholar 

  5. 5.

    D'Cruz RJ, Currier AW, Sampson VB. Laboratory testing methods for novel severe acute respiratory syndrome-Coronavirus-2 (SARS-CoV-2). Front Cell Dev Biol. 2020;8:468.

    Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Pecora ND, Zand MS. Measuring the serologic response to severe acute respiratory syndrome coronavirus 2: methods and meaning. Clin Lab Med. 2020;40(4):603–14.

    Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Peeling RW, Wedderburn CJ, Garcia PJ, Boeras D, Fongwen N, Nkengasong J, et al. Serology testing in the COVID-19 pandemic response. Lancet Infect Dis. 2020;20(9):e245–e9.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  8. 8.

    La Marca A, Capuzzo M, Paglia T, Roli L, Trenti T, Nelson SM. Testing for SARS-CoV-2 (COVID-19): a systematic review and clinical guide to molecular and serological in-vitro diagnostic assays. Reprod BioMed Online. 2020;41(3):483–99.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Pickering S, Betancor G, Galao RP, Merrick B, Signell AW, Wilson HD, et al. Comparative assessment of multiple COVID-19 serological technologies supports continued evaluation of point-of-care lateral flow assays in hospital and community healthcare settings. PLoS Pathog. 2020;16(9):e1008817.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Whitman JD, Hiatt J, Mowery CT, Shy BR, Yu R, Yamamoto TN, et al. Evaluation of SARS-CoV-2 serology assays reveals a range of test performance. Nat Biotechnol. 2020;38(10):1174–83.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  11. 11.

    Oliver SE, Gargano JW, Marin M, Wallace M, Curran KG, Chamberland M, et al. The advisory committee on immunization Practices' interim recommendation for use of Pfizer-BioNTech COVID-19 vaccine - United States, December 2020. MMWR Morb Mortal Wkly Rep. 2020;69(50):1922–4.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Jackson LA, Anderson EJ, Rouphael NG, Roberts PC, Makhene M, Coler RN, et al. An mRNA vaccine against SARS-CoV-2 - preliminary report. N Engl J Med. 2020;383(20):1920–31.

    CAS  Article  Google Scholar 

  13. 13.

    Anderson EJ, Rouphael NG, Widge AT, Jackson LA, Roberts PC, Makhene M, et al. Safety and immunogenicity of SARS-CoV-2 mRNA-1273 vaccine in older adults. N Engl J Med. 2020;383(25):2427–38.

    CAS  Article  Google Scholar 

  14. 14.

    Walsh EE, Frenck RW Jr, Falsey AR, Kitchin N, Absalon J, Gurtman A, et al. Safety and immunogenicity of two RNA-based Covid-19 vaccine candidates. N Engl J Med. 2020;383(25):2439–50.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Dagotto G, Yu J, Barouch DH. Approaches and challenges in SARS-CoV-2 vaccine development. Cell Host Microbe. 2020;28(3):364–70.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Poland GA, Ovsyannikova IG, Kennedy RB. SARS-CoV-2 immunity: review and applications to phase 3 vaccine candidates. Lancet. 2020;396(10262):1595–606.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Wang H, Ai J, Loeffelholz MJ, Tang YW, Zhang W. Meta-analysis of diagnostic performance of serology tests for COVID-19: impact of assay design and post-symptom-onset intervals. Emerg Microbes Infect. 2020;9(1):2200–11.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Caini S, Bellerba F, Corso F, Diaz-Basabe A, Natoli G, Paget J, et al. Meta-analysis of diagnostic performance of serological tests for SARS-CoV-2 antibodies up to 25 April 2020 and public health implications. Euro Surveill. 2020;25(23):2000980.

  19. 19.

    Lisboa Bastos M, Tavaziva G, Abidi SK, Campbell JR, Haraoui LP, Johnston JC, et al. Diagnostic accuracy of serological tests for covid-19: systematic review and meta-analysis. BMJ. 2020;370:m2516.

    Article  Google Scholar 

  20. 20.

    Moura DTH, McCarty TR, Ribeiro IB, Funari MP, Oliveira P, Miranda Neto AA, et al. Diagnostic Characteristics of Serological-Based COVID-19 Testing: A Systematic Review and Meta-Analysis. Clinics (Sao Paulo). 2020;75:e2212.

    Article  Google Scholar 

  21. 21.

    Gutiérrez-Cobos A, Gómez de Frutos S, Domingo García D, et al. Evaluation of diagnostic accuracy of 10 serological assays for detection of SARS-CoV-2 antibodies. Eur J Clin Microbiol Infect Dis. 2021;40(5):955-61.

  22. 22.

    FDA. EUA Authorized Serology Test Performance. United States Food and Drug Administration. 2020. Accessed 25 Dec 2020.

  23. 23.

    Bermingham WH, Wilding T, Beck S, Huissoon A. SARS-CoV-2 serology: test, test, test, but interpret with caution! Clin Med (Lond). 2020;20(4):365–8.

    Article  Google Scholar 

  24. 24.

    Long QX, Liu BZ, Deng HJ, Wu GC, Deng K, Chen YK, et al. Antibody responses to SARS-CoV-2 in patients with COVID-19. Nat Med. 2020;26(6):845–8.

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Tenny S, Hoffman MR. Prevalence. Treasure Island (FL): StatPearls; 2020.

    Google Scholar 

  26. 26.

    Theel ES, Slev P, Wheeler S, Couturier MR, Wong SJ, Kadkhoda K. The Role of Antibody Testing for SARS-CoV-2: Is There One? J Clin Microbiol. 2020;58(8):e00797–20. Published 2020 Jul 23.

  27. 27.

    Jorfi M, Luo NM, Hazra A, Herisson F, Miller M, Toombs JA, et al. Diagnostic technology for COVID-19: comparative evaluation of antigen and serology-based SARS-CoV-2 immunoassays, and contact tracing solutions for potential use as at-home products. MedRxiv. 2020.

  28. 28.

    Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373–83.

    CAS  Article  Google Scholar 

  29. 29.

    Norman M, Gilboa T, Ogata AF, Maley AM, Cohen L, Busch EL, et al. Ultrasensitive high-resolution profiling of early seroconversion in patients with COVID-19. Nat Biomed Eng. 2020;4(12):1180–7.

    CAS  Article  Google Scholar 

  30. 30.

    Tian X, Li C, Huang A, Xia S, Lu S, Shi Z, et al. Potent binding of 2019 novel coronavirus spike protein by a SARS coronavirus-specific human monoclonal antibody. Emerg Microbes Infect. 2020;9(1):382–5.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Hernaez R, Thrift AP. High negative predictive value, low prevalence, and Spectrum effect: caution in the interpretation. Clin Gastroenterol Hepatol. 2017;15(9):1355–8.

    Article  Google Scholar 

  32. 32.

    Kissler SM, Tedijanto C, Goldstein E, Grad YH, Lipsitch M. Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period. Science. 2020;368(6493):860–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  33. 33.

    Alter G, Seder R. The power of antibody-based surveillance. N Engl J Med. 2020;383(18):1782–4.

    CAS  Article  Google Scholar 

  34. 34.

    Neagu M. The bumpy road to achieve herd immunity in COVID-19. J Immunoassay Immunochem. 2020;41(6):928–45.

  35. 35.

    Nilles EJ, Karlson EW, Norman M, Gilboa T, Fischinger S, Atyeo C, et al. Evaluation of two commercial and two non-commercial immunoassays for the detection of prior infection to SARS-CoV-2. medRxiv. 2020;2020.06.24.20139006. Published 2020 June 26.

  36. 36.

    Corbett KS, Edwards DK, Leist SR, Abiona OM, Boyoglu-Barnum S, Gillespie RA, et al. SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature. 2020;586(7830):567–71.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Badnjevic A, Pokvic LG, Dzemic Z, Becic F. Risks of emergency use authorizations for medical products during outbreak situations: a COVID-19 case study. Biomed Eng Online. 2020;19(1):75.

    Article  PubMed  PubMed Central  Google Scholar 

  38. 38.

    Tan SS, Chew KL, Saw S, Jureen R, Sethi S. Cross-reactivity of SARS-CoV-2 with HIV chemiluminescent assay leading to false-positive results. J Clin Pathol. 2020:jclinpath-2020-206942.

  39. 39.

    Gill D, Ponsford MJ. Testing for antibodies to SARS-CoV-2. BMJ. 2020;371:m4288.

    Article  Google Scholar 

  40. 40.

    Kullar R, Marcelin JR, Swartz TH, Piggott DA, Macias Gil R, Mathew TA, et al. Racial disparity of coronavirus disease 2019 in African American communities. J Infect Dis. 2020;222(6):890–3.

    CAS  Article  Google Scholar 

  41. 41.

    Long QX, Tang XJ, Shi QL, Li Q, Deng HJ, Yuan J, et al. Clinical and immunological assessment of asymptomatic SARS-CoV-2 infections. Nat Med. 2020;26(8):1200–4.

    CAS  Article  PubMed  Google Scholar 

  42. 42.

    Ibarrondo FJ, Fulcher JA, Goodman-Meza D, Elliott J, Hofmann C, Hausner MA, et al. Rapid decay of anti-SARS-CoV-2 antibodies in persons with mild Covid-19. N Engl J Med. 2020;383(11):1085–7.

    Article  Google Scholar 

  43. 43.

    Kutsuna S, Asai Y, Matsunaga A. Loss of anti-SARS-CoV-2 antibodies in mild Covid-19. N Engl J Med. 2020;383(17):1695–6.

    Article  Google Scholar 

  44. 44.

    Milani GP, Dioni L, Favero C, Cantone L, Macchi C, Delbue S, et al. Serological follow-up of SARS-CoV-2 asymptomatic subjects. Sci Rep. 2020;10(1):20048.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  45. 45.

    To KK, Hung IF, Ip JD, Chu AW, Chan WM, Tam AR, et al. COVID-19 re-infection by a phylogenetically distinct SARS-coronavirus-2 strain confirmed by whole genome sequencing. Clin Infect Dis: Off Publ Infect Dis Soc Am 2020.

  46. 46.

    Mulder M, van der Vegt D, Oude Munnink BB, GeurtsvanKessel CH, van de Bovenkamp J, Sikkema RS, et al. Reinfection of SARS-CoV-2 in an immunocompromised patient: a case report. Clin Infect Dis: Off Publ Infect Dis Soc Am. 2020.

  47. 47.

    Hussain A, Hasan A, Nejadi Babadaei MM, Bloukh SH, Chowdhury MEH, Sharifi M, et al. Targeting SARS-CoV2 spike protein receptor binding domain by therapeutic antibodies. Biomed Pharmacother. 2020;130:110559.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  48. 48.

    WHO. Immunity passports in the context of COVID-19. World Health Organization commentaries. 2020; (Accessed: Friday, December 25th, 2020).

  49. 49.

    Waller J, Rubin GJ, Potts HWW, Mottershaw AL, Marteau TM. Immunity Passports' for SARS-CoV-2: an online experimental study of the impact of antibody test terminology on perceived risk and behaviour. BMJ Open. 2020;10(8):e040448.

    Article  Google Scholar 

  50. 50.

    Brown RCH, Kelly D, Wilkinson D, Savulescu J. The scientific and ethical feasibility of immunity passports. Lancet Infect Dis. 2021;21(3):e58-e63.

Download references


This study was possible through the Diagnostics Pillar of the Mass General Brigham COVID Center for Innovation (MGBCCI) (, predominantly staffed by volunteers. The LFA horizon scan and selection was performed by the Direct-to-Consumer working group, and the Diagnostics Accelerator and Validation working group developed the protocols and performed the evaluations. The study is supported by funding from the Massachusetts Consortium on Pathogen Readiness (MassCPR) (


This study was funded by a grant from the Massachusetts Consortium for Pathogen Readiness (MassCPR).

Author information




BAT, SEK, RRK, BCC, and SS wrote the manuscript. BAT, SEK, and KG led the project management, volunteer coordination, and lab operations. BAT, SEK, BCC, PK, and LLR established the Diagnostics Accelerator lab, acquired samples, and established biosafety and IRB protocols. SS and LLR managed the COVID Diagnostics Accelerator lab. BAT and RRK led sample and data management. RRK and BCC led the analysis and data interpretation. RA, JT, and GM corresponded with LFA providers and down-selected assays for evaluation. DRW led the Diagnostics Pillar of the Mass General Brigham COVID Center for Innovation. All authors planned and/or performed experiments, edited, and approved the final version of the manuscript.

Corresponding author

Correspondence to Sara Suliman.

Ethics declarations

Ethics approval and consent to participate

All samples were collected from participants who provided informed consent. Sample and data collection conformed to Good Clinical Practice guidelines and Declaration of Helsinki. The study was approved under the Massachusetts General Brigham (MGB) Institutional Review Board (protocol no. 2020P001204).

Consent for publication

Not applicable.

Competing interests

DRW has a financial interest in Quanterix Corporation, a company that develops an ultra-sensitive digital immunoassay platform. He is an inventor of the Simoa technology and a founder of the company and also serves on its Board of Directors. DRW's interests were reviewed and are managed by Brigham and Womens Hospital and Partners HealthCare in accordance with their policies on competing interests. The assays in this publication have been licensed by Brigham and Womens Hospital to Quanterix Corporation. MN, TG and DRW have filed intellectual property on the technology described in this manuscript.

All other authors did not declare a conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The authors identified an error in the author’s name of Rakiesha Gay. The incorrect name is: Rakeisha Gay. The correct name is Rakiesha Gay.

Supplementary Information

Additional file 1: Supplementary Methods. Supplementary Table 1.

LFA commercial kit information, sample requirements, and protocol details. Supplementary Table 2. IFU clarity rubric. Supplementary Table 3. Usability ratings and description of kit components. Supplementary Table 4. Clinical information for the COVID-positive individuals whose plasma was used in this study. Supplementary Table 5. LFA Limits of detection estimated from anti-spike antibodies concentrations measured by Simoa.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Trombetta, B.A., Kandigian, S.E., Kitchen, R.R. et al. Evaluation of serological lateral flow assays for severe acute respiratory syndrome coronavirus-2. BMC Infect Dis 21, 580 (2021).

Download citation


  • SARS-CoV-2
  • Coronavirus
  • COVID-19
  • Antibodies
  • Lateral flow assays