Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Optimal classification and generalized prevalence estimates for diagnostic settings with more than two classes

Published

Author(s)

Rayanne Luke, Anthony J. Kearsley, Paul Patrone

Abstract

An accurate multiclass classification strategy is crucial to interpreting antibody tests. However, traditional methods based on confidence intervals or receiver operating characteristics lack clear extensions to settings with more than two classes. We address this problem by developing a multiclass classification based on probabilistic modeling and optimal decision theory that minimizes the convex combination of false classification rates. The classification process is challenging when the relative fraction of the population in each class, or generalized prevalence, is unknown. Thus, we also develop a method for estimating the generalized prevalence of test data that is independent of classification. We validate our approach on serological data with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) naïve, previously infected, and vaccinated classes. Synthetic data are used to demonstrate that (i) prevalence estimates are unbiased and converge to true values and (ii) our procedure applies to arbitrary measurement dimensions. In contrast to the binary problem, the multiclass setting offers wide-reaching utility as the most general framework and provides new insights into prevalence estimation best practices.
Citation
Mathematical Biosciences

Keywords

Antibody testing, diagnostics, multiclass classification, prevalence estimation, SARS-CoV-2

Citation

Luke, R. , Kearsley, A. and Patrone, P. (2023), Optimal classification and generalized prevalence estimates for diagnostic settings with more than two classes, Mathematical Biosciences, [online], https://doi.org/10.1016/j.mbs.2023.108982, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=935493 (Accessed April 30, 2024)
Created February 17, 2023, Updated April 1, 2024