Skip to main content

Advertisement

Log in

Misclassification errors in prevalence estimation: Bayesian handling with care

  • Hints & Kinks
  • Published:
International Journal of Public Health

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

  • Berkvens D, Speybroeck N, Praet N, Adel A, Lesaffre E (2006) Estimating disease prevalence in a Bayesian framework using probabilistic constraints. Epidemiology 17:145–153

    Article  PubMed  Google Scholar 

  • Bouwknegt M, Engel B, Herremans MMPT, Widdowson MA, Worm HC, Koopmans MPG, Frankena K, De Roda Husman AM, De Jong MCM, Van Der Poel WHM (2008) Bayesian estimation of hepatitis E virus seroprevalence for populations with different exposure levels to swine in The Netherlands. Epidemiol Infect 136:567–576

    Article  PubMed  CAS  Google Scholar 

  • Branscum AJ, Gardner IA, Johnson WO (2005) Estimation of diagnostic-test sensitivity and specificity through Bayesian modeling. Prev Vet Med 68:145–163

    Article  PubMed  CAS  Google Scholar 

  • Engel B, Swildens B, Stegeman A, Buist W, De Jong M (2006) Estimation of sensitivity and specificity of three conditionally dependent diagnostic tests in the absence of a gold standard. J Agric Biol Environ Stat 11:360–380

    Article  Google Scholar 

  • Joseph L, Gyorkos TW, Coupal L (1995) Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. Am J Epidemiol 141(3):263–272

    PubMed  CAS  Google Scholar 

  • Liu P, Xiao S, Shi ZX, Bi XX, Yang HT, Jin H (2011) Bayesian evaluation of the human immunodeficiency virus antibody screening strategy of duplicate enzyme-linked immunosorbent assay in Xuzhou blood center, China. Transfusion 51:793–798

    Article  PubMed  Google Scholar 

  • Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS—A Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput 10:325–337

    Article  Google Scholar 

  • Lunn D, Spiegelhalter D, Thomas A, Best N (2009) The BUGS project: evolution, critique and future directions (with discussion). Stat Med 28:3049–3082

    Article  PubMed  Google Scholar 

  • Rogan WJ, Gladen B (1978) Estimating prevalence from the results of a screening test. Am J Epidemiol 107:71–76

    PubMed  CAS  Google Scholar 

  • Speybroeck N, Praet N, Claes F, Van Hong N, Torres K, Mao S, Van den Eede P, Thi Thinh T, Gamboa D, Sochantha T, Thang ND, Coosemans M, Büscher P, D’Alessandro U, Berkvens D, Erhart A (2011) True versus apparent malaria infection prevalence: the contribution of a Bayesian approach. PLoS ONE 6(2):e16705

    Article  PubMed  CAS  Google Scholar 

  • Speybroeck N, Williams CJ, Lafia KB, Devleesschauwer B, Berkvens D (2012) Estimating the prevalence of infections in vector populations using pools of samples. Med Vet Entomol 26:361–371

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niko Speybroeck.

Appendix 1: Bayesian approach for estimating “true” prevalence from “apparent” prevalence

Appendix 1: Bayesian approach for estimating “true” prevalence from “apparent” prevalence

To introduce the Bayesian approach for estimating “true” prevalence (pi) from “apparent” prevalence (p), we build on the illustrations from the main text. In these examples, we had assumed a test sensitivity (SE) of 80 % and a test specificity (SP) of 90 %. We then estimated “true” prevalence from an “apparent” prevalence of 24 and 5 %, using the Rogan–Gladen estimator (Eq. (2)). This resulted in estimates of, respectively, 20 and −48 %.

To parameterize the Bayesian model, we need information on the sample size (n), the number of positive test results (x), and an a priori probability distribution for “true” prevalence (pi). As a first illustration, we will continue to assume that SE and SP are known, fixed values (Model 1). This assumption will be relaxed in a further illustration (Model 2).

For simplicity, we will assume that our sample size was 500. An “apparent” prevalence of 24 % would thus have resulted from observing 120 positive test results. Likewise, an “apparent” prevalence of 5 % would have resulted from observing 25 positive results. As prior probability distribution for “true” prevalence, we will apply a uniform distribution ranging from 0 to 100 %. This is a common choice, as it expresses our belief before having observed any data, that the “true” prevalence can take any possible value and that each possible value is equally likely.

This information can now be used to establish the following Bayesian model:

Model 1 Bayesian estimation of true prevalence from apparent prevalence, based on fixed sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

We fitted the model in WinBUGS using two chains, each containing 6,000 samples, of which the first 1,000 were discarded as “burn-in”. Typical output is presented in Table 3. As expected, the Bayesian model succeeded in yielding only positive values, thus providing a useful “true” prevalence estimate for our second case.

Table 3 “True” prevalence estimation results from Bayesian model 1

The Bayesian approach makes it possible to flexibly account for uncertainty in the values for SE and SP. Instead of assuming fixed values, we assume in Bayesian model 2 that SE can take any possible value between 70 and 90 %, and that SP can take any possible value between 85 and 95 %. For the sake of comparability, these values were chosen so that their means would correspond to the fixed values applied before.

Model 2 Bayesian estimation of true prevalence from apparent prevalence, based on stochastic sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

Again, the model was fitted in WinBUGS using two chains, each containing 1,000 “burn-in” samples and 5,000 retained samples. Typical (numerical and graphical) output is presented in Table 4. The results are similar to those obtained by Model 1, but show wider credibility intervals, owing to the additional uncertainty introduced by defining SE and SP as stochastic nodes.

Table 4 “True” prevalence estimation results from Bayesian model 2

Rights and permissions

Reprints and permissions

About this article

Cite this article

Speybroeck, N., Devleesschauwer, B., Joseph, L. et al. Misclassification errors in prevalence estimation: Bayesian handling with care. Int J Public Health 58, 791–795 (2013). https://doi.org/10.1007/s00038-012-0439-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00038-012-0439-9

Keywords

Navigation