League Tables for Hospital Comparisons

Sharon-Lise T. Normand; Arlene S. Ash; Stephen E. Fienberg; Thérèse A. Stukel; Jessica Utts; Thomas A. Louis

doi:10.1146/annurev-statistics-022513-115617

Annual Review of Statistics and Its Application

Volume 3, 2016

Review Article

Free

League Tables for Hospital Comparisons

Sharon-Lise T. Normand¹, Arlene S. Ash², Stephen E. Fienberg³, Thérèse A. Stukel⁴, Jessica Utts⁵, and Thomas A. Louis⁶
View Affiliations Hide Affiliations

Affiliations: ¹Department of Health Care Policy, Harvard Medical School, and Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115; email: [email protected] ²Department of Quantitative Health Sciences, University of Massachusetts Medical School, Worcester, Massachusetts 01605; email: [email protected] ³Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; email: [email protected] ⁴Institute for Clinical Evaluative Sciences, Toronto, Ontario M4N 3M5, Canada, and the Institute of Health Policy, Management & Evaluation, University of Toronto, Toronto, Ontario M5T 3M6, Canada, and Dartmouth Institute for Health Policy and Clinical Practice, Hanover, New Hampshire 03766; email: [email protected] ⁵Department of Statistics, University of California, Irvine, California 92697; email: [email protected] ⁶Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205; email: [email protected]
Vol. 3:21-50 (Volume publication date June 2016) https://doi.org/10.1146/annurev-statistics-022513-115617
© Annual Reviews

Abstract

We review statistical methods for estimating and interpreting league tables used to infer hospital quality with a primary focus on methods for partitioning variation into two types: (a) that associated with within-hospital variation for a homogeneous group of patients and (b) that produced by between-hospital variation. We discuss the types of covariates included in the model, hierarchical and nonhierarchical logistic regression models for conducting inferences in a low-information context and their associated trade-offs, and the role of hospital volume. We use all-cause mortality rates for US hospitals to illustrate concepts and methods.

Keyword(s): Bayesian inference, hierarchical model, low information, observational data, profiling, risk adjustment

Article metrics loading...

/content/journals/10.1146/annurev-statistics-022513-115617

2016-06-01

2024-04-18

Full text loading...

/deliver/fulltext/statistics/3/1/annurev-statistics-022513-115617.html?itemId=/content/journals/10.1146/annurev-statistics-022513-115617&mimeType=html&fmt=ahah

Literature Cited

Alexandrescu R, Jen M-H, Bottle A, Jarman B, Aylin P. 2011. Logistic versus hierarchical modeling: an analysis of a statewide inpatient sample. J. Am. Coll. Surg. 213:392–401 [Google Scholar]
Ash AS, Fienberg SE, Louis TA, Normand S-LT, Stukel TA, Utts J. 2012. Statistical issues in assessing hospital performance Rep., Comm. Pres. Stat. Soc. https://www.cms.gov/Medicare/Quality-Initiatives-Patient-Assessment-Instruments/HospitalQualityInits/Downloads/Statistical-Issues-in-Asses-sing-Hospital-Performance.pdf
Ash A, Shwartz M. 1999. R2: a useful measure of model performance when predicting a dichotomous outcome. Stat. Med. 18:375–84 [Google Scholar]
Baggerly KA, Coombes KR. 2011. What information should be required to support clinical “omics” publications?. Clin. Chem. 57:688–90 [Google Scholar]
Bayarri MJ, Castellanos ME. 2007. Bayesian checking of the second levels of hierarchical models. Stat. Sci. 22:322–43 [Google Scholar]
Berk RA. 2008. Statistical Learning from a Statistical Perspective New York: Springer-Verlag
Birkmeyer JD, Siewers AE, Finlayson EVA, Stukel TA, Lucas FL. et al. 2002. Hospital volume and surgical mortality in the United States. N. Engl. J. Med. 346:1128–37 [Google Scholar]
Bishop YMM, Fienberg SE, Holland PW. 2007 (1975). Discrete Multivariate Analysis: Theory and Practice New York: Springer-Verlag
Blumberg MS. 1987. Comments on HCFA hospital death rate statistical outliers. Health Serv. Res. 21:715–39 [Google Scholar]
Breiman L. 2001. Random forests. Mach. Learn. 45:5–32 [Google Scholar]
Bunker JP, Forrest WHJ, Mosteller F, Vandam LD. 1969. The National Halothane Study: a study of the possible association between halothane anesthesia and post-operative hepatic necrosis. Rep. Subcomm. Anesth., Div. Med. Sci. Natl. Acad. Sci.–Natl. Res. Counc. Washington, DC:
Camilli G, Cizek GJ, Lugg CA. 2001. Psychometric theory and the validation of performance standards: history and future perspectives. Setting Performance Standards: Concepts, Methods and Perspectives GJ Cizek 445–75 Mahwah, NJ: Lawrence Erlbaum [Google Scholar]
Carlin BP, Louis TA. 2009. Bayesian Methods for Data Analysis. Boca Raton, FL: Chapman & Hall/CRC, 3rd ed..
CDC (Cent. Dis. Control) 2010. Your guide to the standardized infection ratio (SIR). NHSN e-news Special Edition Dec. 10
Citro C, Kalton G. 2000. Small-Area Income and Poverty Estimates: Priorities for 2000 and Beyond Washington, DC: Natl. Acad. Press
Cleveland WS, Devlin SJ. 1988. Locally-weighted regression: an approach to regression analysis by local fitting. J. Am. Stat. Assoc. 83:596–610 [Google Scholar]
Crainiceanu CM, Caffo BS, Morris J. 2013. Multilevel functional data analysis. The SAGE Handbook of Multilevel Modeling MA Scott, JS Simonoff, BD Marx 223–48 Los Angeles: SAGE [Google Scholar]
Crainiceanu CM, Ruppert D, Carroll RJ, Adarsh J, Goodner B. 2007. Spatially adaptive penalized splines with heteroscedastic errors. J. Comput. Graph. Stat. 16:265–88 [Google Scholar]
Dempster AP. 1988. Employment discrimination and statistical science. Stat. Sci. 3:149–61 Discussion. 3:162–95 [Google Scholar]
Diggle PJ, Thomson MC, Christensen OF, Rowlingson B, Obsomer V. et al. 2007. Spatial modelling and the prediction of Loa loa risk: decision making under uncertainty. Ann. Trop. Med. Parasitol. 101:499–509 [Google Scholar]
Dudley RA, Johansen KL, Brand R, Rennie DJ, Milstein A. 2000. Elective referral to high-volume hospitals: estimating potentially avoidable deaths. JAMA 283:1159–66 [Google Scholar]
Efron B. 1978. Regression and ANOVA with zero-one data: measures of residual variation. J. Am. Stat. Assoc. 73:113–21 [Google Scholar]
Ericksen EP, Kadane JB. 1985. Estimating the population in a census year: 1980 and beyond. J. Am. Stat. Assoc. 80:98–109 Discussion. 80:110–31 [Google Scholar]
Fienberg SE. 2011. Bayesian models and methods in public policy and government settings. Stat. Sci. 26:212–26 Discussion. 26:227–30 [Google Scholar]
Fiscella K, Burstin HR, Nerenz DR. 2014. Quality measures and sociodemographic risk factors: to adjust or not to adjust. JAMA 321:242615–16 [Google Scholar]
Freedman DA, Navidi WC. 1986. Regression models for adjusting the 1980 census. Stat. Sci. 1:3–11 Discussion. 1:12–39 [Google Scholar]
Gatsonis CA. 1998. Profiling providers of medical care. Encyclopedia of Biostatistics 3 P Armitage, T Colton New York: Wiley, 2nd ed.. [Google Scholar]
Gelman A, Carlin J, Stern H, Rubin D. 2004. Bayesian Data Analysis Boca Raton, FL: Chapman & Hall/CRC, 2nd ed..
Gelman A, van Mechelen I, Verbeke G, Heitjan DF, Meulders M. 2005. Multiple imputation for model checking: completed-data plots with missing and latent data. Biometrics 61:74–85 [Google Scholar]
Goldman E, Chu P, Osmond D, Bindman A. 2011. The accuracy of present-on-admission reporting in administrative data. Health Serv. Res. 46:1946–62 [Google Scholar]
Goldstein H, Spiegelhalter DJ. 1996. League tables and their limitations: statistical issues in comparisons of institutional performance. J. R. Stat. Soc. Ser. A 159:3385–443 [Google Scholar]
Greiner DJ. 2008. Causal inference in civil rights litigation. Harvard Law Rev. 122:533–98 [Google Scholar]
Hastie TJ, Tibshirani RJ, Friedman JH. 2009. The Elements of Statistical Learning New York: Springer-Verlag, 2nd ed..
Iezzoni LI. 1997. Assessing quality using administrative data. Ann. Intern. Med. 127:666–74 [Google Scholar]
Iezzoni LI. 2003. Risk Adjustment for Measuring Health Care Outcomes Chicago, IL: Health Adm. Press., 3rd ed..
Jha AK, Zaslavsky AM. 2014. Quality reporting that addresses disparities in health care. JAMA 312:3225–26 [Google Scholar]
Jones HE, Spiegelhalter DJ. 2011. The identification of “unusual” health-care providers from a hierarchical model. Am. Stat. 65:154–63 [Google Scholar]
Kalbfleisch JD, Wolfe RA. 2013. On monitoring outcomes of medical providers. Stat. Biosci. 5:286–302 [Google Scholar]
Kipnis P, Escobar GJ, Draper D. 2010. Effect of choice of estimation method on inter-hospital mortality rate comparisons. Med. Care 48:458–65 [Google Scholar]
Krumholz HM, Brindis RG, Brush JE, Cohen DJ, Epstein AJ. et al. 2006. Standards for statistical models used for public reporting of health outcomes: an American Heart Association Scientific Statement from the Quality of Care and Outcomes Research Interdisciplinary Writing Group: cosponsored by the Council on Epidemiology and Prevention and the Stroke Council. Endorsed by the American College of Cardiology Foundation. Circulation 113:456–62 [Google Scholar]
Landrum M, Bronskill S, Normand S-LT. 2000. Analytic methods for constructing cross-sectional profiles of health care providers. Health Serv. Outcomes Res. Methodol. 1:23–48 [Google Scholar]
Landrum MB, Normand S-LT, Rosenheck RA. 2003. Selection of related multivariate means: monitoring psychiatric care in the Department of Veterans Affairs. J. Am. Stat. Assoc. 98:7–16 [Google Scholar]
Lin R, Louis TA, Paddock SM, Ridgeway G. 2006. Loss function based ranking in two-stage, hierarchical models. Bayesian Anal. 1:915–46 [Google Scholar]
Lin R, Louis TA, Paddock SM, Ridgeway G. 2009. Ranking USRDS provider specific SMRs from 1998–2001. Health Serv. Outcomes Res. Methodol. 9:22–38 [Google Scholar]
Lin X. 2007. Estimation using penalized quasilikelihood and quasi-pseudo-likelihood in Poisson mixed models. Lifetime Data Anal. 13:533–44 [Google Scholar]
Lockwood JR, Louis TA, McCaffrey DF. 2002. Uncertainty in rank estimation: implications for value-added modeling accountability systems. J. Educ. Behav. Stat. 27:255–70 [Google Scholar]
Louis TA, Zeger SL. 2008. Effective communication of standard errors and confidence intervals. Biostatistics 10:1–2 [Google Scholar]
Magder LS, Zeger S. 1996. A smooth nonparametric estimate of a mixing distribution using mixtures of Gaussians. J. Am. Stat. Assoc. 91:1141–51 [Google Scholar]
McCaffrey DF, Ridgeway G, Morral AR. 2004. Propensity score estimation with boosted regression for evaluating causal effects in observational studies. Psychol. Methods 9:403–25 [Google Scholar]
Mesirov JP. 2010. Computer science. Accessible reproducible research. Science 327:415–16 [Google Scholar]
Morris JS, Carroll RJ. 2006. Wavelet-based functional mixed models. J. R. Stat. Soc. Ser. B Stat. Methodol. 68:179–99 [Google Scholar]
Mosteller F. 2010. The safety of anesthetics: the National Halothane Study. The Pleasures of Statistics: The Autobiography of Frederick Mosteller SE Fienberg, DC Hoaglin, JM Tanur 69–88 New York: Springer [Google Scholar]
Ni X, Zhang D, Zhang HH. 2010. Variable selection for semiparametric mixed models in longitudinal studies. Biometrics 66:79–88 [Google Scholar]
Normand S-LT, Shahian DM. 2007. Statistical and clinical aspects of hospital outcomes profiling. Stat. Sci. 22:206–26 [Google Scholar]
Paddock S, Louis TA. 2011. Percentile-based empirical distribution function estimates for performance evaluation of healthcare providers. J. R. Stat. Soc. Ser. C Appl. Stat. 60:575–89 [Google Scholar]
Paddock S, Ridgeway G, Lin R, Louis TA. 2006. Flexible distributions for triple-goal estimates in two-stage hierarchical models. Comput. Stat. Data Anal. 50:3243–62 [Google Scholar]
Pepe MS. 2003. The Statistical Evaluation of Medical Tests for Classification and Prediction Oxford, UK: Oxford Univ. Press
Pepe MS, Feng Z, Huang Y, Longton G, Prentice R. et al. 2008. Integrating the predictiveness of a marker with its performance as a classifier. Am. J. Epidemiol. 167:362–68 [Google Scholar]
Ross JS, Normand S-LT, Wang Y, Ko DT, Chen J. et al. 2010. Hospital volume and 30-day mortality for three common medical conditions. N. Engl. J. Med. 362:1110–18 [Google Scholar]
Shahian DM, Normand S-LT. 2003. The volume-outcome relationship: from Luft to leapfrog. Ann. Thorac. Surg. 75:1048–58 [Google Scholar]
Shahian DM, Normand S-LT. 2008. Comparison of “risk-adjusted” hospital outcomes. Circulation 117:1955–63 [Google Scholar]
Shen W, Louis TA. 1998. Triple-goal estimates in two-stage, hierarchical models. J. R. Stat. Soc. Ser. B Stat. Methodol. 60:455–71 [Google Scholar]
Silber JH, Rosenbaum PR, Brachet TJ, Ross RN, Bressler LJ. et al. 2010. The Hospital Compare mortality model and the volume-outcome relationship. Health Serv. Res. 45:1148–67 [Google Scholar]
Spencer G, Wang J, Donovan L, Tu JV. 2008. Report on coronary artery bypass surgery in Ontario, fiscal years 2005/06 and 2006/07 Tech. Rep., Inst. Clin. Eval. Sci. Toronto:
Spiegelhalter D, Best N, Carlin B, Linde AVD. 2002. Bayesian measures of model complexity and fit. J. R. Stat. Soc. Ser. B Stat. Methodol. 64:583–639 [Google Scholar]
Spiegelhalter D, Sherlaw-Johnson C, Bardsley M, Blunt I, Wood C, Grigg O. 2012. Statistical methods for healthcare regulation: rating, screening and surveillance. J. R. Stat. Soc. Ser. A Stat. Soc. 175:1–47 [Google Scholar]
Tomberlin T. 1988. Predicting accident frequencies for drivers classified by two factors. J. Am. Stat. Assoc. 83:309–21 [Google Scholar]
Wang Y. 2011. Smoothing Splines: Methods and Applications Boca Raton, FL: Chapman & Hall/CRC
Whoriskey P. 2006. Florida to link teacher pay to students' test scores. Washington Post March 22
Wood SN. 2006. Generalized Additive Models: An Introduction with R Boca Raton, FL: Chapman & Hall/CRC

/content/journals/10.1146/annurev-statistics-022513-115617

League Tables for Hospital Comparisons

Annual Review of Statistics and Its Application 3, 21 (2016); https://doi.org/10.1146/annurev-statistics-022513-115617

/content/journals/10.1146/annurev-statistics-022513-115617

Data & Media loading...

Article Type: Review Article

Most Cited Most Cited RSS feed

- Probabilistic Forecasting
  
  Tilmann Gneiting, and Matthias Katzfuss
  
  Vol. 1 (2014), pp. 125–151
- Functional Data Analysis
  
  Jane-Ling Wang, Jeng-Min Chiou, and Hans-Georg Müller
  
  Vol. 3 (2016), pp. 257–295
- Bayesian Computing with INLA: A Review
  
  Håvard Rue, Andrea Riebler, Sigrunn H. Sørbye, Janine B. Illian, Daniel P. Simpson, and Finn K. Lindgren
  
  Vol. 4 (2017), pp. 395–421
- Functional Regression
  
  Jeffrey S. Morris
  
  Vol. 2 (2015), pp. 321–359
- Topological Data Analysis
  
  Larry Wasserman
  
  Vol. 5 (2018), pp. 501–532
- Algorithmic Fairness: Choices, Assumptions, and Definitions
  
  Shira Mitchell, Eric Potash, Solon Barocas, Alexander D'Amour, and Kristian Lum
  
  Vol. 8 (2021), pp. 141–163
- Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis
  
  Hongzhe Li
  
  Vol. 2 (2015), pp. 73–94
- Learning Deep Generative Models
  
  Ruslan Salakhutdinov
  
  Vol. 2 (2015), pp. 361–385
- On p-Values and Bayes Factors
  
  Leonhard Held, and Manuela Ott
  
  Vol. 5 (2018), pp. 393–419
- High-Dimensional Statistics with a View Toward Applications in Biology
  
  Peter Bühlmann, Markus Kalisch, and Lukas Meier
  
  Vol. 1 (2014), pp. 255–278
More Less

Annual Review of Statistics and Its Application

Volume 3, 2016

Review Article

Free

League Tables for Hospital Comparisons

Abstract

Most Read This Month

Most Cited Most Cited RSS feed