Spatial Data Analysis

Sudipto Banerjee

doi:10.1146/annurev-publhealth-032315-021711

Annual Review of Public Health

Volume 37, 2016

Review Article

Free


Affiliation: Department of Biostatistics, University of California, Los Angeles, California 90095
Vol. 37:47-60 (Volume publication date March 2016) https://doi.org/10.1146/annurev-publhealth-032315-021711
First published as a Review in Advance on January 20, 2016
© Annual Reviews

Abstract

With increasing accessibility to geographic information systems (GIS) software, statisticians and data analysts routinely encounter scientific data sets with geocoded locations. This has generated considerable interest in statistical modeling for location-referenced spatial data. In public health, spatial data routinely arise as aggregates over regions, such as counts or rates over counties, census tracts, or some other administrative delineation. Such data are often referred to as areal data. This review article provides a brief overview of statistical models that account for spatial dependence in areal data. It does so in the context of two applications: disease mapping and spatial survival analysis. Disease maps are used to highlight geographic areas with high and low prevalence, incidence, or mortality rates of a specific disease and the variability of such rates over a spatial domain. They can also be used to detect hot spots or spatial clusters that may arise owing to common environmental, demographic, or cultural effects shared by neighboring regions. Spatial survival analysis refers to the modeling and analysis for geographically referenced time-to-event data, where a subject is followed up to an event (e.g., death or onset of a disease) or is censored, whichever comes first. Spatial survival analysis is used to analyze clustered survival data when the clustering arises from geographical regions or strata. Illustrations are provided in these application domains.

Keywords: Bayesian hierarchical modeling, conditional autoregressive (CAR) models, cure rate models, disease mapping, multivariate CAR models, multivariate disease mapping, spatial survival analysis
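
To make the idea of spatial dependence in areal data concrete, the following is a minimal sketch of a proper conditional autoregressive (CAR) structure. It is not taken from the article: the four-region map, the values of `rho` and `tau`, and the function names `car_precision` and `conditional_mean` are all hypothetical choices for illustration. The sketch assumes the common proper-CAR form in which the precision matrix is tau * (D - rho * W), where W is the binary neighbor (adjacency) matrix and D is the diagonal matrix of neighbor counts.

```python
def car_precision(adjacency, rho, tau=1.0):
    """Return tau * (D - rho * W) as a nested list.

    adjacency: symmetric 0/1 matrix W with a zero diagonal (assumed).
    rho: spatial dependence parameter; |rho| < 1 keeps the matrix
         strictly diagonally dominant, hence positive definite.
    tau: overall precision (hypothetical default of 1.0).
    """
    n = len(adjacency)
    for i in range(n):
        for j in range(n):
            if adjacency[i][j] != adjacency[j][i]:
                raise ValueError("adjacency matrix must be symmetric")
    return [
        [
            tau * ((sum(adjacency[i]) if i == j else 0.0) - rho * adjacency[i][j])
            for j in range(n)
        ]
        for i in range(n)
    ]


def conditional_mean(adjacency, rho, phi, i):
    """Mean of region i's effect given all others under the proper CAR:
    rho times the average of the effects in the neighboring regions."""
    neighbours = [j for j, a in enumerate(adjacency[i]) if a]
    if not neighbours:
        raise ValueError("region has no neighbours")
    return rho * sum(phi[j] for j in neighbours) / len(neighbours)


# Hypothetical map: four regions in a row, 0 - 1 - 2 - 3.
W = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
Q = car_precision(W, rho=0.9)

# Hypothetical region-level random effects (for example, log relative risks).
phi = [0.2, -0.1, 0.4, 0.0]
m1 = conditional_mean(W, 0.9, phi, 1)  # shrinks region 1 toward regions 0 and 2
```

Zeros in `Q` mark pairs of regions that are conditionally independent given the rest of the map, which is how a CAR model encodes the notion that only neighboring regions share effects directly.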


