Network Sampling: From Snowball and Multiplicity to Respondent-Driven Sampling

Douglas D. Heckathorn; Christopher J. Cameron

doi:10.1146/annurev-soc-060116-053556

Affiliations: Department of Sociology, Cornell University, Ithaca, New York 14853-7601
Vol. 43:101-119 (Volume publication date July 2017) https://doi.org/10.1146/annurev-soc-060116-053556
First published as a Review in Advance on May 10, 2017
© Annual Reviews

Abstract

Network sampling emerged as a set of methods for drawing statistically valid samples of hard-to-reach populations. The first form of network sampling, multiplicity sampling, involved asking respondents about events affecting those in their personal networks; it was subsequently applied to studies of homicide, HIV, and other topics, but its usefulness is limited to public events. Link-tracing designs employ a different approach to study hard-to-reach populations, using a set of respondents that expands in waves as each round of respondents recruits peers. Link-tracing as applied to hidden populations, often described as snowball sampling, was initially considered a form of convenience sampling. This changed with the development of respondent-driven sampling (RDS), a widely used network sampling method in which the link-tracing design is adapted to provide the basis for statistical inference. The literature on RDS is large and rapidly expanding, involving contributions by numerous independent research groups employing data from dozens of different countries. Within this literature, many important research questions remain unresolved, including how best to choose among alternative RDS estimators, how to refine existing estimators to make them less dependent on assumptions that are sometimes counterfactual, and, perhaps the greatest unresolved issue, how best to calculate the variability of the estimates.

Keywords: hidden populations, link-tracing sampling, Markov, network sampling, RDS, respondent-driven sampling, social networks

Article Type: Review Article

Annual Review of Sociology

Volume 43, 2017
