1932

Abstract

Over the past few years, interest in the identification of rare variants that influence human phenotype has led to the development of many statistical methods for testing for association between sets of rare variants and binary or quantitative traits. Here, I review some of the most important ideas that underlie these methods and the most relevant issues when choosing a method for analysis. In addition to the tests for association, I review crucial issues in performing a rare variant study, from experimental design to interpretation and validation. I also discuss the many challenges of these studies, some of their limitations, and future research directions.

Loading

Article metrics loading...

/content/journals/10.1146/annurev-genom-083115-022609
2016-08-31
2024-04-20
Loading full text...

Full text loading...

/deliver/fulltext/genom/17/1/annurev-genom-083115-022609.html?itemId=/content/journals/10.1146/annurev-genom-083115-022609&mimeType=html&fmt=ahah

Literature Cited

  1. Asimit JL, Day-Williams AG, Morris AP, Zeggini E. 1.  2012. ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data. Hum. Hered. 73:84–94 [Google Scholar]
  2. Asimit JL, Zeggini E. 2.  2010. Rare variant association analysis methods for complex traits. Annu. Rev. Genet. 44:293–308 [Google Scholar]
  3. Bansal V, Libiger O, Torkamani A, Schork NJ. 3.  2010. Statistical analysis strategies for association studies involving rare variants. Nat. Rev. Genet. 11:773–85 [Google Scholar]
  4. Basu S, Pan W. 4.  2011. Comparison of statistical tests for disease association with rare variants. Genet. Epidemiol. 35:606–19 [Google Scholar]
  5. Capanu M, Begg CB. 5.  2011. Hierarchical modeling for estimating relative risks of rare genetic variants: properties of the pseudo-likelihood method. Biometrics 67:371–80 [Google Scholar]
  6. Chen LS, Hsu L, Gamazon ER, Cox NJ, Nicolae DL. 6.  2012. An exponential combination procedure for set-based association tests in sequencing studies. Am. J. Hum. Genet. 91:977–86 [Google Scholar]
  7. Davydov EV, Goode DL, Sirota M, Cooper GM, Sidow A, Batzoglou S. 7.  2010. Identifying a high fraction of the human genome to be under selective constraint using GERP++. PLOS Comput. Biol. 6:e1001025 [Google Scholar]
  8. Dering C, Hemmelmann C, Pugh E, Ziegler A. 8.  2011. Statistical analysis of rare sequence variants: an overview of collapsing methods. Genet. Epidemiol. 35:Suppl. 1S12–17 [Google Scholar]
  9. Derkach A, Lawless JF, Sun L. 9.  2013. Robust and powerful tests for rare variants using Fisher's method to combine evidence of association from two or more complementary tests. Genet. Epidemiol. 37:110–21 [Google Scholar]
  10. Derkach A, Lawless JF, Sun L. 10.  2014. Pooled association tests for rare genetic variance: a review and some new results. Stat. Sci. 29:302–21 [Google Scholar]
  11. 11. GTEx Consort. 2015. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348:648–60 [Google Scholar]
  12. Han F, Pan W. 12.  2010. A data-adaptive sum test for disease association with multiple common or rare variants. Hum. Hered. 70:42–54 [Google Scholar]
  13. Hoffmann TJ, Marini NJ, Witte JS. 13.  2010. Comprehensive approach to analyzing rare genetic variants. PLOS ONE 5:e13584 [Google Scholar]
  14. Ionita-Laza I, Buxbaum JD, Laird NM, Lange C. 14.  2011. A new testing strategy to identify rare variants with either risk or protective effect on disease. PLOS Genet. 7:e1001289 [Google Scholar]
  15. King CR, Nicolae DL. 15.  2014. GWAS to sequencing: divergence in study design and analysis. Genes 5:460–76 [Google Scholar]
  16. King CR, Rathouz PJ, Nicolae DL. 16.  2010. An evolutionary framework for association testing in resequencing studies. PLOS Genet. 6:e1001202 [Google Scholar]
  17. King CR, Rathouz PJ, Nicolae DL. 17.  2015. Prediction and replication from case-control sequencing studies using custom genotyping and additional sequencing. arXiv:1312.7714v3
  18. Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM, Shendure J. 18.  2014. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46:310–15 [Google Scholar]
  19. Lee S, Abecasis GR, Boehnke M, Lin X. 19.  2014. Rare-variant association analysis: study designs and statistical tests. Am. J. Hum. Genet. 95:5–23 [Google Scholar]
  20. Lee S, Emond MJ, Bamshad MJ, Barnes KC, Rieder MJ. 20.  et al. 2012. Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. Am. J. Hum. Genet. 91:224–37 [Google Scholar]
  21. Li B, Leal S. 21.  2008. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Hum. Genet. 83:311–21 [Google Scholar]
  22. Lin DY, Tang ZZ. 22.  2011. A general framework for detecting disease associations with rare variants in sequencing studies. Am. J. Hum. Genet. 89:354–67 [Google Scholar]
  23. Liu Q, Nicolae DL, Chen LS. 23.  2013. Marbled inflation from population structure in gene-based association studies with rare variants. Genet. Epidemiol. 37:286–89 [Google Scholar]
  24. Madsen BE, Browning SR. 24.  2009. A groupwise association test for rare mutations using a weighted sum statistic. PLOS Genet. 5:e1000384 [Google Scholar]
  25. Mathieson I, McVean G. 25.  2012. Differential confounding of rare and common variants in spatially structured populations. Nat. Genet. 44:243–46 [Google Scholar]
  26. Morgenthaler S, Thilly WG. 26.  2007. A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). Mutat. Res. 615:28–56 [Google Scholar]
  27. Morris AP, Zeggini E. 27.  2010. An evaluation of statistical approaches to rare variant analysis in genetic association studies. Genet. Epidemiol. 34:188–93 [Google Scholar]
  28. Neale B, Rivas M, Voight B, Altshuler D, Devlin B. 28.  et al. 2011. Testing for an unusual distribution of rare variants. PLOS Genet. 7:e1001322 [Google Scholar]
  29. Pan W. 29.  2009. Asymptotic tests of association with multiple SNPs in linkage disequilibrium. Genet. Epidemiol. 33:497–507 [Google Scholar]
  30. Price AL, Kryukov GV, de Bakker PIW, Purcell SM, Staples J. 30.  et al. 2010. Pooled association tests for rare variants in exon-resequencing studies. Am. J. Hum. Genet. 86:832–38 [Google Scholar]
  31. Quintana MA, Berstein JL, Thomas DC, Conti DV. 31.  2011. Incorporating model uncertainty in detecting rare variants: the Bayesian risk index. Genet. Epidemiol. 35:638–49 [Google Scholar]
  32. Reimherr M, Nicolae DL. 32.  2014. A functional data analysis approach for genetic association studies. Ann. Appl. Stat. 8:406–29 [Google Scholar]
  33. Sul JH, Han B, He D, Eskin E. 33.  2011. An optimal weighted aggregated association test for identification of rare variants involved in common diseases. Genetics 188:181–88 [Google Scholar]
  34. Sun J, Zheng Y, Hsu L. 34.  2013. A unified mixed-effects model for rare-variant association in sequencing studies. Genet. Epidemiol. 37:334–44 [Google Scholar]
  35. Wang GT, Zhang D, He Z, Hang D, Li B, Leal S. 35.  2015. Pitfalls in development of statistical methods for rare variant association studies. Presented at Annu. Meet. Am. Soc. Hum. Genet., 65th, Baltimore, MD, Oct. 6–10
  36. Wu M, Lee S, Cai T, Li Y, Boehnke M, Lin X. 36.  2011. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89:82–93 [Google Scholar]
  37. Yi N, Zhi D. 37.  2010. Bayesian analysis of rare variants in genetic association studies. Genet. Epidemiol. 35:57–69 [Google Scholar]
  38. Zhou H, Sehl ME, Sinsheimer JS, Lange K. 38.  2010. Association screening of common and rare genetic variants by penalized regression. Bioinformatics 26:2375–82 [Google Scholar]
/content/journals/10.1146/annurev-genom-083115-022609
Loading
/content/journals/10.1146/annurev-genom-083115-022609
Loading

Data & Media loading...

  • Article Type: Review Article
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error