1932

Abstract

Most countries collect short-term food consumption information of individuals on a regular basis. These data, after much analysis and interpretation, are used to assess the nutritional status of population subgroups, design food assistance programs, guide nutritional and food policy, and—in epidemiological applications—uncover associations between diet and health. In this review, we focus on surveillance, a broad term that includes, for example, estimation of nutritional status and evaluation of the adequacy of the diet. From a statistical viewpoint, dietary intake and evaluation questions pose tremendous methodological challenges. Nutrient and food adequacy are defined in terms of long-term intakes, yet we can only practically observe short-term consumption, perhaps over one or two days. Food consumption measurements are noisy and subject to both systematic and random error, and in addition, there are very large day-to-day differences in a person's food consumption. Observed distributions of food and nutrient intake tend to be skewed, with long tails to the right and (in the case of episodically consumed items) with a mass at zero that can represent a large proportion of the distribution. We review the literature on this topic and describe some of the newest questions and proposed methodological solutions. The focus is on the use of large national food consumption surveys to address public policy and research questions, but much of what we discuss is applicable in a broader context.

Loading

Article metrics loading...

/content/journals/10.1146/annurev-statistics-041715-033615
2017-03-07
2024-04-26
Loading full text...

Full text loading...

/deliver/fulltext/statistics/4/1/annurev-statistics-041715-033615.html?itemId=/content/journals/10.1146/annurev-statistics-041715-033615&mimeType=html&fmt=ahah

Literature Cited

  1. Agresti A. 1990. Categorical Data Analysis New York: Wiley
  2. Arab L, Wesseling-Perry K, Jardack P. 2010. Eight self-administered 24-hour dietary recalls using the internet are feasible in African Americans and Caucasians: the Energetics Study. J. Am. Diet. Assoc. 110:857–64 [Google Scholar]
  3. Basiotis PP, Welsh SO, Cronin FJ, Kelsay JL, Mertz W. 1987. Number of days of food intake records required to estimate individual and group nutrient intakes with defined confidence. J. Nutr. 117:1638–41 [Google Scholar]
  4. Basulto-Elías G, Carriquiry AL, De Brabanter K, Nordman D. 2016. Bivariate deconvolution. PhD thesis, Iowa State Univ., Ames
  5. Beale EML. 1962. Some uses of computers in operational research. Ind. Organ. 31:27–28 [Google Scholar]
  6. Beaton GH. 1994. Criteria of an adequate diet. Modern Nutrition in Health and Disease ME Shils, JA Olson, M Shike 1491–505 Philadelphia: Lea & Febiger, 8th ed.. [Google Scholar]
  7. Beaton GH, Miner J, Corey P, McGuire V, Cousins M. et al. 1979. Sources of variance in 24-hour dietary recall data: implications for nutrition study design and interpretation. Am. J. Clin. Nutr. 32:2546–59 [Google Scholar]
  8. Belanger CF, Hennekens CH, Rosner B, Speizer FE. 1978. The nurses’ health study. Am. J. Nurs. 78:61039–40 [Google Scholar]
  9. Bingham SA. 2002. Biomarkers in nutritional epidemiology. Public Health Nutr. 5:969–76 [Google Scholar]
  10. Bingham SA, Cummings JH. 1985. Urine nitrogen as an independent validatory measure of dietary intake: a study of nitrogen balance in individuals consuming their normal diet. Am. J. Clin. Nutr. 42:1276–89 [Google Scholar]
  11. Black AE, Cole TJ. 2000. Within- and between-subject variation in energy expenditure measured by the doubly-labeled water technique: implications for validating reported dietary energy intake. Eur. J. Clin. Nutr. 54:386–94 [Google Scholar]
  12. Block G, Woods M, Potosky A, Clifford C. 1990. Validation of a self-administered diet history questionnaire using multiple diet records. J. Clin. Epidemiol. 43:1327–35 [Google Scholar]
  13. Carriquiry AL. 1999. Assessing the prevalence of nutrient inadequacy. Public Health Nutr. 2:23–33 [Google Scholar]
  14. Carriquiry AL, Fuller WA. 1995. Estimation of day-to-day correlations for dietary components. Proceedings of the Biometrics Section362–64 Alexandria, VA: Am. Stat. Assoc.
  15. Carriquiry AL, Spungen JH, Murphy SP, Pehrsson PH, Dwyer JT. et al. 2016. Variability in iodine concentration in foods: considerations for dietary assessment. Am. J. Clin. Nutr. 104:877S–87S [Google Scholar]
  16. Carroll RJ. 2014. Estimating the distribution of dietary consumption patterns. Stat. Sci. 29:2–8 [Google Scholar]
  17. Carroll RJ, Hall P. 1988. Optimal rates of convergence for deconvolving a density. J. Am. Stat. Assoc. 83:1184–86 [Google Scholar]
  18. Carroll RJ, Ruppert D, Stefanski LA, Craniceanu C. 2006. Measurement Error in Nonlinear Models: A Modern Perspective New York: Chapman & Hall/CRC
  19. CDC (Cent. Dis. Control) 2014. National Health and Nutrition Examination Survey. Hyattsville, MD: U.S. Dep. Health Hum. Serv.
  20. Combs GF Jr., Trumbo PR, McKinley MC, Milner J, Studenski S. et al. 2013. Biomarkers in nutrition: new frontiers in research and application. Ann. N. Y. Acad. Sci. 1278:1–10 [Google Scholar]
  21. Comte F, Samson A, Stirnemann JJ. 2014. Deconvolution estimation of onset of pregnancy with replicate observations. Scand. J. Stat. 41:325–45 [Google Scholar]
  22. Dekkers ALM, Verkaik-Kloosterman J, van Rossum CTM, Ocké MC. 2014. SPADE, a new statistical program to estimate habitual dietary intake from multiple food sources and dietary supplements. J. Nutr. 144:2083–91 [Google Scholar]
  23. Delaigle A, Gijbels I. 2004. Practical bandwidth selection in deconvolution kernel density estimation. Comput. Stat. Data Anal. 45:249–67 [Google Scholar]
  24. Delaigle A, Meister A. 2008. Density estimation with heteroscedastic errors. Bernoulli 2:562–79 [Google Scholar]
  25. Dodd KW, Guenther PM, Freedman LS, Subar AF, Kipnis V. et al. 2006. Statistical methods for estimating usual intake of nutrients and foods: a review of the theory. J. Am. Diet. Assoc. 106:1640–50 [Google Scholar]
  26. Dougherty DM, Mathias CW, Dawes MA, Furr RM, Charles NE. et al. 2013. Impulsivity, memory, attention and decision-making among adolescent marijuana users. Psychopharmacology 226:307–19 [Google Scholar]
  27. Fan J. 1992. Design-adaptive nonparametric regression. J. Am. Stat. Assoc. 87:998–1004 [Google Scholar]
  28. Fienberg SE, Steorts RC. 2014. Discussion of “Estimating the distribution of dietary consumption patterns.”. Stat. Sci. 29:195–96 [Google Scholar]
  29. Freedman DA. 1965. On the asymptotic behavior of Bayes estimates in the discrete case II. Ann. Math. Stat. 36:454–56 [Google Scholar]
  30. Freedman LS, Commins JM, Moler JE, Arab L, Baer DJ. et al. 2014. Pooled results from five validation studies of dietary self-report instruments using recovery biomarkers for energy and protein intake. Am. J. Epidemiol. 164:1–17 [Google Scholar]
  31. Freedman LS, Guenther PM, Dodd KW, Krebs-Smith SM, Midthune D. 2010. The population distribution of ratios of usual intakes of dietary components that are consumed every day can be estimated from repeated 24-hour recalls. J. Nutr. 140:111–16 [Google Scholar]
  32. Freedman LS, Midthune D, Carroll RJ, Commins JM, Arab L. et al. 2015. Application of a new statistical model for measurement error to the evaluation of dietary self-report instruments. Epidemiology 26:925–33 [Google Scholar]
  33. Freedman LS, Schatzkin A, Midthune D, Kipnis V. 2011. Dealing with dietary measurement error in nutritional cohort studies. J. Natl. Cancer Inst. 103:1086–92 [Google Scholar]
  34. Freisling H, van Bakel MM, Biessy C, May A, Byrnes G. et al. 2012. Dietary reporting errors on 24-hour recalls and dietary questionnaires are associated with BMI across six European countries as evaluated with recovery biomarkers for protein and potassium intake. Br. J. Nutr. 107:910–20 [Google Scholar]
  35. Fuller WA. 1987. Measurement Error Models New York: Wiley
  36. Gelman A. 2007. Struggles with survey weighting and regression modeling. Stat. Sci. 22:153–64 [Google Scholar]
  37. Genest C, Ghoudi K, Rivest LP. 1995. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82:543–52 [Google Scholar]
  38. Ghosh M, Steorts R. 2013. Two-stage Bayesian benchmarking as applied to small area estimation. TEST 22:670–87 [Google Scholar]
  39. Gibson RS. 2005. Principles of Nutritional Assessment New York: Oxford Univ. Press, 2nd ed..
  40. Goyeneche JJ, Carriquiry AL, Fuller WA. 1997. Estimating bivariate usual intake distributions. Proceedings of the Biometrics Section106–10 Alexandria, VA: Am. Stat. Assoc.
  41. Gu C. 1993. Smoothing spline density estimation: theory. Ann. Stat. 21:217–34 [Google Scholar]
  42. Guenther PM, Reedy J, Krebs-Smith SM. 2008. Development of the Healthy Eating Index—2005. J. Am. Diet. Assoc. 108:1896–1901 [Google Scholar]
  43. Harray SJ, Boushey CJ, Pollard CM, Delp EJ, Ahmad Z. et al. 2015. A novel dietary assessment method to measure a healthy and sustainable diet using the mobile food record: protocol and methodology. Nutrients 7:5275–395 [Google Scholar]
  44. Haytowitz DB, Lemar LE, Pehrsson PR. 2009. USDA's Nutrient Databank System—a tool for handling data from diverse sources. J. Food Compos. Anal. 22:433–41 [Google Scholar]
  45. Haytowitz DB, Pehrsson PR, Holden JM. 2008. The National Food and Nutrient Analysis Program: a decade of progress. J. Food Compos. Anal. 21:S94–S102 [Google Scholar]
  46. Heitmann BL, Lissner L. 1995. Dietary underreporting by obese individuals—is it specific or non-specific?. BMJ 311:986–89 [Google Scholar]
  47. IOM (Inst. Med.) 2011. Dietary Reference Intakes: Calcium and Vitamin D CA Ross, CL Taylor, AL Yaktine, H Del Valle Washington, DC: Natl. Acad. Press
  48. IOM (Inst. Med.) 2015. Review of WIC Food Packages: Proposed Framework for Revisions K Rasmussen, S Whaley, M Latulippe Washington, DC: Natl. Acad. Press
  49. Kipnis V, Carroll RJ, Freedman LS, Li L. 1999. Implications of a new dietary measurement error model for estimation of relative risk: application to four calibration studies. Am. J. Epidemiol. 150:642–51 [Google Scholar]
  50. Kipnis V, Midthune D, Freedman LS, Bingham S, Schatzkin A. et al. 2001. Empirical evidence of correlated biases in dietary assessment instruments and its implications. Am. J. Epidemiol. 153:394–403 [Google Scholar]
  51. Kipnis V, Subar AF, Midhune D, Freedman LS, Ballard-Barbash R. et al. 2003. Structure of dietary measurement error: results of the OPEN biomarker study. Am. J. Epidemiol. 158:14–21 [Google Scholar]
  52. Lin HL. 2013. Statistical methods in disease risk analysis, disease testing and nutrition epidemiology PhD thesis, Iowa State Univ., Ames
  53. Lin LK, Vonesh EF. 1989. An empirical nonlinear data-fitting approach for transforming data to normality. Am. Stat. 43:237–43 [Google Scholar]
  54. Masry E. 1991. Multivariate probability density deconvolution for stationary random processes. IEEE Trans. Inform. Theory 37:1105–15 [Google Scholar]
  55. Mendelsohn J, Rice JA. 1982. Deconvolution of microfluorometric histograms with B splines. J. Am. Stat. Assoc. 77:748–53 [Google Scholar]
  56. Mertz W, Kelsay JL. 1984. Rationale and design of the Beltsville one-year dietary intake study. Am. J. Clin. Nutr. 40:S1323–26 [Google Scholar]
  57. Moshfegh AG, Rhodes DG, Baer DJ, Murayi T, Clemens JC. et al. 2008. The U.S. Department of Agriculture automated multiple-pass method reduces bias in the collection of energy intakes. Am. J. Clin. Nutr. 88:324–32 [Google Scholar]
  58. Murphy S, Ruel M, Carriquiry A. 2012. Should household consumption and expenditure surveys (HCES) be used for nutritional assessment and planning?. Food Nutr. Bull. 33:S235–41 [Google Scholar]
  59. Neumann MH, Hossjer O. 1997. On the effect of estimating the error density in non-parametric deconvolution. J. Non-Parametr. Stat. 7:307–30 [Google Scholar]
  60. Nusser SM, Carriquiry AL, Dodd KW, Fuller WA. 1996. A semi-parametric approach to estimating usual intake distributions. J. Am. Stat. Assoc. 91:1440–49 [Google Scholar]
  61. Nusser SM, Fuller WA, Guenther PM. 1997. Estimating usual dietary intake distributions: adjusting for measurement error and non-normality in 24-hour food intake data. Survey Measurement and Process Quality L Lyeberg, P Biemer, M Collins, E Deleeuw, C Dippo 670–89 New York: Wiley [Google Scholar]
  62. Patterson RE, Krystal AR, Carter RA, Fels-Tinker L, Bolton MP, Augurs-Collins T. 1999. Measurement characteristics of the women's health initiative food frequency questionnaire. Ann. Epidemiol. 9:178–87 [Google Scholar]
  63. Politis DN. 2003. The impact of bootstrap methods on time series analysis. Stat. Sci. 18:219–30 [Google Scholar]
  64. Prentice RL. 2007. Observational studies, clinical trials and the women's health initiative. Lifetime Data Anal. 13:449–62 [Google Scholar]
  65. Prentice RL, Huang Y. 2011. Measurement error modeling and nutritional epidemiology association analysis. Can. J. Stat. 39:498–509 [Google Scholar]
  66. Prentice RL, Huang Y, Kuller LH, Tinker LF, Horn LV. et al. 2011. Biomarker-calibrated energy and protein consumption and cardiovascular disease risk among postmenopausal women. Epidemiology 22:170–79 [Google Scholar]
  67. Rollo ME, Ash S, Lyons-Wall P, Russell AW. 2015. Evaluation of a mobile phone image-based dietary assessment method in adults with type-II diabetes. Nutrients 7:4897–910 [Google Scholar]
  68. Rosner B, Willett WC, Spiegelman D. 1989. Correction of logistic regression relative risks and confidence intervals for systematic within-person measurement error. Stat. Med. 8:1051–70 [Google Scholar]
  69. Schakel SF, Buzzard IM, Gebhardt SE. 1997. Procedures for estimating nutrient values for food composition databases. J. Food Compos. Anal. 10:102–14 [Google Scholar]
  70. Schatzkin A, Kipnis V. 2004. Could exposure assessment problems give us wrong answers to nutrition and cancer questions?. J. Natl. Cancer Inst. 96:1564–65 [Google Scholar]
  71. Schoeller DA, Hnilicka JM. 1996. Reliability of the doubly labeled water method for the measurement of total daily energy expenditure in free-living subjects. J. Nutr. 126:348S–54S [Google Scholar]
  72. Souverein OW, Dekkers AL, Geelen A, Haubrock J, de Vries JH. et al. 2011. Comparing four methods to estimate usual intake distributions. Eur. J. Clin. Nutr. 65:S92–S101 [Google Scholar]
  73. Stefanski LA, Carroll RJ. 1990. Deconvolving kernel density estimators. Statistics 21:169–84 [Google Scholar]
  74. Stephens MA. 1974. EDF statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 69:730–37 [Google Scholar]
  75. Stiebeling H. 1941. Are we all well fed? A report on the diets of families in the United States. Misc. Publ. Number 430, US Dep. Agric., Washington, DC
  76. Stirnemann JJ, Comte F, Samson A. 2012. Density estimation of a biomedical variable subject to measurement error using an auxiliary set of replicate observations. Stat. Med. 31:4154–63 [Google Scholar]
  77. Subar AF, Crafts J, Zimmerman TP, Wilson M, Mittl B. et al. 2010. Assessment of the accuracy of portion size reports using computer-based photograph aids in the development of an automated self-administered 24-hr recall. J. Am. Diet. Assoc. 110:55–64 [Google Scholar]
  78. Subar AF, Kipnis V, Troiano RP, Midhune D, Schoeller DA. et al. 2003. Using intake biomarkers to evaluate the extent of dietary misreporting in a large sample of adults: the OPEN study. Am. J. Epidemiol. 158:1–13 [Google Scholar]
  79. Subar AF, Thompson FE, Kipnis V, Midthune D, Hurwitz P. et al. 2001. Comparative validation of the Block, Willett and National Cancer Institute food frequency questionnaires: the Eating at America's Table Study. Am. J. Epidemiol. 154:1089–99 [Google Scholar]
  80. Tippett KS, Mickle SJ, Goldman JD, Sykes KE, Cook DA. et al. 1995. Food and nutrient intake by individuals in the United States, 1 day, 1989–91. NFS Rep. No. 91-2. Natl. Tech. Inf. Serv., Springfield, VA
  81. Tooze JA, Midhune DW, Dodd KW, Freedman LS, Krebs-Smith SM. et al. 2006. A new statistical method for estimating the usual intake of episodically consumed foods with application to their distribution. J. Am. Diet. Assoc. 106:1575–87 [Google Scholar]
  82. USDA (U.S. Dep. Agric.) 1987. Nationwide food consumption survey: Continuing survey of food intakes by individuals, low-income women 19–50 years and their children 1–5 years, 4 days 1985. NCFS, CSFII Rep. No. 85-4. Natl. Tech. Inf. Serv., Springfield, VA
  83. USDA (U.S. Dep. Agric.) 1993. Food and nutrient intake by individuals in the United States, 1 day, 1987–88. NFCS Rep. No. 87-I-1. Natl. Tech. Inf. Serv., Springfield, VA
  84. USDA (U.S. Dep. Agric.) 2010. Dietary Guidelines for Americans 2010 Washington, DC: U.S. Gov. Print. Off, 7th ed..
  85. Vandervijvere S, Geelen A, Gonzalez-Gross A, van't Veer P, Dallongeville J. et al. 2013. Evaluation of food and nutrient intake assessment using concentration biomarkers in European adolescents from the Healthy Lifestyle in Europe by Nutrition in Adolescence study. Br. J. Nutr. 109:736–47 [Google Scholar]
  86. Veugelers PJ, Ekwaru JP. 2014. A statistical error in the estimation of the recommended dietary allowance for vitamin D. Nutrients 6:4472–75 [Google Scholar]
  87. Wahba G. 1975. Optimal convergence properties of variable knot, kernel and orthogonal series methods for density estimation. Ann. Stat. 1:15–29 [Google Scholar]
  88. Wang CY, Carriquiry AL, Chen T-C, Loria CM, Pfeiffer CM. et al. 2015. Estimating the population distribution of usual 24-hour sodium excretion from timed urine void specimens using a statistical approach accounting for correlated measurement errors. J. Nutr. 145:1017–24 [Google Scholar]
  89. Weissel R, Dop MC. 2012. The Adult Male Equivalent concept and its application to Household Consumption and Expenditure Surveys (HCES). Food Nutr. Bull. 33:S157–63 [Google Scholar]
  90. Willett WC, Sampson L, Stampfer MJ, Rosner B, Bain C. et al. 1985. Reproducibility and validity of a semiquantitative food frequency questionnaire. Am. J. Epidemiol. 122:51–65 [Google Scholar]
  91. Zhang S, Krebs-Smith S, Midhune D, Perez A, Buckman DW. et al. 2011a. Fitting a bivariate measurement error model for episodically consumed dietary components. Int. J. Biostat. 7:1–29 [Google Scholar]
  92. Zhang S, Midhune D, Guenther P, Krebs-Smith S, Kipnis V. et al. 2011b. A new multivariate measurement error model with zero-inflated dietary data, and its application in dietary assessment. Ann. Appl. Stat. 5:1456–87 [Google Scholar]
/content/journals/10.1146/annurev-statistics-041715-033615
Loading
/content/journals/10.1146/annurev-statistics-041715-033615
Loading

Data & Media loading...

  • Article Type: Review Article
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error