Annual Reviews home
0
Skip to content
  • For Librarians & Agents
  • For Authors
  • Knowable Magazine
  • Institutional Login
  • Login
  • Register
  • Activate
  • 0 Cart
  • Help
Annual Reviews home
  • JOURNALS A-Z
    • Analytical Chemistry
    • Animal Biosciences
    • Anthropology
    • Astronomy and Astrophysics
    • Biochemistry
    • Biomedical Data Science
    • Biomedical Engineering
    • Biophysics
    • Cancer Biology
    • Cell and Developmental Biology
    • Chemical and Biomolecular Engineering
    • Clinical Psychology
    • Computer Science
    • Condensed Matter Physics
    • Control, Robotics, and Autonomous Systems
    • Criminology
    • Developmental Psychology
    • Earth and Planetary Sciences
    • Ecology, Evolution, and Systematics
    • Economics
    • Entomology
    • Environment and Resources
    • Financial Economics
    • Fluid Mechanics
    • Food Science and Technology
    • Genetics
    • Genomics and Human Genetics
    • Immunology
    • Law and Social Science
    • Linguistics
    • Marine Science
    • Materials Research
    • Medicine
    • Microbiology
    • Neuroscience
    • Nuclear and Particle Science
    • Nutrition
    • Organizational Psychology and Organizational Behavior
    • Pathology: Mechanisms of Disease
    • Pharmacology and Toxicology
    • Physical Chemistry
    • Physiology
    • Phytopathology
    • Plant Biology
    • Political Science
    • Psychology
    • Public Health
    • Resource Economics
    • Sociology
    • Statistics and Its Application
    • Virology
    • Vision Science
    • Article Collections
    • Events
    • Shot of Science
  • JOURNAL INFO
    • Copyright & Permissions
    • Add To Your Course Reader
    • Expected Publication Dates
    • Impact Factor Rankings
    • Access Metadata
    • RSS Feeds
  • PRICING & SUBSCRIPTIONS
    • General Ordering Info
    • Online Activation Instructions
    • Personal Pricing
    • Institutional Pricing
    • Society Partnerships
  •     S2O    
  •     GIVE    
  • ABOUT
    • What We Do
    • Founder & History
    • Our Team
    • Careers
    • Press Center
    • Events
    • News
    • Global Access
    • DEI
    • Directory
    • Help/FAQs
    • Contact Us
  • Home >
  • Annual Review of Public Health >
  • Volume 18, 1997 >
  • Preisser, pp 51-82
  • Save
  • Email
  • Share

CATEGORICAL DATA ANALYSIS IN PUBLIC HEALTH

  • Home
  • Annual Review of Public Health
  • Volume 18, 1997
  • Preisser, pp 51-82
  • Facebook
  • Twitter
  • LinkedIn
Download PDF

CATEGORICAL DATA ANALYSIS IN PUBLIC HEALTH

Annual Review of Public Health

Vol. 18:51-82 (Volume publication date May 1997)
https://doi.org/10.1146/annurev.publhealth.18.1.51

John S. Preisser

Section on Biostatistics, Department of Public Health Sciences, Bowman Gray School of Medicine, Winston-Salem, North Carolina 27157

Gary G. Koch

Department of Biostatistics, School of Public Health, University of North Carolina, Chapel Hill, North Carolina 27599-7400

Download PDF Article Metrics
  • Permissions
  • Reprints

  • Download Citation
  • Citation Alerts
Sections
  • Abstract
  • Key Words 
  • INTRODUCTION
  • CONTINGENCY TABLE METHODS
  • DEVELOPMENTS IN LOGISTIC REGRESSION
  • GENERALIZED ESTIMATING EQUATIONS
  • WEIGHTED LEAST SQUARES
  • CONCLUDING REMARKS
  • Acknowledgments
  • Literature Cited

Abstract

▪ Abstract A greater variety of categorical data methods are used today than 15 years ago. This article surveys categorical data methods widely applied in public health research. Whereas large sample chi-square methods, logistic regression analysis, and weighted least squares modeling of repeated measures once comprised the primary analytic tools for categorical data problems, today's methodology is comprised of a much broader range of tools made available by increasing computational efficiency. These include computational algorithms for exact inference of small samples and sparsely distributed data, conditional logistic regression for modeling highly stratified data, and generalized estimating equations for cluster samples. The latter, in particular, has found wide use in modeling the marginal probabilities of correlated counted, binary, and multinomial outcomes. The various methods are illustrated with examples including a study of the prevalence of cerebral palsy in very low birthweight infants and a study of cancer screening in primary care settings.

Key Words 

exact inference; conditional logistic regression; proportional odds; generalized estimating equations; weighted least squares.

INTRODUCTION

Public health research is frequently concerned with the relationship of categorical response variables with one or more explanatory variables. A categorical response variable may have two or more possibly ordered categories. For example, in a study of the factors related to cerebral palsy in premature neonates, the outcome of interest is whether an infant has developed cerebral palsy by one year of age. Categorical data methods are used to describe trends in the rate of cerebral palsy in infants born at a particular North Carolina hospital. In a state-wide study of cancer screening in primary care practices in Colorado, interest is in the physician and patient factors which predict the extent of breast cancer screening that women receive. The outcome for each woman takes a value of 0, 1, or 2 for the number of types of screening procedures (mammogram and clinical breast exam) received in the past year. The statistical analysis should consider that the level of one woman's cancer screening may be correlated to that of another patient in the same medical practice. An array of categorical data methods are available for analyzing such data. Only 15 years ago, much of it was either nonexistent, not widely known, or not easily implemented because computational advances lagged behind methodological ones.

Historically, emphasis has been placed on large sample chi-square methods for the analysis of contingency tables (43). In more recent years, with the advent of increasingly powerful computing, there has been an emergence of interest in exact inference (3). Exact methods have enabled estimation and hypothesis testing for tables with small samples and tables of moderate size with highly skewed marginal distributions for which large sample approximations are not justified. For tables in which sample size requirements are met, Mantel-Haenszel methods (47) may be employed. Less widely known are variations of these randomization methods that adjust for continuous covariates in the spirit of nonparametric analysis of covariance (40).

Unconditional or ordinary logistic regression, introduced in its contemporary form by Cox (18), remains one of the more commonly employed statistical methods and provides a means of simultaneously assessing the relationships of covariates to a dichotomous response. For responses with more than two categories, extensions of unconditional logistic regression (42) include the proportional odds model (61) for an ordered response. In the cerebral palsy example, a three-level outcome is death before one year, cerebral palsy, or healthy status at one year of age. The proportional odds model may be applied to the cancer screening study, but the estimation procedure should take into account the correlation of patient responses within the same practice. If the response is nominal, polychotomous logistic regression may apply. This paper emphasizes the odds ratio as a measure of association; however, other measures may be of interest such as the relative risk sometimes modeled in epidemiology (73).

In public health, data sometimes result from highly stratified random samples where there are many strata but only a few observations per stratum (65). Conditional logistic regression, traditionally used in case-control studies, applies to such data when interest is in a factor that varies within stratum, such as treatment in multi-center clinical trials with only a few patients per center. It also applies to cluster samples of small size such as multi-period crossover trials by assuming independence conditional upon subject specific parameters. In some logistic or conditional logistic regression problems, the sample size may be too small, or the data too sparse to justify large sample methods. In these situations, exact logistic regression provides estimates, tests, and confidence intervals for parameters that may be nonestimable otherwise.

This paper does not cover loglinear models (8, 38, 39) explicitly, but we note their historically important role in categorical data analysis, especially in the field of sociology (15). These methods are used to model counts and to describe the associations among a collection of categorical variables without necessarily defining a particular subset as the response variables. Their model parameters have interpretations conditional on the values of the other variables and they do not extend easily to cluster sample situations (53).

In this review we emphasize marginal models in contrast to transitional and random effects models whose parameters have different interpretations (see concluding remarks). Marginal models are concerned with the distribution of the response averaged over the possible values of the other variables, and their regression coefficients have the same interpretation as coefficients from a cross-sectional analysis. They receive wide use in public health research where interpretations are often made on the effect of certain exposures, conditions, interventions, or risk factors on a response that may be a specific disease or health behavior. For independent data, marginal modeling has been unified for different types of responses into the theory of generalized linear models (22, 62, 68). These models, which include linear regression, logistic regression and poisson regression, share similar fitting algorithms and large sample theory.

In more recent years, nonlikelihood-based large sample methods have been developed to analyze data from studies that involve cluster samples. The generalized estimating equations (GEE) procedure (51, 52, 92) can handle large numbers of clusters of varying size and is used to fit a broad range of models including loglinear and logistic models for correlated poisson, binary, and multinomial outcomes. For example, GEE can be used to fit a proportional odds model to the cancer screening data while accounting for the intracorrelation of patients within a medical practice. Prior to GEE, correlated categorical data were frequently analyzed by weighted least squares (WLS) methods (41, 44). WLS, however, is limited to situations where all predictors are categorical with sufficient sample sizes per cross-classification for the asymptotic properties to hold. Nevertheless, WLS is still useful for modeling such response functions as rank measures of association and incidence densities.

In choosing an appropriate statistical method for a categorical data situation, one should consider the measurement scale of the response variable as well as the form in which the data are best characterized. Categorical data methods apply to situations when the response variable is nominal or ordinal. A nominal scale only indicates whether a subject belongs to a particular category. Ordinal scales, on the other hand, assign values to the categories ranking them from poorest to best. Sometimes, however, aggregate measures, such as counts, incidence densities, or grouped survival times, are of interest. We distinguish between data represented in contingency table form to those in case record form to more readily identify appropriate statistical methods. Note, however, that the distinction between these two categorical data structures is tenuous, at best, because case record data can be conceptualized as a contingency table, albeit with sparse or zero cells, and vice versa, a table can be expressed as case records. Table 1 presents some useful statistical methods for categorical data.

image
CLICK TO VIEW
TABLE 1

Statistical methods for categorical data analysis in public health

This paper reviews and illustrates categorical data methods commonly used today, with particular emphasis on developments of the past 15 years. The sections are organized according to different categorical data situations that typically arise in public health research. A more comprehensive review, from a perspective of categorical data modeling in the past fifty years, is provided by Imrey et al (37). Due to the enormity of our subject, we do not attempt an exhaustive review of the literature, but often refer to other reviews or books that document the many contributions to the field. The next section reviews methods for contingency tables with special emphasis on exact inference. Unconditional, ordered, conditional, and exact logistic regression are reviewed in the third section. GEE and WLS are considered in the following two sections, respectively. The final section mentions other contributions and comments on future directions of categorical data analysis in public health.

CONTINGENCY TABLE METHODS

The statistical analyses of complex data sets often involve a combination of nonparametric and model-based testing and estimation procedures. This section discusses inference for contingency tables, especially for small samples. Later sections focus on model-based inference for case record data. Statistical analyses of contingency tables involve the analysis of two-way tables for the assessment of significant association between two variables, and the analysis of sets of two-way tables for testing conditional independence of two variables given additional variables or for testing homogeneity of odds ratios across strata formed by the additional variables. Statistics exist for situations in which both the row (exposure or factor) and column (response) variable are nominal, when only the column variable is ordinal, or when both row and column variable are ordinal. These methods involve statistics that assign scores, often integers or ranks, to the categories of ordinal variables.

The role of Mantel-Haenszel methods in public health research is well documented (11, 41, 42, 47). Historically, several related procedures were proposed that combined information relating a primary factor and a response variable across a set of 2 × 2 contingency tables, most notably the statistic by Mantel & Haenszel (60). This statistic and its extensions continue to receive wide use, partly because they require few assumptions. The only requirement is randomization of subjects to levels of the factor either explicitly as in randomized controlled clinical trials or implicitly by hypotheses or from conditional distribution arguments for observational data. They require only that the overall sample size be reasonably large for asymptotic results to hold. Thus they are applicable to many situations, including matched case-control studies in which tables represent matched sets of only a few subjects. A potential limitation is that generalizations of results to a target population require nonstatistical arguments of the representativeness of the study subjects to a cross-section of individuals in a target population because, in a strict statistical sense, the conclusions apply only to the study sample. Such representativeness is also an inherent and unverifiable assumption of model-based methods. An overview of Mantel-Haenszel methods is given by Kuritz et al (47).

When large sample methods cannot be justified, owing either to small samples or highly skewed observed table margins, exact methods are employed. Conditional exact methods, which we call exact methods for convenience, are based on the enumeration of a reference set of tables with the margins fixed to the totals observed in the data. P-values are determined by summing probabilities associated with tables from the reference set identified as more extreme than the table observed. These are tables with probabilities not exceeding the one observed or having some meaningful test statistic identifying a deviation from the null hypothesis greater than the deviation represented by the observed table. Monte Carlo methods are used when the sample size is too small for asymptotic inference, but too large for calculation of all probabilities in the complete permutation distribution required for exact inference. The Monte Carlo method consists of repeated sampling from the reference set and computing an unbiased point estimate and confidence interval for the exact p-value based on the sample. All of the analyses presented in this section were performed with the statistical package StatXact (83), the most comprehensive software available for performing exact significance tests and confidence interval estimation. Agresti (3) gives a comprehensive review of exact inference.

To facilitate discussion of these methods some notation is established. Let h = 1,2,…,q index the strata and let nhij denote the number of subjects in the sample who are jointly classified as belonging to the i-th factor level (row), i=1,…,s, the j-th level of the response variable (column), j=1,…,r, and the hth stratum. The resulting s × r contingency table for the hth stratum is displayed in Table 2. Nonparametric inference for a set of q tables is based on conditioning on appropriate marginal totals. Fixing row margins () and column margins () in a two-way table from a simple random sample induces the noncentral multivariate hypergeometric distribution, which is the foundation for both large sample asymptotic-based Mantel-Haenszel methods and exact inference for small samples based upon permutation distributions.

image
CLICK TO VIEW
TABLE 2

Observed contingency table for stratum h

The Analysis of 2 × 2 Tables

Exact inference for the 2 × 2 table was proposed by Fisher (3). Letting θ denote the odds ratio, and H0 and H1 the null and research hypothesis, respectively, the one-sided p-value for the Fisher's exact test for H0: θ = 1 vs H1: θ > 1 is obtained by summing the probabilities corresponding to tables in which the sample odds ratio is at least as large as observed, or equivalently those tables whose cell count for the first row and first column is at least as large as n11. Fisher showed that conditioning on the row and column margins from the observed table with cell counts (n11, n12, n21, n22) gives the probability of observing n11 = t as

1
equation 1
where the index of summation, u, ranges from the maximum of 0 and to the minimum of and , the possible values for n11 for the given marginal totals. Under H0, Expression 1 with θ = 1 is the hypergeometric distribution. A two-sided p-value is calculated by summing the probabilities of tables from the reference set whose probabilities are no larger than the probability of the observed table.

Exact inference can also provide point estimates and confidence intervals for θ. The conditional maximum likelihood (CML) estimator is the value of θ that maximizes Expression 1 given the observed table. For tables with a zero cell the unconditional maximum likelihood (ML) estimator of θ given by (n11n22)/(n12n21), and the CML estimator do not exist, although one can obtain a median unbiased estimate (34). In such situations, exact confidence intervals can be obtained while the asymptotic ones do not exist. Because exact methods tend to be conservative owing to the discrete distribution of the p-value, exact tests and confidence intervals are often determined excluding half the probability of the observed table, the so-called mid-p adjustment (3).

Example: osteogenic sarcoma

In a study of nonmetastatic sarcoma, investigators were interested in determining the predictors of disease within three years (DIS) (29). The data for 46 patients appear in Table 3. Associations are examined for 2 × 2 tables formed by the response DIS and each predictor, any osteoblastic pathology (AOP), low level of lymphocytic infiltration (LI), and male gender (MALE), respectively, where, in the table, Y = yes and N = no. Because some cells have zero or small counts the Fisher's exact test is applied. Note that for the test of association of LI with DIS, the 2 × 2 table formed by collapsing over AOP and MALE has cell counts n11=10, n12=0, n21=19, and n22=17. Setting θ = 1 in Expression 1, the probability under H0 of observing this table is .0049, which is the one-sided p-value for testing H0: θ = 1 vs H1: θ>1 since n11 observed was the largest possible. The complete set of possible values of n11 for the given marginal totals range from 0 to 10 giving a reference set of 11 tables. To calculate the two-sided p-value, the tables whose probabilities are less than .0049 are identified. These are tables n11=0, with P = .00005, n11=1 with P = .0002, and n11=2 with P = .0024. Summing these probabilities with the one observed gives a two-sided Fisher's exact test p-value of .0075. Because n12=0, the asymptotic-based confidence interval for θ involving LI does not exist. An exact mid-p adjusted interval, however, is given in Table 4 with a general summary of results. Note that the presence of each covariate is positively associated with disease.

image
CLICK TO VIEW
TABLE 3

Osteogenic sarcoma data*

image
CLICK TO VIEW
TABLE 4

P-values and 95% confidence intervals for osteogenic sarcoma data

Tests of Marginal Homogeneity in 2 × 2 Tables

In the previous section, marginal totals in the 2 × 2 table were conditioned upon to obtain inference on the odds ratio, θ. In some situations, however, it is the parameters of the marginal distributions of each variable that are of interest, and not θ. Studies of matched pairs or repeated measures are concerned with tests of marginal homogeneity, H0: vs H1: , where is the probability of observing the first level of the row variable, and is the probability of observing the first level of the column variable. These hypotheses address questions concerning the equality of matched cases and controls with respect to an exposure variable, or equality of an outcome at two different times. The statistic, (n12−n21)/(n12+n21), is based on the discordant pairs, or those that do not agree. The large sample version of the test is due to McNemar (64) and is described in Fleiss (27). Agresti (3) describes an exact approach for small samples.

The Analysis of s × r Tables

For s × r tables with sufficiently large samples, Mantel-Haenszel statistics having asymptotic chi-square distributions under the null hypothesis are often employed. For small samples, conditioning on the margins yields the noncentral multivariate hypergeometric distribution, an extension of Expression 1, from which probabilities of tables are derived. When rows and columns are nominal, the Freeman-Halton test provides a generalization of the Fisher's exact test. Its p-value is defined as the null probability of the set of tables having probability no greater than that of the observed table. Alternatively, the p-value can be based upon a meaningful statistic such as Pearson's chi-squared statistic, , where . The exact p-value is determined by summing the probabilities of those tables in the reference set with X2 at least as great as the one observed. This differs from the large sample Pearson chi-squared test which compares X2 to its asymptotic distribution. It can also give very different results than the Freeman-Halton test, because tables that are more contradictory to H0 according to X2 need not be less likely (3). For this reason, exact p-values based on a test statistic such as X2 are sometimes preferred. This is the main principle in exact methods, that exact p-values are derived with respect to a reference set of test statistics that otherwise might be compared to chi-square values if the asymptotic assumptions were justified.

Exact p-values are obtained in similar ways for tables in which the column or both the row and the column variable are ordinal. When only the column variable is ordinal, hypothesis tests can be carried out by general linear rank statistics. For tables with two rows, these take the form , where the {wj} are column scores. This statistic compares two multinomial populations with ordered responses, or c populations, each generating a binary response for which the same binomial distribution is the null hypothesis versus a trend alternative for the response rates. The latter finds application, for example, in dose-response studies and usually has integer scores, wj = j−1, assigned. Arbitrary scores yield permutation tests, although a common choice are ranks (or midranks for ties) giving the Wilcoxon Rank Sum Test. The Kruskal-Wallis statistic generalizes the Wilcoxon statistic for comparing ordered response distributions in more than two multinomials. When both variables are ordinal, arbitrary scores assigned to rows and columns give general linear by linear association statistics for which exact or asymptotic p-values are available. These statistics include Spearman and Pearson–type correlation statistics, and Jonckheere-Terpstra statistics (83), the latter reducing to the Wilcoxon Rank Sum Test for 2 × J tables. In a table with integer scores for both the row and column variable, an exact estimate is available for the common local odds ratio corresponding to any two adjacent rows and any two adjacent columns (2).

Example: outcomes in newborns: infant mortality and cerebral palsy.

Table 5 presents data on 216 very low birthweight (VLBW) infants who had birthweight between 500 and 1350 grams, and were born in the same obstetric referral center between 7/1/88 and 6/30/92 to a mother residing in Forsyth County, North Carolina. The goal is to determine whether trends in infant mortality and cerebral palsy at one year of age (adjusted for due date) are statistically significant or likely due to chance. Some studies have reported an increase in the rate of CP in the 1980s for surviving infants explained by an increasing survival of VLBW infants (6). The outcome variable is the status of the infant at the one-year follow-up visit: H = healthy status or possible cerebral palsy, C = definite diagnosis of cerebral palsy, and D = died before the one-year follow-up visit. A definite diagnosis of CP occurred when both an attending physician and physical therapist agreed on the diagnosis. There were only 4 cases of possible CP, so these were combined with healthy infants or those without a definite diagnosis of CP. To apply the trend test to mortality, columns H and C are combined, forming the response survived versus died. The set of 4 binomial distributions for each gender are tested for equality versus a trend alternative in response rates. Sample sizes are sufficiently large to justify asymptotic based p-values for the linear rank statistic with integer scores, known as the Cochran-Armitage test (83). Two-sided p-values for males and females and overall (ignoring gender) are .79, .011, and .038, respectively. Thus, there is a statistically significant decreasing trend for females but not for males, perhaps because of a high rate in 1990–1991. The second question is whether the rate of cerebral palsy decreased for those infants who survived to follow-up. A trend test is applied to the data in Table 6, but the p-values are determined by exact methods because of zero and low counts of cerebral palsy. The exact p-values for males, females, and overall are .27, .017, and .010, respectively. CML estimates (and exact 95% confidence intervals) of the equal local odds ratios are 1.4 (0.78, 2.8), 6.4 (1.09, 276), and 2.0 (1.15, 3.77), respectively, favoring a higher CP rate in earlier years. In sum, there is a statistically significant decreasing trend for females in the rate of CP, but not for males.

image
CLICK TO VIEW
TABLE 5

Outcomes in very low birthweight infants

image
CLICK TO VIEW
TABLE 6

Number of infants (out of those who survived) with cerebral palsy

The Analysis of Sets of 2 × 2 Tables

TESTS OF CONDITIONAL INDEPENDENCE AND ESTIMATION OF THE COMMON ODDS RATIO

Agresti (3) describes Birch's test, an exact version of the Mantel-Haenszel test for q 2 × 2 tables. The p-value for H0: θ = 1 vs H1: θ > 1 is the null probability that is at least as large as observed, given the observed marginal totals . Whereas the Mantel-Haenszel test is directed at the alternative hypothesis that a weighted average of the stratum-specific odds ratios differs from 1, Birch's test addresses the extent to which the strata have a homogeneous odds ratio that is greater than one. For one stratum, Birch's test reduces to Fisher's exact test.

If H0 is rejected, the average factor-response association may be estimated by the Mantel-Haenszel estimator if the sample size is sufficiently large (47). Alternatively, conditioning on the strata totals, and assuming a common odds ratio, θ, the CML estimator of θ, and exact confidence intervals can be obtained. The estimate of θ is obtained by maximizing the conditional likelihood of the joint distribution of which is a product of q terms of the type given in Expression 1. If T attains its minimum or maximum possible value, asymptotic confidence intervals for the Mantel-Haenszel estimator do not exist, but one-sided exact intervals can be obtained.

TESTS OF HOMOGENEITY OF ODDS RATIO

The exact test of conditional independence involves the odds ratio being constant across strata. Zelen (93) gives an exact test for homogeneity of odds ratios. For large samples, the likelihood ratio test (1) provides a test of the stratum by factor interaction in a model with the main effects. Tarone (85) has provided a correction to the asymptotically not strictly valid “Breslow-Day” test (11).

Example: outcomes in newborns: comparing rates of cerebral palsy

A possible explanation for the decline in CP is the introduction of surfactant replacement therapy in 1990 to prevent or ameliorate lung disease in some newborns (10). This suggests examining the odds ratio for CP and pre-1990 vs post-1990 (formed by combining column 1 with 2 and column 3 with 4 in Table 6) to determine if the rate of CP is different before versus after the introduction of surfactant. Since Table 6 suggests that the prevalence of CP is greater in males than females, gender is considered as a stratification variable. Zelen's exact test for a constant odds ratio gives p-value = .49, suggesting that the odds ratios are homogeneous. The exact p-value for the test of conditional independence is .026, so the rate of CP before 1990 is significantly greater than the rate after 1990. Asymptotic and exact results are given in Table 7.

image
CLICK TO VIEW
TABLE 7

Results for relationship of pre/post 1990 with cerebral palsy

The Analysis of Sets of 2 × r Tables

Sets of 2 × r tables present a special case for exact inference, since the analysis of sets of s × r tables for s > 2 is generally not computationally feasible at the present state of computing (3). A flexible class of statistics is given by the stratified linear rank statistic , which is a generalization of the linear rank statistic summed over strata. For large samples they correspond to Mantel-Haenszel methods, but exact inference is also possible. For an ordinal response with integer scores, a CML estimate is available in StatXact (83) for the equal odds ratio assumed constant within tables and across strata. Hilton & Mehta (35) provide exact sample size calculations for ordinal data.

Example: outcomes in newborns: joint consideration of CP and death

The variable, outcome at follow-up visit (OFUV), has three categories displayed in Table 5. Joint consideration of CP and death is of interest because the CP rate is affected by the mortality rate. The stratified linear rank statistic is used to test the association of pre/post 1990 with OFUV, adjusting for gender. Using Wilcoxon (rank) scores, the exact two-sided p-value is .085, compared to the asymptotic p-value of .079. So there is suggestive evidence that the outcomes of VLBW infants improved after 1990.

Nonparametric Analysis of Covariance for Sets of s × r Tables

As stated in the previous section, exact methods are generally not available beyond the q × 2 × r case. For large samples, the Mantel-Haenszel statistic extends to q × s × r tables, where q is the number of tables equal to the number of levels resulting from the cross-classification of stratifying variables. Since the landmark paper in 1959, these extended Mantel-Haenszel methods have been developed to test hypotheses involving average effects of the primary factor on the distribution of the response variable, adjusted for the potential stratum effects. Such statistics exist when both factor and response are nominal, only the response is ordinal, or both are ordinal. Kuritz et al (47) discuss these methods including their application to repeated measures situations.

Mantel-Haenszel methods may be adapted to test the independence of two categorical variables given additional covariates that may be categorical or continuous. The technique combines the rank analysis of covariance method of Quade (77) with the randomization model framework of extended Mantel-Haenszel statistics to carry out nonparametric comparisons between groups, after adjusting for the effects of the covariates. The methodology described in Koch et al (40, 41) and illustrated in Stokes et al (84) is as follows: First, the column variable (response) and the continuous covariate are ranked; then the ranked response is regressed, via a simple linear regression model, on the ranked covariate to produce residuals. The Mantel-Haenszel test defines the groups as row categories and the residuals as the column (outcome) variable, using the calculated values of the residuals for column category scores.

Example: outcomes in newborns: comparing rates of cerebral palsy

Since birthweight (BW) is an important determinant of CP (6), rank analysis of covariance is used to test the association for OFUV and pre/post 1990 adjusting for BW. First both OFUV and BW are ranked. Then residuals are obtained from the linear regression of the ranks of OFUV on the ranks of BW. A linear rank statistic is determined with pre/post 1990 defining rows and the residuals as scores for columns. The p-value .066 is suggestive for differences in the rates of outcomes for infants, adjusting for birthweight. To adjust for gender and BW, the ranking and regression steps are carried out for each gender. Then a stratified linear rank statistic is applied with gender as the stratification variable, and with pre/post 1990 as the row variable and residuals as the column variable. The nonsignificant p-value of .14 may indicate that the differences in outcomes for infants before versus after 1990 are partly explained by differences in the distributions of gender and BW. In the next section, logistic regression is applied to try to further explain the differences in CP and mortality rates.

DEVELOPMENTS IN LOGISTIC REGRESSION

Logistic regression is used to describe the relationship between a categorical response variable and a set of explanatory variables that may include both categorical and continuous variables. We review here unconditional logistic regression for dichotomous responses. Extensions of logistic regression to multi-category responses are discussed below in particular, the proportional odds model for ordinal responses. Applications of conditional logistic regression are presented including case-control studies as well as lesser known applications for highly stratified data with small sample sizes per stratum. We also discuss exact logistic regression for small sample sizes or sparsely distributed data. In the remainder of this article, models are presented for categorical data, with notation introduced as needed for case record data.

Unconditional Logistic Regression

Unconditional logistic regression remains a widely used and effective tool for assessing the magnitude of the effect of explanatory variables on response probabilities. Let the response variable yi=1 with probability πi for the ith individual if the outcome of interest is observed, and 0 otherwise. Also suppose that all individuals' responses are statistically independent and that πi may depend on a set of p explanatory variables, , through an unknown regression parameter vector, , and γ, an intercept parameter for a reference population. To show this dependence, write πi(βA), where βA = (γ, β) is the augmented parameter vector that includes the intercept. The joint distribution of the yi, is

2
equation 2
Viewed as a function of the unknown βA with the data fixed, Expression 2 is the likelihood function. The logistic model for πi is
3
equation 3
where ηi is called the linear predictor and xi′β is notation for . The odds ratio of a positive response for a unit increase in xk is . The ML estimate is obtained by inserting Expression 3 into Expression 2 and differentiating with respect to βA to obtain the likelihood score equations
4
equation 4
where , the augmented covariate vector for the ith individual. The solution to Equation 4 requires iteration since it is nonlinear in βA. For sufficiently large samples has an approximate multivariate normal distribution. Inverting the Fisher information matrix yields the covariance matrix for from which standard errors are extracted to construct confidence intervals for the true parameters βk, or tests of significance based on Wald tests. For the public health data analyst who is mainly interested in applying logistic regression, Hosmer & Lemeshow (36) provide a very readable text.

Logistic Regression for Ordinal Responses

The proportional odds model (61) specifies a cumulative logit link to relate p covariates to an ordinal response that takes the values, with corresponding multinomial probabilities for the ith individual, , and . Let θig denote the odds of observing the gth category or above for the ith individual, that is

equation
The proportional odds model is
5
equation 5
for , where the intercept terms account for different logodds and probabilities that the response is at least as great as g in the reference population. Parameter interpretations are such that no matter how the cut off dichotomizes the ordinal response, the odds ratio does not change. Interpretations proceed in global terms for the odds of a higher response compared to a lower. This assumption can be checked with the score test for the proportional odds assumption (74), which is part of the output of SAS PROC LOGISTIC. If the assumption is contradicted, generalized logits may be modeled. These describe the odds of making each response relative to a reference response, giving r−1 distinct sets of parameter estimates. Also known as polychotomous logistic regression, this approach has been applied to many areas, including public health (23, 48). Sample size calculations for Expression 5 are given by Whitehead (89).

Example: expanded data set for outcomes in newborns

O'Shea et al (70) investigated the rate of CP over time according to the availability of surfactant replacement therapy. We consider the CP rate from 7/1/86 to 6/30/94, dividing time into two four-year intervals. The first interval, 7/1/86 to 6/30/90, corresponds to the pre-surfactant years, and 7/1/90 to 6/30/94 corresponds to the post-surfactant years. The proportional odds model is applied to OFUV using data on 394 very low birthweight infants, which include the 216 babies in Table 5. The model assumes that the effect of any covariate measured by an odds ratio, θi3, comparing the outcome of death (D) to alive at follow-up (H or C) will be equal to the odds ratio, θi2, comparing an unfavorable outcome (D or C) to H. Of interest is whether the difference in OFUV between the two periods can be explained by gender and BW (per 100 grams). All predictors were significant at the .0001 level. Table 8 shows that after adjusting for gender and BW, there remains a significant improvement in the outcomes of infants over time consistent with the nonsignificant trend from the nonparametric analysis of covariance, based on the smaller data set. This analysis shows that the occurrence of an unfavorable outcome compared to a more favorable one had 2.57 times higher odds before 1990 than after 1990 in the population of VLBW babies studied. The improvement seen for the larger data set can perhaps be explained by surfactant replacement therapy, but this information was not available for individual babies in this analysis. The results should be scrutinized more closely, however, because a score test p-value of 0.012 rejects the proportional odds assumption. As a way to evaluate the general findings, separate ordinary logistic regressions were fit to the two odds of interest, θi3, and θi2. For both models, all explanatory variables were significant with p-values less than .01, and respective parameter estimates of PRE90 were 0.78 and 1.04. In other words, the occurence of death before the one-year follow-up visit had 2.18 times higher odds before 1990 as after, and the occurence of death or CP versus healthy outcome had 2.83 times higher odds before 1990 as after 1990 after adjusting for potential differences in the distributions of gender and birthweight. An alternative approach to analysis uses polychotomous logistic regression to model the odds of D versus H, and C versus H. This approach would be especially useful if the goal were to estimate the response probabilities of OFUV for different subgroups of newborns. Stokes et al (84) describe the use of SAS PROC CATMOD for polychotomous logistic regression.

image
CLICK TO VIEW
TABLE 8

Results of proportional odds model for outcome in infants

Conditional Logistic Regression

Historically, conditional logistic regression has been applied to matched case-control studies in which a subject known to have the event of interest (case) is paired, or matched with a person who does not have the event (control) (11). Data are collected, retrospectively, to determine whether the case and control were exposed to certain risk factors. For example, Mack et al (57) studied women in a retirement community to determine if there was an association between the use of estrogen and the incidence of endometrial cancer. Cases were matched to controls with the same age and marital status, and who were living in the same community at the time of the diagnosis of the case. Information was collected on obesity, hypertension, gallbladder disease history, and nonestrogen drug use. The stratified logistic model for , where yij = 1 if the jth subject in the ith matched pair has the event and xij is the vector of risk factors, is

6
equation 6
where γi is a stratum-specific parameter for the ith matched set, and β is a p dimensional vector of parameters common across all strata. In a matched case-control study the stratum refers to the matched set. Unconditional ML estimation discussed above would give incorrect results because there are too many parameters to estimate in relation to the sample size. A solution can be obtained for matched pairs by conditioning on the set of {yi1 + yi2} to eliminate the γi. The resulting conditional likelihood is maximized to provide CML estimates of factors that differ within stratum. To implement the model for matched pairs, each is transformed to a single observation with explanatory variables having values equal to the differences between the corresponding values for the case and the control. The outcome variable has the value 1 for all paired observations. An unconditional logistic regression model without intercept is fitted to the newly created covariates and artificial response. Conditional logistic regression reduces to McNemar's test when a single dichotomous explanatory variable is used. A related but more general method is needed for studies with n cases and m controls. It involves a likelihood like that for proportional hazards models for time-to-event data. Stokes et al (84) illustrate the use of SAS PROC PHREG for n cases and m controls, and SAS PROC LOGISTIC for the 1:1 endometrial cancer study.

Conditional logistic regression applies to other highly stratified data where primary interest is in within-stratum factors. Multicenter clinical trials with many centers and small numbers of patients per center can have centers as the strata. Center effects are conditioned out and inference is made on within-center treatment comparisons. Conditional logistic regression also applies to some correlated data situations. In multiperiod cross-over trials with 2 or more treatments, conditional maximum likelihood removes the subject-specific effect to make inference on the within-subject treatment effects. Stokes et al (84) provide such an illustration. Alternatively, generalized estimating equations (90) (described below) provide more comprehensive modeling, which also accounts for explanatory variables at the subject level.

Example: schizophrenia and birth complications

In a case-control study looking at the role of birth complications in patients with schizophrenia, several siblings from each of 7 families were studied. Each individual was classified as either normal (NM) or with schizophrenia (SH). The goal was to determine if there was a positive relationship between schizophrenia and a birth complications (BC) index that took a value somewhere between 0 (uncomplicated birth) to 15 (severely complicated birth). The data appear in Table 9. In Expression 6, γi denotes the ith family effect, and β represents the effect of birth complications on schizophrenia. The conditional maximum likelihood estimate for β is 0.325, with asymptotic p-value (Wald-type) = .053. The odds ratio estimate is 1.384 with a 95% asymptotic-based confidence interval of (0.996, 1.924).

image
CLICK TO VIEW
TABLE 9

Schizophrenia and birth complications data*

Exact Inference for Logistic Regression

When sample sizes are too small or data too sparsely distributed, exact methods of estimation and inference can be applied for models in the previous three sections. The principle of conditioning on observed margins of tables to obtain exact tests for contingency tables can be extended to logistic regression. Inference is based on exact permutational distributions of the sufficient statistics that correspond to the regression parameter of interest, conditioning on the sufficient statistics of the remaining parameters fixed at their observed values. Exact logistic regression simplifies to exact tests for contingency tables because observed row and column margins correspond to certain sufficient statistics. The conditioning principle applies to both the unstratified model (Expression 3) and the stratified model (Expression 6) so that the same algorithms are applied to both in the computations. For unstratified binary data, the p × 1 vector of sufficient statistics for β, with elements is . The conditional distribution of T is obtained by conditioning on , the sufficient statistic for γ. Exact inference for β requires fixing some of the sufficient statistics at their observed values and varying those of interest over their permissible ranges. If interest is in a single parameter, say β1, then inference is based on the distribution of , which may be represented by f(t1 | β1). An exact p-value for H0: β1 = 0 vs H1: β1≠ 0 is given by the conditional probabilities test as described by Mehta & Patel (65). A confidence interval can be obtained by inverting the test. Maximizing f(t1 | β1) yields a conditional maximum likelihood estimate for β1, if it exists; otherwise an alternative estimate is the median unbiased estimate that is always defined.

In the stratified binary model, exact inference is carried out in an analogous fashion as discussed by Mehta & Patel (65). The approach extends the conditional likelihood method discussed above in that instead of conditioning only on the sufficient statistic for the stratum-specific parameters, the sufficient statistics for the other parameters are conditioned on as well, to obtain the conditional distribution of the sufficient statistic of the parameter of interest. Exact inference for the polychotomous logistic regression is considered by Hirji (33).

image
CLICK TO VIEW
TABLE 10

Exact logistic regression results for osteogenic sarcoma data

Example: osteogenic sarcoma

The data in Table 3 are reconsidered to assess the simultaneous relationship of the predictors with disease through the unstratified logistic regression model with covariates AOP, LI, and MALE. Because all of the diseased subjects had LI present, LI is a perfect predictor, and the usual ML logistic regression analysis cannot be used. However, it is possible to estimate βLI and provide a confidence interval and p-value using exact inference. Table 10 gives two-sided p-values, model-adjusted estimates, and confidence intervals for odds ratios for each predictor obtained using LogXact (56). Unlike the unadjusted results p-values in Table 4, the simultaneous consideration of the three predictors reveals that LI is nearly significant, but AOP and MALE are not.

Example: schizophrenia and birth complications

In the previous discussion of this example, the total number of cases with schizophrenia in each family were conditioned upon to obtain CML estimates. The p-value and confidence interval presented were based upon large sample properties of the conditional likelihood. However, exact inference may be more appropriate because there were only 29 family members. The exact p-value is 0.017 and the 95% exact confidence interval for the odds ratio is (1.02, 2.10). The results change from not significant at the 0.05 level to significant when one switches from asymptotic to exact inference.

GENERALIZED ESTIMATING EQUATIONS

The previous section described logistic regression analysis in which observations were assumed independent and parameter estimation was achieved by maximum likelihood or exact methods. In many categorical data situations, however, independence assumptions may not hold, and ignoring the intra-cluster correlation would lead to incorrect variance estimates for the regression parameters (52). This section discusses generalized estimating equations (51) that extend unconditional logistic regression and other generalized linear models (68) to the analysis of cluster sample data. We provide a brief summary of the theory of generalized linear models (GLMs) as background for a general discussion of GEE. The final sections discuss GEE methods for correlated binary and multinomial responses, respectively.

Background

GLMs are a general class of models for independent observations. A GLM has three components: (a) a random component that defines the form of the distribution for the ith observation belonging to the exponential family of distributions; this family includes, among others, the binomial, poisson, and normal distributions, corresponding to logistic, loglinear, and linear regression, respectively; (b) the systematic component given by the linear predictor, ηi=x′AiβA, for the ith observation; and (c) the link function, g(), which links the random and systematic components by the equation, ηi = g(μi). The marginal distribution of yi has mean and variance

7
equation 7
where v() is the variance function, γ is the intercept, β is a p × 1 vector of regression coefficients, and Φ is the scale parameter, either known or to be estimated. The regression coefficients, βA, are estimated by solving the maximum likelihood score equations for βA:
8
equation 8
where Di = ∂μi/∂βA. For the logistic regression model in Expression 3, μi = πi,v(μi) = μi(1 − μi), the variance of a bernoulli random variable, Φ = 1, and . Alternative generalized linear models for a binary response result from using other link functions such as the probit and complementary log-log link. Like the logit link, these have the attractive quality that all possible values of the linear predictor in (−∞, +∞) correspond to values of πi in (1). For the logit link, Equation 8 reduces to Equation 4 because ∂μi/∂ηi = μi(1−μi), and applying the chain rule from calculus gives Di = μi(1 − μi)xAi′. A book by McCullagh & Nelder (62) is considered the primary reference. Dobson (22) has a shorter introductory text. Healy (32) provides an introduction to GLIM (71), a software package for fitting GLMs.

Note that Expression 8, which in general terms is called an estimating equation because its solution gives an estimate of βA, is completely specified by the mean and variance of yi given in Expression 7. Without further distributional assumptions it would be called a quasi-score and its integral a quasi-likelihood (62, 75, 88). For binomial responses or counts, a quasi-likelihood analysis may be carried out in which Φ is estimated. Such an analysis allows for overdispersion (Φ > 1), which occurs when variation in the responses exceeds that expected under binomial or poisson sampling (50).

GEE: Examples and Theory

Like GLMs, generalized estimating equations apply to continuous responses as well as discrete responses, including counts, binomial, or multinomial responses. GEEs may be viewed as an extension of GLMs from random sampling to cluster sampling as they account for the dependence of observations within clusters. In other words, GEE applies to regression models with multivariate responses, or a cluster of responses whose elements correspond to correlated (usually positively) observations. Three general regression formulations are commonly used in public health.

1.

The response vector corresponds to repeated measures taken on subjects, and parameter estimates are obtained for the marginal means in a single regression model. In a prospective study of coronary heart disease risk factors (28), young adults were followed annually for seven years to describe and compare blood pressure patterns among educational, ethnic, and gender groups (55A). An individual defines a cluster, and his or her blood pressure at a given time is an observation.

2.

In a cross-sectional study, the response vector may correspond to a cluster of distinct subjects, such as patients in a medical practice. Then, a single regression model relates covariates to a patient's response. In a state-wide study of cancer screening in Colorado, patient records of 1798 women over 50 years of age were sampled from 132 physicians in as many primary care practices to determine the level of preventive services being provided (16, 58). Practices define clusters and patient charts are observations. The data are discussed below.

3.

When the response vector consists of different outcome variables, separate regression models may be fit to each, yet taking the correlation among responses into account through a multivariate procedure such as GEE may provide more efficient estimation of regression parameters. In the management of dizziness in primary care settings, the response vector consists of three clinical decisions: whether or not to require medication, order laboratory tests, and whether to refer for further testing (82). A cluster is comprised of a patient's 3 binary outcomes.

Some notation and general theory for GEE is presented before special consideration is given to binary and ordinal responses. Let ni equal the number of observations in the ith cluster, and let K be the total number of clusters of possibly varying size. Clusters are indexed by i and observations are indexed by T. Let be a vector of ni outcome values, and a ni × p matrix of cluster-and/or observation-level covariate values for the ith cluster. Estimates of βA are obtained by solving the generalized estimating equations

9
equation 9
where Di is now an ni × (p+1) matrix, Vi = AiRi(α)Ai is the working variance matrix of yi, and Ai = diag{v1/2(μit)} is a ni × ni diagonal matrix. Furthermore, Ri(α) is a ni × ni working correlation matrix that depends on an unknown parameter vector α. GEE estimation involves alternating between iteratively reweighted least squares estimation for βA and method of moments estimation for Φ and α. For a large number of clusters, the GEE estimate of βA has an approximate multivariate normal distribution (51). In other words, valid inference requires a sufficiently large sample size in terms of the number of clusters. This is in contrast to conditional logistic regression applied to cluster samples (with clusters conditioned as strata) where the sample size requirements are in terms of the number of observations, albeit at the price of the stronger assumption of conditional independence given cluster-specific parameters. The “robust” or “sandwich” variance estimate of is given by
10
equation 10
where βA, Φ, and α are replaced by their estimates. Diagonal estimates of Expression 10 are the variances of individual parameter estimates, and off-diagonal elements are their estimated covariances. Unlike model-based counterparts, they are robust in the sense that they are valid even if Ri(α) is mis-specified, assuming correct specification of the model for the mean (51). Like quasi-score equations, Expression 9 only depends on means and second moment parameters (variances and covariances) and does not require the specification of the response vector's full multivariate distribution. They extend quasi-likelihood by introducing additional parameters, α, to describe the nature of the intra-cluster correlation. However, the power of statistical tests for βA can be increased by carefully modeling Ri(α): (a) in repeated measures such as the blood pressure study, a first-order autoregressive correlation gives a larger correlation to observations that are closer in time as opposed to those that are temporally farther apart; (b) in the cancer screening study, an exchangeable correlation is used in which any two women in the same practice are assumed to have equal correlation; and (c) in the dizziness study there are 3 distinct correlations to estimate, one for each outcome pair. Alternatively, the odds ratio may be used as a measure of association (54).

The Analysis of Correlated Binary Data

The GEE procedure can be used to fit logistic models to binary responses from cluster samples. The mean and variance of the tth binary response in the ith cluster is given by

11
equation 11
where v(πit) is the variance of yit. The regression parameter vector, β, may address variation across both clusters and observations. Expression 11 assumes that β is the same for each outcome, although for the dizziness example we would write βt instead and obtain estimates of the coefficients for each outcome.

Example: cancer screening in primary care practices

As part of the Partners Project (16, 58), patient and physician characteristics were examined for their associations with mammography screening. Let yit = 1 if the tth patient in the ith practice received a mammogram in the past year, and 0 otherwise. A logistic regression model is fit with the covariates patient age (three categories: 50–59, 60–69, 70–74) and smoking status, physician gender, whether the physician practices family medicine or internal medicine, and whether the patient made a health maintenance visit (HMV) in the previous year. An exchangeable correlation structure is assumed where the pairwise correlation is the same for any two women in the same practice. Twenty-two women were sampled per practice, but cluster sizes vary slightly due to occasional missing data. Parameter estimates and their standard errors are given in Table 11(a); the estimate of the exchangeable correlation was .037. Patients who made a HMV had 9.8 times higher odds to get a mammogram than those who did not receive a HMV. Female physicians had 1.43 times higher odds as male physicians to order mammograms, and nonsmoking patients had 1.56 times higher odds to get a mammogram than smokers. Physician specialty and patient age were not significantly associated with the level of mammography screening after adjusting for the other predictors.

image
CLICK TO VIEW
TABLE 11

GEE results for cancer screening data

The Analysis of Correlated Ordinal Data

The GEE approach can be applied to correlated ordinal data by extending the proportional odds model in Expression 5. Modifying Expression 5, the model becomes

12
equation 12
where θitg is the odds of observing a response in the gth category or above for the tth member of the ith cluster. The estimating equations are generalizations of Expression 9. As in GEE for binary outcomes, analogous working correlation structures are possible, but Ri is more complicated because there is dependency between outcome levels of the ordinal response (within observations), as well as across observations. In general, additional computational burden results with increasing cluster sizes because Ri is commonly inverted at each iteration. An independence working correlation may be used to alleviate this concern, although some efficiency may be lost.

For correlated nominal outcomes, GEE may be applied to generalized logit models. In work sampling studies, for example, repeated observations are made on a subject who is categorized into performing one of a finite number of tasks. These tasks often form nonordered categories. Miller et al (67) apply GEE methods to a medical setting in which at any given time during a doctor/patient session the doctor might be observed with the patient, with the patient's chart, or engaged in another task. Other developments include GEE influence diagnostics (76) and second-order generalized estimating equations that model marginal means and odds ratios, simultaneously (31, 53).

Example: cancer screening in primary care practices

A second analysis concerns the level of adherence to recommendations of the National Cancer Institute for annual mammogram and clinical breast exam. Each woman is assigned a score from 0 to 2 indicating the number of exams received, with a 2 indicating that both mammogram and clinical breast exam were received in the past year. A working exchangeable correlation structure is used (55). Parameter estimates and their standard errors are given in Table 11(b). Patients who made a HMV had 23.8 times higher odds to have a higher level of cancer screening than women who did not make a HMV. The other predictors had effects similar in magnitude as when only mammogram was considered.

WEIGHTED LEAST SQUARES

Historically, marginal modeling of discrete repeated measures was investigated by Koch & Reinfurt (45) and Koch et al (44), who extended the weighted last squares (WLS) methodology of Grizzle et al (30). The WLS method fits models to a large class of response functions, including marginal proportions, marginal logits, cumulative logits, and mean scores. Although GEE methods apply to many analyses for observational units, WLS is particularly useful for fitting models to aggregate quantities such as rank measures of correlation and incidence densities, as discussed below. The procedure applies noniterative generalized least squares to the response functions of interest, using their observed covariance matrix as weights. For sufficiently large samples, the response functions have an approximate multivariate normal distribution. Tests of hypotheses can be carried out on pertinent linear combinations of the response functions. The WLS method requires that subjects be grouped into moderately large subpopulations defined by identical covariate values. Both GEE and WLS are nonlikelihood or semiparametric methods that use weighted least squares algorithms to fit models for marginal response functions. GEE accounts for the correlation structure to gain efficiency, whereas WLS may be viewed as a special case of GEE using an empirically estimated covariance matrix (66).

Rank Measures of Correlation for Ordinal Responses

In some clinical trials, patients receive a randomly allocated dosage level of a drug and their outcome status is determined at two or more visits. In studies where both the dose and the outcome (none, some, marked) are ordinal, rank measures of association may be modeled using WLS procedures to determine the extent of treatment effects and treatment by time interactions (13, 42). This approach is especially useful if there is uncertainty about the goodness of fit of the specific structure imposed by models for ordinal data such as a proportional odds assumption. Carr et al (13) describe WLS applied to Goodman-Kruskal rank correlation coefficients, sometimes called gamma coefficients, which make minimal assumptions about the true nature of the association except for ordinality of the treatment and response. Analyses are based on hypothesis tests applied to linear functions of the correlations with specifications for treatment effects and interactions of treatment and center. Other background variables, in addition to center, such as baseline measurement are easily incorporated in the analysis.

Carr et al (13) consider a clinical trial of 111 patients who had respiratory illness. In each of two centers patients were randomly assigned one of two treatments (placebo = 0, active = 1). The response variables were ordinal classifications (0 = terrible, 1 = poor, 2 = fair, 3 = good, 4 = excellent) of the health status of each patient at each of four successive visits after the initiation of treatment. The baseline status of patients was also evaluated according to the same ordinal classification. Inference is based on the gamma coefficients at each visit within each center, and their estimated covariance matrix. For clinic 1, the gammas based on 56 patients were .148, .467, .292, and .242 for visits 1, 2, 3, and 4, respectively. Similarly, the gammas for clinic 2 based on 55 patients were .421, .647, .559, and .433. The results indicate that the treatment × visit interaction is significant (p = .044) for the combined centers. The interaction is due to the stronger extent of better response for active treatment at visits 2 and 3 than visits 1 and 4. They found no treatment by center interaction.

The Modeling of Incidence Densities

Weighted least squares methods may also be used to model correlated incidence densities. Lavange et al (49) consider repeated measures that describe the effect of passive exposure to tobacco smoke on lower respiratory illness (LRI) in children. They present an analysis of 158 children exposed to passive tobacco smoke compared to 136 unexposed. Interest was in comparing the incidence rates for different groups of children. A complication in the data is that the at-risk periods for each child vary randomly. They use WLS to model incidence densities for which the covariance structure is determined by sample survey methods.

In the first stage of the WLS analysis, subgroup-specific incidence densities and their covariance matrix are estimated, the latter by a first-order Taylor series approximation. The incidence density (ID) for a particular subgroup is defined as the ratio of the number of illness episodes to the total person-time at risk. These estimates are used to compare marginal rates of LRI between the exposed and unexposed groups. The SUDAAN software package (81) was used to estimate incidence densities and test contrasts among them. In the second stage of analysis, WLS is applied using SAS PROC CATMOD to fit models to these densities in order to assess the main effects and interactions among the various risk factors. For the exposed and unexposed groups, the IDs were 1.24 and 0.65, respectively, for exposed and unexposed groups. A test of no differences between LRI for the exposed and unexposed groups had a Wald p-value of .001, indicating that the exposed group had a significantly higher rate of illness.

In another approach, they fit a poisson model using GEE to counts representing the number of episodes of illness per child at each period. This approach estimates average rates by using an offset term in the model to account for time at risk in each period. The GEE procedure has the advantage of applying to more comprehensive models than WLS, including those with continuous covariates. Alternatively, the WLS approach is attractive because direct estimates of adjusted incidence densities for risk factors of interest can be provided. Moreover, no distributional assumptions are required, only random sampling of primary sampling units (persons).

CONCLUDING REMARKS

Categorical data analysis encompasses a much broader array of techniques today than only 15 years ago due to many methodological and computational advances. While chi-square and unconditional logistic regression methods still comprise the staple diet of public health researchers, methods for ordinal response data are receiving wider use. More recently, fast algorithms for exact inference such as provided by StatXact (83) have encouraged the inferential analysis of small data sets. Analyses of complex data sets characterized by cluster samples, continuous and categorical covariates, and cluster- and observation-level covariates, are no longer restricted to naive analyses based upon aggregating information at the cluster level. While the generalized estimating equations approach has replaced weighted least squares in many analyses of correlated discrete data, the latter is still useful for a wide variety of response functions.

Another approach to modeling correlated categorical data comes from a sample survey perspective (7, 39, 49, 81). Used widely in the analysis of very large sample survey data sets, but applicable in epidemiology and other disciplines, survey regression weights observations according to the sampling design, and provides estimators like GEE. For simple random samples, the results correspond to those obtained with a working independence assumption in GEE.

Some authors propose maximum likelihood methods for correlated discrete data (5, 25, 94), which has certain advantages and disadvantages compared to GEE. In particular, maximum likelihood applies to a wider range of missing data assumptions (4, 26), although the GEE procedure has been modified to increase their scope (78). Finally, GEE is better suited for analyzing large numbers of clusters of varying size and makes fewer distributional assumptions.

In addition to marginal models, random effects and transitional models address cluster samples of discrete data in public health research (21, 52, 69, 72, 90). These three classes of models differ with respect to parameter interpretations. In random effects models, regression parameters have cluster-specific interpretations, and the surplus of parameters are reduced by assuming underlying probability distributions, usually normal distributions, specified up to a few unknown parameters (12, 17, 63, 86). In transitional models, the estimated regression parameters have interpretations that are conditional on the values of other outcomes, such as measurements at previous times, or the responses of other family members (9, 79, 91).

Bayesian statistical methods are also used in public health (24). In particular, Empirical Bayes methods have been employed in estimating disease incidence and mortality rates (14). These procedures are useful when diseases are rare, and the poisson variation is overdispersed. It allows non-zero estimation of very low rates corresponding to zero observed numbers of deaths. The Empirical Bayes estimate of the rate is a weighted average of a local rate and the global rate of all the regions. Others have described binary response models in epidemiology when an exposure variable is measured or classified with error (46, 80, 87). Categorical methods also apply to grouped survival data (20, 42), including extensions of Mantel-Haenszel methods (19, 59). Stokes et al (84) illustrate these methods as well as poisson regression for categorized time-to-event data using the SAS system.

While many of the methods discussed in this paper have only appeared in the past 15 years, the coming years will witness their wider dissemination among public health researchers. Computational advances will continue expanding the scope of exact methods, and complex data sets will more commonly be analyzed with statistical methods that incorporate various patterns of dependency in the data.

Acknowledgments

The work of the first author was partially supported by a grant from the Agency for Health Care Policy and Research (HS 06992). We would like to acknowledge the support of Stuart Cohen, Maureen McClatchey, and Brent Shelton in regard to the cancer screening data, and Michael O'Shea who provided the cerebral palsy data. We especially thank Bahjat Qaqish for many helpful discussions.

Literature Cited

  • 1. 
    Agresti A. 1990. Categorical Data Analysis. New York: Wiley & Sons
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    More AR articles citing this reference

    • The Importance of the Normality Assumption in Large Public Health Data Sets

      Thomas Lumley, Paula Diehr, Scott Emerson, and Lu ChenDepartment of Biostatistics, University of Washington, Box 357232, Seattle, Washington 98195; e-mail: [email protected]
      Annual Review of Public Health Vol. 23: 151 - 169
      • ...An ordered categorical response can be collapsed into a binary (proportional odds) model (1, 20)....
    • CATEGORICAL DATA ANALYSIS

      Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
      Annual Review of Psychology Vol. 49: 537 - 558
      • ..., and such texts as Agresti (1990, 1996), Clogg & Shihadeh (1994), Wickens (1989)....
    • An Introduction to Categorical Data Analysis

      Douglas SloaneDepartment of Sociology, Catholic University, Washington, DC 20064 S. Philip MorganDepartment of Sociology, University of Pennsylvania, 3718 Locust Walk, Philadelphia, Pennsylvania 19104-6299
      Annual Review of Sociology Vol. 22: 351 - 375
      • ...see also Hout (1984a, b, 1988), Agresti (1989, 1990), Clogg & Shihadeh (1994).] The Goodman book also contained a formal treatment of how to deal with unordered polytomies via partitioning and collapsing (see in particular Chapter 7 and Appendix A), ...
      • ...models are denoted not by marginals fitted but rather by multiplicative or additive equations (see Knoke & Burke 1980, Agresti 1990, ...
      • ...models that impose linear constraints on the row or column variables in two-way tables are referred to as row effects or column effects models (see Agresti 1990, ...
      • ... 15 For additional substantive examples and a nontechnical discussion of logit-specified models, see Duncan & Duncan (1978, Appendix A), and Agresti (1990, ...

  • 2. 
    Agresti A, Mehta CR, Patel NR. 1990. Exact inference for contingency tables with ordered categories. J. Am. Stat. Assoc. 85:453–458
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 3. 
    Agresti A. 1992. A survey of exact inference for contingency tables. Statist. Sci. 7:131–77
    • Crossref
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
    • Article Location
    • Article Location
    • Article Location
    • Article Location
    • Article Location
  • 4. 
    Baker SG, Laird NM. 1988. Regression analysis for categorical variables with outcome subject to nonignorable nonresponse. J. Am. Stat. Assoc. 83:62–69
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 5. 
    Balagtas CC, Becker MP, Lang JB. 1993. Marginal modeling of categorical data from crossover experiments. Appl. Statist. 44:63–77
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 6. 
    Bhushan V, Paneth N, Kiely JL. 1993. Impact of improved survival on very low birth weight infants on recent secular trends in the prevalence of cerebral palsy. J. Pediatrics 91:1094–100
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 7. 
    Binder DA. 1983. On the variance of asymptotically normal estimators from complex surveys. Int. Stat. Rev. 51:279–92
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Are Survey Weights Needed? A Review of Diagnostic Tests in Regression Analysis

      Kenneth A. Bollen,1,2 Paul P. Biemer,3,4 Alan F. Karr,4 Stephen Tueller,4 and Marcus E. Berzofsky41Department of Psychology and Neuroscience, University of North Carolina, Chapel Hill, North Carolina 275992Department of Sociology, University of North Carolina, Chapel Hill, North Carolina 27599; email: [email protected]3Odum Institute for Research in Social Science, University of North Carolina, Chapel Hill, North Carolina 275994RTI International, Research Triangle Park, North Carolina 27709
      Annual Review of Statistics and Its Application Vol. 3: 375 - 392
      • ...Binder 1983, Chambless & Boyle 1985, Morel 1989, Roberts et al. 1987) and the software to implement (e.g., ...
    • What Do We Do with Missing Data? Some Options for Analysis of Incomplete Data

      Trivellore E. Raghunathan Department of Biostatistics and Institute for Social Research, University of Michigan,
      Ann Arbor, Michigan 48109
      ; email: [email protected]
      Annual Review of Public Health Vol. 25: 99 - 117
      • ...A weighted logistic regression can be used to estimate the parameter of interest (2)....

  • 8. 
    Bishop YMM, Fienberg SE, Holland PW. 1975. Discrete Multivariate Analysis. Cambridge, MA: MIT Press
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • An Introduction to Categorical Data Analysis

      Douglas SloaneDepartment of Sociology, Catholic University, Washington, DC 20064 S. Philip MorganDepartment of Sociology, University of Pennsylvania, 3718 Locust Walk, Philadelphia, Pennsylvania 19104-6299
      Annual Review of Sociology Vol. 22: 351 - 375
      • ...many borrowing from and building on Goodman's work; these include important books by Bishop, Fienberg & Holland (1975), Haberman (1974, 1978/1979), Fienberg (1980)....

  • 9. 
    Bonney GE. 1986. Regressive logistic models for familial disease and other binary traits. Biometrics 42:611–25
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • An Accidental Genetic Epidemiologist

      Robert C. ElstonDepartment of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA; email: [email protected]

      Annual Review of Genomics and Human Genetics Vol. 21: 15 - 36
      • ...George Bonney introduced regressive models, a major advance in segregation analysis (6, 7)....

  • 10. 
    Bose C, Corbet A, Bose G, Garcia-Prats J, Lombardy L, et al. 1990. Improved outcome at 28 days of age for very low birthweight infants treated with a single dose of a synthetic surfactant. J. Pediatrics 117:947–53
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 11. 
    Breslow NE. 1996 Statistics in epidemiology: the case-control study. J. Am. Stat. Assoc. 91:14–28
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
  • 12. 
    Breslow NE, Clayton DG. 1993. Approximate inference in generalized linear models. J. Am. Stat. Assoc. 88:9–25
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • An Overview of Joint Modeling of Time-to-Event and Longitudinal Outcomes

      Grigorios Papageorgiou, Katya Mauff, Anirudh Tomer, and Dimitris RizopoulosDepartment of Biostatistics, Erasmus Medical Center, 3000 CA Rotterdam, The Netherlands; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 6: 223 - 240
      • ...we then postulate a generalized linear mixed model (Breslow & Clayton 1993) for the conditional expectation of yki given a vector of random effects bki: 1. where the xki(t) and zki(t) are the time-dependent design vectors for the fixed effects βk and the random effects bki, ...
    • Two-Part and Related Regression Models for Longitudinal Data

      V.T. Farewell,1 D.L. Long,2 B.D.M. Tom,1 S. Yiu,1 and L. Su11Medical Research Council Biostatistics Unit, Institute of Public Health, University of Cambridge, Cambridge CB2 0SR, United Kingdom; email: [email protected]2Department of Biostatistics, West Virginia University, Morgantown, West Virginia 26506
      Annual Review of Statistics and Its Application Vol. 4: 283 - 315
      • ...which presents the same computational challenges as with generalized linear mixed models (GLMMs) (Stiratelli et al. 1984, Breslow & Clayton 1993, Wolfinger & O'Connell 1993)...
    • Modeling Through Latent Variables

      Geert Verbeke1,2 and Geert Molenberghs1,21Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium; email: [email protected]2Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Universiteit Hasselt, B-3590 Hasselt, Belgium; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 4: 267 - 282
      • ...The resulting model is called a generalized linear mixed model (Breslow & Clayton 1993, Molenberghs & Verbeke 2005)....
    • Bayesian Computational Tools

      Christian P. RobertCEREMADE, Université Paris-Dauphine, 75775 Paris Cedex 16, France; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 1: 153 - 177
      • ...Example 2: A typical instance is made of random effects models as in the following instance (inspired from Breslow & Clayton 1993) of Poisson observations (i=1, ...
    • Assessing Change with Longitudinal and Clustered Binary Data

      John M NeuhausDepartment of Epidemiology and Biostatistics, University of California, San Francisco, California 94143-0560; e-mail: [email protected]
      Annual Review of Public Health Vol. 22: 115 - 128
      • ... extended the logistic regression model by adding random effects: 1 Breslow & Clayton (4) discuss other examples of mixed-effects generalized linear models....
      • ...several authors (4, 10) have suggested approximations to Equation 2 that one can easily maximize to obtain approximate maximum likelihood estimates of β and γ....
      • ...Potential choices for Ψ include Laplace approximations to Equation 2 (6) and the marginal quasi-likelihood approach of Breslow & Clayton (4)....
      • ...Marginal quasi-likelihood (10) and penalized quasi-likelihood (4) are two popular approaches, but both methods provide highly biased estimates of regression parameters and variance components (6, 22)...
    • Selected Statistical Issues in Group Randomized Trials

      Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
      Annual Review of Public Health Vol. 22: 167 - 187
      • ...There are three primary approaches used in GRT evaluations: (a) generalized linear mixed models (GLMM) (3)...
      • ...The most popular approximation methods use a linear approximation of h(y) by h(μ) + (y − μ)h′(μ) (e.g. h(.) is logit for binary data) and then apply traditional linear mixed model estimation methods to the linearized form (3, 36, 43, 50)....
    • Multilevel Modeling for Binary Data

      Guang GuoDepartment of Sociology, University of North Carolina, Chapel Hill, North Carolina 27599; email: [email protected] Hongxin ZhaoCenter for Children and Families, Teachers College, Columbia University, New York, NY 10027
      Annual Review of Sociology Vol. 26: 441 - 462
      • ... and penalized quasi-likelihood or PQL (Breslow & Clayton 1993) are the two prevailing approximation procedures....
      • ...which is the same as Breslow & Clayton's (1993) PQL-1 except that PL explicitly estimates the extra-dispersion parameter φ....

  • 13. 
    Carr GJ, Hafner KB, Koch GG. 1989. Analysis of rank measures of association for ordinal data from longitudinal studies. J. Am. Stat. Assoc. 84:797–804
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
  • 14. 
    Clayton DG, Kaldor J. 1987. Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics 43:671–81
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Statistical and Substantive Inferences in Public Health: Issues in the Application of Multilevel Models

      Jeffrey B. Bingenheimer1 and Stephen W. Raudenbush2 1 School of Public Health, University of Michigan,
      Ann Arbor, Michigan 48109
      ; email: [email protected] 2 and 2School of Education, Survey Research Center, and Departments of Statistics and Sociology, University of Michigan,
      Ann Arbor, Michigan 48109
      ; email: [email protected]
      Annual Review of Public Health Vol. 25: 53 - 77
      • ...; to derive age-standardized lip cancer rates for areas in Scotland (11); to map county-level, ...
    • Public Health, GIS, and Spatial Analytic Tools

      Gerard RushtonDepartment of Geography, The University of Iowa, 316 Jessup Hall, Iowa City, Iowa, 52242-1316; email: [email protected]
      Annual Review of Public Health Vol. 24: 43 - 56
      • ...The decade of the 1990s saw major developments in Empiric Bayes methods (18, 19, 50) as the preferred methodology to shrink observed rates differentially toward the mean of the distribution of rates in proportion to their expected variability based on the number of observations in the small areas....
      • ...“This technique is the standard in disease mapping” and cited Clayton & Kaldor (18) in support....
    • Multilevel Analysis in Public Health Research

      Ana V. Diez-RouxDivision of General Medicine, Columbia College of Physicians and Surgeons, and Division of Epidemiology, Joseph T. Mailman School of Public Health, Columbia University, New York, New York; e-mail: [email protected]
      Annual Review of Public Health Vol. 21: 171 - 192
      • ...to derive shrunken estimates of rates of death or diseases for small areas with few observations (7, 59) or to estimate rates of different health outcomes for individual providers (hospitals, ...

  • 15. 
    Clogg CC. 1981. New developments in latent structure analysis. In Factor Analysis and Measurement in Sociological Research, ed. DM Jackson, EF Borgatta, pp. 215–46. Beverly Hills, CA: Sage
    • Google Scholar
    Article Location
  • 16. 
    Cohen SJ, McClatchey MW, Wolfe P, Shelton BJ, Archer PG. 1996. Health maintenance visit as the determinant of cancer screening by primary care physicians. J. Gen. Int. Med. 11(Suppl. 1):121 (Abstr.)
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 17. 
    Conaway M. 1990. A random effects model for binary data. Biometrics 46:317–28
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 18. 
    Cox DR. 1958. The regression analysis of binary sequences. J. R. Stat. Soc. Ser. B 20:215–42
    • Google Scholar
    Article Location
  • 19. 
    Cox DR. 1972. Regression models and life tables. J. R. Stat. Soc. Ser. B 34:187–220
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • TIME-SERIES–CROSS-SECTION DATA: What Have We Learned in the Past Few Years?

      Nathaniel BeckDepartment of Political Science, University of California, San Diego, La Jolla, California 92093-0521; e-mail: [email protected]
      Annual Review of Political Science Vol. 4: 271 - 293
      • ...it is easy to view BTSCS dyad-year data as grouped duration data suitable for a grouped Cox (1972) proportional hazards semiparametric estimation....

  • 20. 
    Deddens JA, Koch GG. 1988. Survival analysis: grouped data. In Encyclopedia of Statistical Sciences, ed. A Kotz, NL Johnson, 9:129–34. New York: Wiley & Sons
    • Google Scholar
    Article Location
  • 21. 
    Diggle PJ, Liang KY, Zeger SL. 1994. Analysis of Longitudinal Data. Oxford: Oxford Univ. Press
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Analysis of Nonlinear Patterns of Change with Random Coefficient Models

      Robert Cudeck1 and Jeffrey R. Harring21Psychology Department, Ohio State University, Columbus, Ohio 43210; email: [email protected]2Department of Measurement, Statistics and Evaluation, University of Maryland, College Park, Maryland 20742; email: [email protected]
      Annual Review of Psychology Vol. 58: 615 - 637
      • ...about which several excellent surveys have been recently published (Diggle et al. 1994, Fitzmaurice et al. 2004, Hand & Crowder 1996, Raudenbush & Bryk 2002, Singer & Willett 2003)....
    • The Ecology of Bird Introductions

      Richard P. Duncan,1 Tim M. Blackburn,2 and Daniel Sol31Ecology and Entomology Group, Soil, Plant and Ecological Sciences Division, Lincoln University, P.O. Box 84, Canterbury, New Zealand; email: [email protected] 2School of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom; email: [email protected] 3Department of Biology, McGill University, 1205 avenue Docteur Penfield, Montréal, Québec, H3A 1B1 Canada; email: [email protected]
      Annual Review of Ecology, Evolution, and Systematics Vol. 34: 71 - 98
      • ...generalized linear mixed models and generalized estimating equations (Diggle et al. 1994, Goldstein 1995)....
    • Assessing Change with Longitudinal and Clustered Binary Data

      John M NeuhausDepartment of Epidemiology and Biostatistics, University of California, San Francisco, California 94143-0560; e-mail: [email protected]
      Annual Review of Public Health Vol. 22: 115 - 128
      • ...Useful references include Lindsey (15) and Diggle et al (7), as well as Neuhaus (19)...
    • Selected Statistical Issues in Group Randomized Trials

      Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
      Annual Review of Public Health Vol. 22: 167 - 187
      • ...summed over all groups and then divided by the total number of pairs (11)...
    • Comparing Personal Trajectories and Drawing Causal Inferences from Longitudinal Data

      Stephen W. RaudenbushSchool of Education and Institute for Social Research, University of Michigan, 610 East University Avenue, Ann Arbor, Michigan 48109; e-mail: [email protected]
      Annual Review of Psychology Vol. 52: 501 - 525
      • ...We refer the interested reader to Diggle et al (1994, Chapter 10), ...
      • ...The model is also a “mixed” model (Diggle et al 1994)....
      • ...ηti is a linear function of the πs. Diggle et al (1994) review applications of hierarchical generalized linear models....
    • Multilevel Analysis in Public Health Research

      Ana V. Diez-RouxDivision of General Medicine, Columbia College of Physicians and Surgeons, and Division of Epidemiology, Joseph T. Mailman School of Public Health, Columbia University, New York, New York; e-mail: [email protected]
      Annual Review of Public Health Vol. 21: 171 - 192
      • ...the assumption of independence of observations is violated, resulting in incorrect standard errors and inefficient estimates (17)....
      • ... have appeared in different literature under a variety of names including hierarchical linear models (4), random-effects or random-coefficient models (17, 58, 66), ...
      • ...in contrast with unit-specific (or subject-specific for cases of repeat observations on individuals over time) random-effects models (6, 17, 107)....
      • ...the coefficient expresses how the response changes as a function of covariates averaged over the random effects (17, 107)....
    • Understanding Changing Risk Factor Associations With Increasing Age in Adults

      G. A. Kaplan,1 Mary N. Haan,2 and Robert B. Wallace3 1Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan 48109; e-mail: [email protected] ; 2Department of Epidemiology and Preventive Medicine, School of Medicine, University of California, Davis, California 95616; e-mail: [email protected] ; and 3Department of Preventive and Internal Medicine, University of Iowa, Iowa City, Iowa 52242; e-mail: [email protected]
      Annual Review of Public Health Vol. 20: 89 - 108
      • ...it is only recently that sophisticated methods for analysis of such longitudinal data have emerged (12)....
    • CATEGORICAL DATA ANALYSIS

      Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
      Annual Review of Psychology Vol. 49: 537 - 558
      • ...see Agresti, 1989b;, Ashby et al, 1992;, Diggle, Liang & Zeger, 1994;, Pendergast et al, 1996)....

  • 22. 
    Dobson AJ. 1990. An Introduction to Generalized Linear Models. London: Chapman & Hall
    • Crossref
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 23. 
    Elston JM, Koch GG, Weissert WG. 1990. Regression-adjusted small area estimates of functional dependency in the noninstitutionalized American population age 65 and over. Am. J. Public Health 81:335–43
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 24. 
    Etzioni RD, Kadane JB. 1995. Bayesian statistical methods in public health and medicine. Annu. Rev. Public Health 16:23–41
    • Link
    • Web of Science ®
    • Google Scholar
  • 25. 
    Fitzmaurice GM, Laird NM. 1993. A likelihood-based method for analysing longitudinal binary responses. Biometrika 80:141–51
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 26. 
    Fitzmaurice GM, Molenberghs G, Lipsitz SR. 1995. Regression models for longitudinal binary responses with informative drop-outs. J. R. Stat. Soc. Ser. B 57:691–704
    • Google Scholar
    Article Location
  • 27. 
    Fleiss JL. 1981. Statistical Methods for Rates and Proportions. New York: Wiley & Sons
    • Google Scholar
    Article Location
  • 28. 
    Friedman GD, Cutter GR, Donahue RP, Hughes GH, Hulley SB, et al. 1988. Cardia: study design, recruitment, and some characteristics of the examined subjects. J. Clin. Epidemiol. 41:1105–16
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Psychosocial and Biobehavioral Factors and Their Interplay in Coronary Heart Disease

      Redford B. WilliamsBehavioral Medicine Research Center, Duke University Medical Center, Durham, North Carolina 27710; email: [email protected]
      Annual Review of Clinical Psychology Vol. 4: 349 - 365
      • ...The Coronary Artery Risk Development in Young Adults (CARDIA) study included 5115 black and white men and women aged 18 to 30 years in 1985 to 1986 (Friedman et al. 1988)....

  • 29. 
    Goorin AM, Perez-Atayde A, Gebhardt M, Andersen JW, Wilkinson RH, et al. 1987. Weekly high-dose methotrexate and doxorubicin for osteosarcoma. J. Clin. Oncol. 5:1178–84
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 30. 
    Grizzle JE, Starmer CF, Koch GG. 1969. Analysis of categorical data by linear models. Biometrics 25:489–504
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • CATEGORICAL DATA ANALYSIS

      Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
      Annual Review of Psychology Vol. 49: 537 - 558
      • ...The older and more established procedure fits models by least-squares and tests them with Wald tests (Grizzle, Stamer & Koch, 1969;, Koch et al 1977; proc catmod of sas)....

  • 31. 
    Heagerty SL, Zeger SL. 1996. Marginal regression models for clustered ordinal measurements, J. Am. Stat. Assoc. 91:1024–36
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 32. 
    Healy MJR. 1990. Glim: An Introduction. Oxford: Oxford Univ. Press
    • Google Scholar
    Article Location
  • 33. 
    Hirji KF. 1992. Exact distributions for polytomous data. J. Am. Stat. Assoc. 87:487–92
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 34. 
    Hirji KF, Tsiatis AA, Mehta CR. 1989. Median unbiased estimation for binary data. Am. Stat. 43:7–11
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 35. 
    Hilton JF, Mehta CR. 1993. Power and sample size calculations for exact conditional tests with ordered categorical data. Biometrics 49:609–16
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 36. 
    Hosmer DW, Lemeshow S. 1989. Applied Logistic Regression. New York: Wiley & Sons
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Risk-Adjusted Surgical Outcomes

      Jennifer Daley,1 William G. Henderson,2 and Shukri F. Khuri31Department of Medicine, Boston Veterans Administration Healthcare System, Harvard Medical School, Boston, Massachusetts e-mail: [email protected] Massachusetts General Hospital and Partners Healthcare System, Institute for Health Policy, 50 Staniford Street, Boston, Massachusetts 02114; e-mail: jdaley1 @partners.org 2Veterans Administration Cooperative Studies Program Coordinating Center, Fifth and Roosevelt Roads, Hines, Illinois 60141; e-mail: [email protected] 3Department of Surgery, Brigham and Women's Hospital Harvard Medical School, 1400 VFW Parkway, West Roxbury Massachusetts 02132; e-mail: [email protected] Boston Veterans Administration Healthcare System, 1400 VFW Parkway, West Roxbury Massachusetts 02132; e-mail: [email protected]
      Annual Review of Medicine Vol. 52: 275 - 287
      • ...The predictive validity of the models—the degree to which the logistic regression equation accurately predicts the outcome for each patient—is assessed using statistical tests of the discrimination and calibration of the models (28, 29, 30)....

  • 37. 
    Imrey PB, Koch GG, Preisser JS. 1996. The evolution of categorical data modeling: a biometric perspective. In Advances in Biometry: Fifty Years of the Biometric Society, ed. P Armitage, HA David, pp. 89–113. New York: Wiley & Sons
    • Google Scholar
    Article Location
  • 38. 
    Imrey PB, Koch GG, Stokes ME1981. Categorical data analysis: some reflections on the log linear model and logistic regression, part I. Int. Stat. Rev. 49:265–83
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 39. 
    Imrey PB, Koch GG, Stokes ME1982. Categorical data analysis: some reflections on the log linear model and logistic regression, part II. Int. Stat. Rev. 50:35–64
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 40. 
    Koch GG, Amara IA, Davis GW, Gillings DB. 1982. A review of some statistical methods for covariance analysis of categorical data. Biometrics 38:563–95
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 41. 
    Koch GG, Carr GJ, Amara IA, Stokes ME, Uryniak TJ. 1990. Categorical data analysis. In Statistical Methodology in the Pharmaceutical Sciences, ed. DA Berry, pp. 391–475. New York: Marcel Dekker
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
  • 42. 
    Koch GG, Edwards S. 1988. Clinical efficacy trials with categorical data. In Biopharmaceutical Statistics for Drug Development, ed. KE Peace, pp. 403–51. New York: Marcel Dekker
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
    • Article Location
  • 43. 
    Koch GG, Gillings DB, Stokes ME. 1980. Biostatistical implications of design, sampling and measurement to the analysis of health science data. Annu. Rev. Public Health 1:163–225
    • Link
    • Web of Science ®
    • Google Scholar
  • 44. 
    Koch GG, Landis JR, Freeman JL, Freeman DH, Lehnen RG. 1977. A general methodology for the analysis of experiments with repeated measurement of categorical data. Biometrics 33:133–58
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    More AR articles citing this reference

    • CATEGORICAL DATA ANALYSIS

      Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
      Annual Review of Psychology Vol. 49: 537 - 558
      • ...The older and more established procedure fits models by least-squares and tests them with Wald tests (Grizzle, Stamer & Koch, 1969;, Koch et al 1977; proc catmod of sas)....

  • 45. 
    Koch GG, Reinfurt DW. 1971. The analysis of categorical data from mixed models. Biometrics 27:157–73
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 46. 
    Kohlmeier L, Bellach B. 1995. Exposure assessment error and its handling in nutritional epidemiology. Annu. Rev. Public Health 16:43–59
    • Link
    • Web of Science ®
    • Google Scholar
  • 47. 
    Kuritz SJ, Landis JR, Koch GG. 1988. A general overview of Mantel-Haenszel methods: applications and recent developments. Annu. Rev. Public Health 9:123–60
    • Link
    • Web of Science ®
    • Google Scholar
  • 48. 
    Lafata JE, Koch GG, Weissert WG. 1994. Estimating activity limitation in the non institutional population: a method for small areas. Am. J. Public Health 84:1813–17
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 49. 
    Lavange LM, Keyes LL, Koch GG, Margolis PA. 1994. Application of sample survey methods for modelling ratios to incidence densitites. Stat. Med. 13:343–55
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 50. 
    Liang KY, McCullagh P. 1993 Case studies in binary dispersion. Biometrics 49:623–30
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • CATEGORICAL DATA ANALYSIS

      Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
      Annual Review of Psychology Vol. 49: 537 - 558
      • ...which can be estimated in a variety of ways (Collett, 1991;, McCullagh & Nelder, 1989; for examples, see Liang & McCullagh, 1993)....

  • 51. 
    Liang KY, Zeger SL. 1986 Longitudinal data analysis using generalized linear models. Biometrika 73:13–22
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
    • Article Location
    More AR articles citing this reference

    • Methods Based on Semiparametric Theory for Analysis in the Presence of Missing Data

      Marie DavidianDepartment of Statistics, North Carolina State University, Raleigh, North Carolina 27695, USA; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 9: 167 - 196
      • ...β) (q × 1) is an arbitrary function of X and β; Equation 3 is a generalized estimating equation (GEE) (Liang & Zeger 1986)....
      • ...leading to the well-known choice of A(X) as in Liang & Zeger (1986)....
    • Using Longitudinal Complex Survey Data

      Mary E. ThompsonDepartment of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada N2L 3G1; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 2: 305 - 320
      • ...often using a variant of the generalized estimating equation (GEE) approach described by Liang & Zeger (1986)....
    • Advances in Analysis of Longitudinal Data

      Robert D. Gibbons1,2,Donald Hedeker1,2, and Stephen DuToit1,31Center for Health Statistics, University of Illinois at Chicago, Illinois 60612; email: [email protected]2Division of Epidemiology & Biostatistics, University of Illinois at Chicago, Illinois 606123Scientific Software International, Lincolnwood, Illinois 60712
      Annual Review of Clinical Psychology Vol. 6: 79 - 107
      • ...the generalized estimating equation (GEE) models were developed (Liang & Zeger 1986...
    • Sample Size Planning for Statistical Power and Accuracy in Parameter Estimation

      Scott E. Maxwell,1 Ken Kelley,2 and Joseph R. Rausch31Department of Psychology, University of Notre Dame, Notre Dame, Indiana 46556; email: [email protected]2Inquiry Methodology Program, Indiana University, Bloomington, Indiana 47405; email: [email protected]3Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455; email: [email protected]
      Annual Review of Psychology Vol. 59: 537 - 563
      • ...Their methods extend directly to the context of generalized estimation equations (e.g., Liang & Zeger 1986), ...
    • Statistical and Substantive Inferences in Public Health: Issues in the Application of Multilevel Models

      Jeffrey B. Bingenheimer1 and Stephen W. Raudenbush2 1 School of Public Health, University of Michigan,
      Ann Arbor, Michigan 48109
      ; email: [email protected] 2 and 2School of Education, Survey Research Center, and Departments of Statistics and Sociology, University of Michigan,
      Ann Arbor, Michigan 48109
      ; email: [email protected]
      Annual Review of Public Health Vol. 25: 53 - 77
      • ...Designs of this sort can be used to estimate individual growth curves and can also incorporate time-varying covariates (79, 85)....
    • TIME-SERIES–CROSS-SECTION DATA: What Have We Learned in the Past Few Years?

      Nathaniel BeckDepartment of Political Science, University of California, San Diego, La Jolla, California 92093-0521; e-mail: [email protected]
      Annual Review of Political Science Vol. 4: 271 - 293
      • ...the currently popular “general estimating equation” approach of Liang & Zeger (1986) is known to have good properties only as N becomes large....
      • ...such as Liang & Zeger's (1986) “general estimating equation,” may possibly work with BTSCS data, ...
    • Assessing Change with Longitudinal and Clustered Binary Data

      John M NeuhausDepartment of Epidemiology and Biostatistics, University of California, San Francisco, California 94143-0560; e-mail: [email protected]
      Annual Review of Public Health Vol. 22: 115 - 128
      • ...the popularity of such population-averaged approaches as generalized estimating equations (14) and the wide availability of software to fit them has led many investigators to ignore the fact that these approaches do not assess the within-subject changes that are typically of scientific interest in longitudinal studies....
    • Selected Statistical Issues in Group Randomized Trials

      Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
      Annual Review of Public Health Vol. 22: 167 - 187
      • ...There are three primary approaches used in GRT evaluations: (a) generalized linear mixed models (GLMM) (3); (b) generalized estimating equations (GEE) (30), ...
      • ...Sβ has optimality similar to quasi-score and score equations when K is large with only the minimal assumptions that the mean and covariance functions are correctly specified (30)....
      • ... or by the total number of pairs minus the number of parameters in the model (30)....

  • 52. 
    Liang KY, Zeger SL. 1993 Regression analysis for correlated data. Annu. Rev. Public Health 14:43–68
    • Link
    • Web of Science ®
    • Google Scholar
  • 53. 
    Liang KY, Zeger SL, Qaqish BF. 1992. Multivariate regression models for categorical data. J. R. Statist. Soc. Ser. B 54:3–40
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 54. 
    Lipsitz SR, Laird NM, Harrington DP. 1991. Generalized estimating equations for correlated binary data: using the odds ratio as a measure of association. Biometrika 78:153–60
    • Crossref
    • Web of Science ®
    • Google Scholar
    Article Location
  • 55. 
    Lipsitz SR, Kim K, Zhao L. 1994. Analysis of repeated categorical data using generalized estimating equations. Stat. Med. 13:1149–63
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 55A. 
    Liu KL, Ruth KJ, Flack JM, Jones-Webb R, Burke G, et al. 1996. Blood pressure in young blacks and whites: relevance of obesity and lifestyle factors in determining differences. Circulation 93:60–66
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • RACE AND ETHNICITY IN PUBLIC HEALTH RESEARCH: Models to Explain Health Disparities

      William W. Dressler,1 Kathryn S. Oths,1 and Clarence C. Gravlee21Department of Anthropology, The University of Alabama, Tuscaloosa, Alabama 35487; email: [email protected], [email protected]2Department of Anthropology, Florida State University, Tallahassee, Florida 32306; email: [email protected]
      Annual Review of Anthropology Vol. 34: 231 - 252
      • ...Liu et al. (1996) reported that black-white mean blood pressure differences at a 7-year follow-up could be reduced by 40%–50% through the inclusion of body mass index, ...
      • ...such as those behaviors reviewed in the previous section (Liu et al. 1996, Stamler et al. 2003)....

  • 56. 
    LogXact. 1992. Software for Exact Logistic Regression. Cambridge, MA: Cytel Software
    • Google Scholar
    Article Location
  • 57. 
    Mack TM, Pike MC, Henderson BE, Pfeffer RI, Gerkins VR, et al. 1976. Estrogens and endometrial cancer in a retirement community. N. Engl. J. Med. 294:1262–67
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
  • 58. 
    Main DS, Cohen SJ, DiClemente CC. 1995. Measuring physician readiness to change in cancer screening: preliminary results. Am. J. Prev. Med. 11:54–58
    • Crossref
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 59. 
    Mantel N. 1966. Evaluation of survival data and two new rank order statistics arising in its consideration. Cancer Chemo. Rep. 50:163–70
    • Medline
    • Google Scholar
    Article Location
  • 60. 
    Mantel N, Haenszel W. 1959. Statistical aspects of the analysis of data from retrospective studies of disease. J. Natl. Cancer Inst. 22:719–48
    • Medline
    • Web of Science ®
    • Google Scholar
    Article Location
    More AR articles citing this reference

    • Statistical Applications in Educational Measurement

      Hua-Hua Chang,1 Chun Wang,2 and Susu Zhang31College of Education, Purdue University, West Lafayette, Indiana 47907, USA; email: [email protected]2College of Education, University of Washington, Seattle, Washington 98195, USA; email: [email protected]3Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61820, USA; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 8: 439 - 461
      • ...The first widely used DIF detection method is the Mantel–Haenszel (MH) method (Mantel & Haenszel 1959), ...
    • Evaluation of Causal Effects and Local Structure Learning of Causal Networks

      Zhi Geng,1 Yue Liu,1 Chunchen Liu,2 and Wang Miao3 1School of Mathematical Sciences, Peking University, Beijing 100871, China; email: [email protected], [email protected]2Department of Data Mining, NEC Laboratories China, Beijing 100600, China; email: [email protected]3Guanghua School of Management, Peking University, Beijing 100871, China; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 6: 103 - 124
      • ...There is a debate about whether or not adjustment for a nonconfounder can improve the efficiency of causal inference. Mantel & Haenszel (1959), Gail (1986), Mantel (1989), Wickramaratne & Holford (1989), ...
    • Has Epidemiology Become Infatuated With Methods? A Historical Perspective on the Place of Methods During the Classical (1945–1965) Phase of Epidemiology

      Alfredo Morabia1,21Barry Commoner Center for Health and the Environment, Queens College, City University of New York, New York, NY 11367; email: [email protected]2Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY 10032
      Annual Review of Public Health Vol. 36: 69 - 88
      • ...Mantel & Haenszel's 30-page review of the role and limitations of “retrospective studies” and their relationship to “forward-type” studies reads like a methods paper on how to conduct case-control studies (73)....
      • ...which comprised a discussion of types of controls and dual controls (73)....
      • ...Mantel & Haenszel published their successful estimate of the adjusted odds ratio (73)....
      • ...and how one could use the specificity of the association to check for interviewer bias (73)....
      • ...in 1959 Mantel & Haenszel discussed the importance of consistency of the association (73)....

  • 61. 
    McCullagh P. 1980. Regression models for ordinal data (with discussion). J. R. Stat. Soc. Ser. B 42:109–42
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
  • 62. 
    McCullagh P, Nelder JA. 1989. Generalized Linear Models. London: Chapman & Hall. 2nd ed.
    • Crossref
    • Google Scholar
    Article Locations:
    • Article Location
    • Article Location
    • Article Location
    More AR articles citing this reference

    • Genetic Influences on Disease Subtypes

      Andy Dahl1,2,3 and Noah Zaitlen2,31Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois 60637, USA; email: [email protected]2Department of Neurology, University of California, Los Angeles, California 90024, USA; email: [email protected]3Department of Computational Medicine, University of California, Los Angeles, California 90095, USA
      Annual Review of Genomics and Human Genetics Vol. 21: 413 - 435
      • ...where samples receive the disease if the liability exceeds some threshold (29, 36, 86)....
      • ...MLR is a classical GLM (86) that has previously been proposed for testing genetic heterogeneity in disease subtypes (18, 91)...
    • Statistical Foundations for Model-Based Adjustments

      Sander Greenland1 and Neil Pearce2,31Department of Epidemiology and Department of Statistics, University of California, Los Angeles, California 90095-1772; email: [email protected]2Departments of Medical Statistics and Non-Communicable Disease Epidemiology, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London WC1E 7HT, United Kingdom; email: [email protected]shtm.ac.uk3Centre for Public Health Research, Massey University, Wellington 6140, New Zealand
      Annual Review of Public Health Vol. 36: 89 - 108
      • ...Recentering and rescaling are especially important for coefficient interpretability when examining effects of exposure combinations (interactions) (61, 100)...
      • ...models containing a product term should also contain the factors in the product as main effects (the hierarchy principle) (61, 100)....
    • Behavioral Assays for Studies of Host Plant Choice and Adaptation in Herbivorous Insects

      Lisa M. Knolhoff1 and David G. Heckel2,1Genective, c/o AgReliant Genetics, Champaign, Illinois 61801; email: [email protected]2Max Planck Institute for Chemical Ecology, Jena 07745, Germany; email: [email protected]
      Annual Review of Entomology Vol. 59: 263 - 278
      • ...The generalized linear model (55) is well-suited to count data and obviates the need for less satisfactory approaches....
    • High-Dimensional Statistics with a View Toward Applications in Biology

      Peter Bühlmann, Markus Kalisch, and Lukas MeierSeminar for Statistics, ETH Zürich, CH-8092 Zürich, Switzerland; email: [email protected], [email protected], [email protected]
      Annual Review of Statistics and Its Application Vol. 1: 255 - 278
      • ...Generalized linear models (McCullagh & Nelder 1989) are very popular for extending the linear model in a unified way....
      • ...if the (conditional) distribution of Y|X = x is from a subclass of the exponential family model (see McCullagh & Nelder 1989, ...
      • ...replace the t-test in step 3 of Algorithm 1 with the log-likelihood ratio test (McCullagh & Nelder 1989), ...
    • Event History Analysis

      Niels KeidingDepartment of Biostatistics, University of Copenhagen, Copenhagen DK-1014, Denmark; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 1: 333 - 360
      • ...the statistical model is essentially a generalized linear model in the sense of McCullagh & Nelder (1989), ...
      • ...One is based on a postulation of an overdispersion parameter to capture at least some of the unaccounted excess variation (e.g., Breslow 1984, 1990; McCullagh & Nelder 1989, ...
    • Probabilistic Forecasting

      Tilmann Gneiting1 and Matthias Katzfuss21Institut für Angewandte Mathematik, Universität Heidelberg, 69120 Heidelberg, Germany; email: [email protected]2Department of Statistics, Texas A&M University, College Station, TX 77843; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 1: 125 - 151
      • ...generalized linear models (McCullagh & Nelder 1989) relate the mean of a certain specified distribution to a linear function in the explanatory variables via a suitably chosen link function....
    • Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models

      David M. BleiComputer Science Department, Princeton University, Princeton, New Jersey 08540; email: [email protected]
      Annual Review of Statistics and Its Application Vol. 1: 203 - 232
      • ...such as generalized linear models (Nelder & Wedderburn 1972, McCullagh & Nelder 1989), ...
    • Using Geographic Information Systems and Decision Support Systems for the Prediction, Prevention, and Control of Vector-Borne Diseases

      Lars Eisen1 and Rebecca J. Eisen21Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, Colorado 80523; email: [email protected]2Division of Vector-Borne Infectious Diseases, Coordinating Center for Infectious Diseases, Centers for Disease Control and Prevention, Fort Collins, Colorado 80522; email: [email protected]
      Annual Review of Entomology Vol. 56: 41 - 61
      • ...and the statistical methods have been described in detail elsewhere (34, 35, 44, 52, 58, 73, 82, 84, 89, 95, 115)....
    • Nonparametric Methods for Modeling Nonlinearity in Regression Analysis

      Robert AndersenDepartment of Sociology, University of Toronto, Toronto, Ontario M5S 2J4, Canada; email: [email protected]
      Annual Review of Sociology Vol. 35: 67 - 85
      • ...I start with a brief description of the GLM (for a more extensive treatment, see McCullagh & Nelder 1989)....
      • ...Maximum likelihood estimates for GLMs can be obtained using the Newton-Raphson method or iteratively reweighted least squares (IRLS) (Nelder & Wedderburn 1972, McCullagh & Nelder 1989)....
    • Forecasting Methods in Crime and Justice

      Richard BerkDepartment of Statistics, Department of Criminology, University of Pennsylvania, Philadelphia, Pennsylvania 19104; email: [email protected]
      Annual Review of Law and Social Science Vol. 4: 219 - 238
      • ...The classic work on the generalized linear model is by McCullagh & Nelder (1989)....
    • Methods for Improving Regression Analysis for Skewed Continuous or Counted Responses

      Abdelmonem A. Afifi,1 Jenny B. Kotlerman,2 Susan L. Ettner,1,3 and Marie Cowan21School of Public Health, 2School of Nursing, 3School of Medicine, University of California, Los Angeles, California 90095-1772; email: [email protected], [email protected], [email protected], [email protected]
      Annual Review of Public Health Vol. 28: 95 - 111
      • ...Such programs rely on the concept of generalized linear models (GLM) [Dobson (16), McCullagh & Nelder (43)]....
    • On Time Series Analysis of Public Health and Biomedical Data

      Scott L. Zeger, Rafael Irizarry, and Roger D. PengDepartment of Biostatistics, The Johns Hopkins University Bloomberg School of Public Health, Baltimore, Maryland 21205; email: [email protected], [email protected], [email protected]
      Annual Review of Public Health Vol. 27: 57 - 79
      • ...generalized linear models (GLMs; 31) have unified regression analysis for binary, ...
      • ...We have already mentioned the generalized linear model (31) extensions of autoregressive models for binary, ...
    • Time-Series Studies of Particulate Matter

      Michelle L. Bell,1 Jonathan M. Samet,1 and Francesca Dominici2 1 Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health,
      Baltimore, Maryland 21205
      ; email: [email protected], [email protected] 2 and 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health,
      Baltimore, Maryland 21205
      ; email: [email protected]
      Annual Review of Public Health Vol. 25: 247 - 280
      • ...Currently used statistical approaches for time-series analysis are generalized linear models (GLM) with parametric splines (101)...
    • Assessing Change with Longitudinal and Clustered Binary Data

      John M NeuhausDepartment of Epidemiology and Biostatistics, University of California, San Francisco, California 94143-0560; e-mail: [email protected]
      Annual Review of Public Health Vol. 22: 115 - 128
      • ...Many of these approaches are extensions of generalized linear models (17) to accommodate intracluster correlation....
    • Selected Statistical Issues in Group Randomized Trials

      Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
      Annual Review of Public Health Vol. 22: 167 - 187
      • Comparing Personal Trajectories and Drawing Causal Inferences from Longitudinal Data

        Stephen W. RaudenbushSchool of Education and Institute for Social Research, University of Michigan, 610 East University Avenue, Ann Arbor, Michigan 48109; e-mail: [email protected]
        Annual Review of Psychology Vol. 52: 501 - 525
        • ...The generalized linear model of McCullagh & Nelder (1989) provides a far more general class of level-1 models....
        • ...In the language of the generalized linear model (McCullagh & Nelder 1989), ...
      • CATEGORICAL DATA ANALYSIS

        Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
        Annual Review of Psychology Vol. 49: 537 - 558
        • ...This approach is most strongly represented by the generalized linear models (McCullagh & Nelder, 1989)....
        • ...which can be estimated in a variety of ways (Collett, 1991;, McCullagh & Nelder, 1989; for examples, ...

    • 63. 
      McCulloch CE. 1994. Maximum likelihood variance components estimation for binary data. J. Am. Stat. Assoc. 89:330–35
      • Crossref
      • Web of Science ®
      • Google Scholar
      Article Location
      More AR articles citing this reference

      • Selected Statistical Issues in Group Randomized Trials

        Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
        Annual Review of Public Health Vol. 22: 167 - 187
        • ...and such new procedures for analyzing correlated data as Markov Chain Monte Carlo (MCMC) (33, 34)....

    • 64. 
      McNemar Q. 1947. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:153–57
      • Crossref
      • Medline
      • Web of Science ®
      • Google Scholar
      Article Location
    • 65. 
      Mehta CR, Patel NR. 1995. Exact logistic regression: theory and examples. Stat. Med. 14:2143–60
      • Crossref
      • Medline
      • Web of Science ®
      • Google Scholar
      Article Locations:
      • Article Location
      • Article Location
      • Article Location
      • Article Location
      • Article Location
    • 66. 
      Miller ME, Davis CS, Landis JR. 1993. The analysis of longitudinal polytomous data: generalized estimating equations and connections with weighted least squares. Biometrics 49:1033–44
      • Crossref
      • Medline
      • Web of Science ®
      • Google Scholar
      Article Location
    • 67. 
      Miller ME, James MK, Langefeld CD, Espeland MA, Freedman JA. 1996. Some techniques for the analysis of work sampling data. Stat. Med. 15:607–18
      • Crossref
      • Medline
      • Web of Science ®
      • Google Scholar
      Article Location
    • 68. 
      Nelder JA, Wedderburn RWM. 1972. Generalized linear models. J. R. Stat. Soc. Ser. A 135:370–84
      • Crossref
      • Google Scholar
      Article Locations:
      • Article Location
      • Article Location
      More AR articles citing this reference

      • RNA Sequencing Data: Hitchhiker's Guide to Expression Analysis

        Koen Van den Berge,1, Katharina M. Hembach,2, Charlotte Soneson,2,3, Simone Tiberi,2, Lieven Clement,1, Michael I. Love,4, Rob Patro,5, and Mark D. Robinson2,1Bioinformatics Institute Ghent and Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000 Ghent, Belgium2Institute of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland; email: [email protected]3Current Affiliation: Friedrich Miescher Institute for Biomedical Research and SIB Swiss Institute of Bioinformatics, 4058 Basel, Switzerland4Department of Biostatistics and Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27514, USA5Department of Computer Science, Stony Brook University, Stony Brook, New York 11794, USA
        Annual Review of Biomedical Data Science Vol. 2: 139 - 173
        • ...it is known that LRTs have better properties (e.g., invariance to transformation) than Wald tests in GLMs (143); however, ...
      • One Hundred Years of Statistical Developments in Animal Breeding

        Daniel Gianola1–5 and Guilherme J.M. Rosa1,21Department of Animal Sciences,2Department of Biostatistics and Medical Informatics, and3Department of Dairy Science, University of Wisconsin-Madison, Madison, Wisconsin 53706; email: [email protected], [email protected]4Institute of Advanced Studies, Technical University of Munich, 85748 Garching, Germany5Chair of Plant Breeding at TUM-Weihenstephan, 85354 Freising, Germany
        Annual Review of Animal Biosciences Vol. 3: 19 - 56
        • ...Although ML estimation for parameter estimation already had been embedded in the framework of generalized linear models (154), ...
      • Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models

        David M. BleiComputer Science Department, Princeton University, Princeton, New Jersey 08540; email: [email protected]
        Annual Review of Statistics and Its Application Vol. 1: 203 - 232
        • ...such as generalized linear models (Nelder & Wedderburn 1972, McCullagh & Nelder 1989), ...
      • Nonparametric Methods for Modeling Nonlinearity in Regression Analysis

        Robert AndersenDepartment of Sociology, University of Toronto, Toronto, Ontario M5S 2J4, Canada; email: [email protected]
        Annual Review of Sociology Vol. 35: 67 - 85
        • ...Maximum likelihood estimates for GLMs can be obtained using the Newton-Raphson method or iteratively reweighted least squares (IRLS) (Nelder & Wedderburn 1972, McCullagh & Nelder 1989)....
      • Structural Equation Modeling: Strengths, Limitations, and Misconceptions

        Andrew J. Tomarken1 and Niels G. Waller2Department of 1Psychology and 2Psychology and Human Development, Vanderbilt University, Nashville, Tennessee 37203; email: [email protected], [email protected]
        Annual Review of Clinical Psychology Vol. 1: 31 - 65
        • ...which represent a liberalization of the classical ordinary least-squares linear model to allow for nonlinear functional forms and nonnormal response distributions (e.g., Hardin & Hilbe 2001, Nelder & Wedderburn 1972)....

    • 69. 
      Neuhaus JA, Kalbfleisch JD, Hauck WW. 1991. A comparison of cluster-specific and population-averaged approaches for analyzing correlated binary data. Int. Stat. Rev. 59:25–35
      • Crossref
      • Web of Science ®
      • Google Scholar
      Article Location
      More AR articles citing this reference

      • Interpreting and Understanding Logits, Probits, and Other Nonlinear Probability Models

        Richard Breen,1 Kristian Bernt Karlson,2 and Anders Holm31Nuffield College and Department of Sociology, University of Oxford, OX1 1NF, Oxford, United Kingdom; email: [email protected]2Department of Sociology, University of Copenhagen, DK-1353 Copenhagen, Denmark3Department of Sociology, University of Western Ontario, London, Ontario N6A 5C2, Canada
        Annual Review of Sociology Vol. 44: 39 - 54
        • ...; Yatchew & Griliches 1985; Gail 1986; Hauck et al. 1991; Neuhaus et al. 1991...
        • ...that is, random effects or multilevel models (Neuhaus et al. 1991...
        • ...Derivations based on less restrictive assumption yield similar results (see Neuhaus et al. 1991)....
      • Advances in Analysis of Longitudinal Data

        Robert D. Gibbons1,2,Donald Hedeker1,2, and Stephen DuToit1,31Center for Health Statistics, University of Illinois at Chicago, Illinois 60612; email: [email protected]2Division of Epidemiology & Biostatistics, University of Illinois at Chicago, Illinois 606123Scientific Software International, Lincolnwood, Illinois 60712
        Annual Review of Clinical Psychology Vol. 6: 79 - 107
        • ...like the fixed-effects and GEE models (Neuhaus 1991, Zeger et al. 1988)....
      • Confounding in Health Research

        Sander Greenland1,2 and Hal Morgenstern11Department of Epidemiology, Los Angeles School of Public Health, University of California, Los Angeles, California 90095-1772; e-mail: [email protected] 2Department of Statistics, Los Angeles College of Letters and Science, University of California, Los Angeles, California 90095
        Annual Review of Public Health Vol. 22: 189 - 212
        • Selected Statistical Issues in Group Randomized Trials

          Ziding Feng,1 Paula Diehr,1,2 Arthur Peterson,1 and Dale McLerran11Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave.N. MP-702, Seattle, Washington 98109-1024; e-mail: [email protected] ,[email protected] ,[email protected] [email protected] ,2Department of Biostatistics, University of Washington, Seattle, Washington 98195 [email protected] ,
          Annual Review of Public Health Vol. 22: 167 - 187
          • ...The marginal model is the direct analog of linear regression and generalized linear models (GLIM) (38), ...
          • ...it turns out that the parameter β in a marginal model is smaller than β* in a conditional model (38) (, ...

      • 70. 
        O'Shea TM, Klinepeter KL, de Regnier RA, Dillard RG, Goldstein D, et al. 1993. Decreased risk of cerebral palsy (CP) with increasing use of surfactant therapy. Pediatric Res. (Part 2) 269A (Abstr.)
        • Google Scholar
        Article Location
      • 71. 
        Payne CD. 1986. The GLIM Manual. Release 3.77. Oxford:NAG
        • Google Scholar
        Article Location
      • 72. 
        Pendergast JF, Gange SJ, Newton MA, Lindstrom MJ, Palta M, et al. 1996. A survey of methods for analyzing clustered binary response data. Int. Stat. Rev. 64:89–118
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
        More AR articles citing this reference

        • Assessing Change with Longitudinal and Clustered Binary Data

          John M NeuhausDepartment of Epidemiology and Biostatistics, University of California, San Francisco, California 94143-0560; e-mail: [email protected]
          Annual Review of Public Health Vol. 22: 115 - 128
          • ... and Pendergast et al (25) for reviews on methods for clustered binary data....
        • CATEGORICAL DATA ANALYSIS

          Thomas D. WickensDepartment of Psychology, University of California, Los Angeles, California 90095; e-mail: [email protected]
          Annual Review of Psychology Vol. 49: 537 - 558
          • ...see Agresti, 1989b;, Ashby et al, 1992;, Diggle, Liang & Zeger, 1994;, Pendergast et al, 1996)....

      • 73. 
        Prentice RL, Farewell VT. 1986. Relative risk and odds ratio regression. Annu. Rev. Public Health 7:35–58
        • Link
        • Web of Science ®
        • Google Scholar
      • 74. 
        Peterson B, Harrell FE. 1990. Partial proportional odds models for ordinal response variables. Appl. Stat. 39:205–17
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 75. 
        Preisser JS, Koch GG, Shockley WW. 1993. Statistical considerations for cross-sectional data relating to tracheal reconstruction over time. J. Biopharm. Stat. 3:167–83
        • Crossref
        • Medline
        • Google Scholar
        Article Location
      • 76. 
        Preisser JS, Qaqish BF. 1996. Deletion diagnostics for generalised estimating equations. Biometrika 83:551–62
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 77. 
        Quade D. 1967. Rank analysis of covariance. J. Am. Stat. Assoc. 62:1187–200
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 78. 
        Robins JM, Rotnitzky A, Zhao LP. 1995. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. J. Am. Stat. Assoc. 90:106–21
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
        More AR articles citing this reference

        • Methods Based on Semiparametric Theory for Analysis in the Presence of Missing Data

          Marie DavidianDepartment of Statistics, North Carolina State University, Raleigh, North Carolina 27695, USA; email: [email protected]
          Annual Review of Statistics and Its Application Vol. 9: 167 - 196
          • ...and colleagues, including Robins & Rotnitzky (1992, 1995) and Robins et al. (1995)...

      • 79. 
        Rosner B. 1984. Multivariate methods in ophthalmology with applications to other paried-data situations. Biometrics 40:1025–35
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Location
      • 80. 
        Satten G, Kupper L. 1993. Inferences about exposure-disease associations using probability-of-exposure information. J. Am. Stat. Assoc. 88:200–8
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 81. 
        Shah BV, Barnwell BG, Hunt PN, Lavange LM. 1992. SUDAAN User's Manual Release 6.0. Research Triangle Park, NC
        • Google Scholar
        Article Locations:
        • Article Location
        • Article Location
      • 82. 
        Sloane PD, Dallara J, Roach C, Bailey KE, Mitchell M, et al. 1994. Management of dizziness in primary care. J. Am. Board Fam. Pract. 7:1–8
        • Medline
        • Google Scholar
        Article Location
      • 83. 
        StatXact. 1996. StatXact: Statistical Software for Exact Nonparametric Inference, Version 3. Cambridge, MA: Cytel Software
        • Google Scholar
        Article Locations:
        • Article Location
        • Article Location
        • Article Location
        • Article Location
        • Article Location
      • 84. 
        Stokes ME, Davis CS, Koch GG. 1995. Categorical Data Analysis Using the SAS System. Cary, NC: SAS Institute
        • Google Scholar
        Article Locations:
        • Article Location
        • Article Location
        • Article Location
        • Article Location
        • Article Location
      • 85. 
        Tarone RE. 1985. On heterogeneity tests based on efficient scores. Biometrika 72:91–95
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 86. 
        Ten Have TR. 1996. A mixed effects model for multivariate ordinal response data including correlated discrete failure times with ordinal responses. Biometrics 52:473–91
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Location
      • 87. 
        Thomas D, Stram D, Dwyer J. 1993. Exposure measurement error influence on exposure-disease relationships and methods for correction. Annu. Rev. Public Health 14:69–93
        • Link
        • Web of Science ®
        • Google Scholar
      • 88. 
        Wedderburn RWM. 1974. Quasi-likelihood functions, generalized linear models and the Gauss-Newton method. Biometrika 61:439–47
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
        More AR articles citing this reference

        • A Review of Empirical Likelihood

          Nicole A. LazarDepartment of Statistics, Pennsylvania State University, University Park, Pennsylvania 16802, USA; email: [email protected]
          Annual Review of Statistics and Its Application Vol. 8: 329 - 344
          • ...Examples include Cox's partial likelihood (Cox 1972), quasi likelihood (Wedderburn 1974), and pseudo likelihood (Besag 1975)...

      • 89. 
        Whitehead J. 1993. Sample size calculations for ordered categorical data. Stat. Med. 12:2257–71
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Location
      • 90. 
        Zeger SL, Liang KY. 1992. An overview of methods for the analysis of longitudinal data. Stat. Med. 11:1825–39
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Locations:
        • Article Location
        • Article Location
      • 91. 
        Zeger SL, Qaqish BF. 1988. Markov regression models for time-series: a quasi-likelihood approach. Biometrics 44:1019–31
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Location
      • 92. 
        Zeger SL, Liang KY. 1986. Longitudinal data analysis for discrete and continuous outcomes. Biometrics 42:121–30
        • Crossref
        • Medline
        • Web of Science ®
        • Google Scholar
        Article Location
        More AR articles citing this reference

        • Advances in Analysis of Longitudinal Data

          Robert D. Gibbons1,2,Donald Hedeker1,2, and Stephen DuToit1,31Center for Health Statistics, University of Illinois at Chicago, Illinois 60612; email: [email protected]2Division of Epidemiology & Biostatistics, University of Illinois at Chicago, Illinois 606123Scientific Software International, Lincolnwood, Illinois 60712
          Annual Review of Clinical Psychology Vol. 6: 79 - 107
          • ...the most widely used include mixed-effects regression models (Laird & Ware 1982) and generalized estimating equation (GEE) models (Zeger & Liang 1986)....
          • ...the generalized estimating equation (GEE) models were developed (Liang & Zeger 1986 and Zeger & Liang 1986)....

      • 93. 
        Zelen M. 1971. The analysis of several 2 2 contingency tables. Biometrika 58:129–37
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location
      • 94. 
        Zhao LP, Prentice RL. 1990. Correlated binary regression using a quadratic exponential model. Biometrika 77:642–48
        • Crossref
        • Web of Science ®
        • Google Scholar
        Article Location

      More AR articles citing this reference

      Equation(s):

      1

      Equation(s):

      2

      Equation(s):

      3

      Equation(s):

      4

      Equation(s):

      Equation(s):

      5

      Equation(s):

      6

      Equation(s):

      7

      Equation(s):

      8

      Equation(s):

      9

      Equation(s):

      10

      Equation(s):

      11

      Equation(s):

      12
      • Tables
      • TABLE 1  -Statistical methods for categorical data analysis in public health
      • TABLE 2  -Observed contingency table for stratum h
      • TABLE 3  -Osteogenic sarcoma data*
      • TABLE 4  -P-values and 95% confidence intervals for osteogenic sarcoma data
      • TABLE 5  -Outcomes in very low birthweight infants
      • TABLE 6  -Number of infants (out of those who survived) with cerebral palsy
      • TABLE 7  -Results for relationship of pre/post 1990 with cerebral palsy
      • TABLE 8  -Results of proportional odds model for outcome in infants
      • TABLE 9  -Schizophrenia and birth complications data*
      • TABLE 10  -Exact logistic regression results for osteogenic sarcoma data
      • TABLE 11  -GEE results for cancer screening data
      • Tables
      • Tables

      TABLE 1  Statistical methods for categorical data analysis in public health

      Data structureSmall samplesLarge samples
      Contingency tablesFisher's exact test andChi square and
        extensions Mantel-Haenszel methods
      Case record andExact logistic regressionLogistic regression,
       simple random  proportional odds models and
       samples  extensions of logistic regression
      ClusterExact logistic regressionGeneralized estimating equations
       samples  for stratified models  and weighted least squares

      TABLE 2  Observed contingency table for stratum h

       Response levels  
      Factor levels12 rTotal
      1nh11nh12…nh1rNh1.
      2nh21nh22…nh2rNh2.
      .……………
      snhs1nhs2…nhsrNhs.
      TotalNh.1Nh.2 Nh.rNh

      TABLE 3  Osteogenic sarcoma data*

         DIS
      AOPLIMALENY
      NNN30
      NNY40
      NYN50
      NYY54
      YNN20
      YNY10
      YYN32
      YYY611

      *Reproduced from Mehta & Patel (65).

      TABLE 4  P-values and 95% confidence intervals for osteogenic sarcoma data

       p-Value95% CI for odds ratio
      PredictorExactAsymptoticExactAsymptotic
      AOP.032.022(1.19, 19.33)(1.20, 17.63)
      LI.0075.0042(2.99, ∞)Undefined
      MALE.026.021(1.24, 44.02)(1.17, 31.64)

      TABLE 5  Outcomes in very low birthweight infants

       MalesFemales
      YearHCDHCD
      1988–8917512926
      1989–90154102018
      1990–91161151907
      1991–9214252701

      TABLE 6  Number of infants (out of those who survived) with cerebral palsy

       Year
       1988–891989–901990–911991–92
      Males5/224/191/172/16
      Females2/111/210/190/27

      TABLE 7  Results for relationship of pre/post 1990 with cerebral palsy

       Gender adjustedUnadjusted
       ExactAsymptoticExactAsymptotic
      Test of independence.026.022.013.009
      Odds ratio estimate4.264.184.944.98
      95% Confidence Interval(1.20, 19.83)(1.12, 15.61)(1.26, 28.46)(1.41, 22.7)

      TABLE 8  Results of proportional odds model for outcome in infants

       Parameter estimateStandard errorOdds ratio
      Intercept 12.090.49—
      Intercept 22.560.50—
      Pre900.950.242.57
      Male0.950.242.57
      BW (100 g)−0.460.0550.63

      TABLE 9  Schizophrenia and birth complications data*

      FamilyBCNumber of siblingsFamilyBCNumber of siblings
      IDindexSHNMIDindexSHNM
      115103101
      17014210
      16014004
      15015601
      13025310
      12035010
      10016301
      22106012
      20017610
      39107201
      3201 

      *Data reproduced from Mehta & Patel (65).

      TABLE 10  Exact logistic regression results for osteogenic sarcoma data

      Parameterp-valueOdds ratio95% CI
      AOP.153.18(0.60, 20)
      LI.0616.59(0.85, ∞)
      MALE.124.70(0.70, 56)

      TABLE 11  GEE results for cancer screening data

       (a) Mammography(b) Ordinal outcome
        StandardOdds StandardOdds
      ParameterEstimateerrorratioEstimateerrorratio
      Intercept 1−1.400.22—−2.570.21—
      Intercept 2———−1.060.20—
      Hmv2.280.149.803.170.1523.8
      Internal med.0.150.171.160.230.161.26
      Female physician0.360.181.430.520.161.68
      Patient nonsmoker0.450.151.560.320.121.38
      Patient (60, 69) yrs−0.040.120.970.030.121.03
      Patient 70 + yrs−0.190.170.82−0.120.150.89
      Previous Article Next Article
      • Related Articles
      • Literature Cited
      • Most Downloaded
      Most Downloaded from this journal

      The Growing Impact of Globalization for Health and Public Health Practice

      Ronald Labonté, Katia Mohindra, and Ted Schrecker
      Vol. 32, 2011

      Abstract - FiguresPreview

      Abstract

      In recent decades, public health policy and practice have been increasingly challenged by globalization, even as global financing for health has increased dramatically. This article discusses globalization and its health challenges from a vantage of ...Read More

      • Full Text HTML
      • Download PDF
      • Figures
      image

      Figure 1: Global poverty: World Bank $1.25/day poverty line. Source: Data from Reference 24. Note that East Asia and Pacific includes China; South Asia includes India.

      image

      Figure 2: Global poverty: World Bank $2.50/day poverty line. Source: Data from Reference 24. Note that East Asia and Pacific includes China; South Asia includes India.

      image

      Figure 3: Quadruple burden of disease in South Africa: percentage of overall years of life lost, 2000. Source: (16). “Pre-transitional causes” of death include communicable diseases, maternal and peri...


      Racism and Health: Evidence and Needed Research

      David R. Williams, Jourdyn A. Lawrence, Brigette A. Davis
      Vol. 40, 2019

      AbstractPreview

      Abstract

      In recent decades, there has been remarkable growth in scientific research examining the multiple ways in which racism can adversely affect health. This interest has been driven in part by the striking persistence of racial/ethnic inequities in health and ...Read More

      • Full Text HTML
      • Download PDF

      Designing Difference in Difference Studies: Best Practices for Public Health Policy Research

      Coady Wing, Kosali Simon, Ricardo A. Bello-Gomez
      Vol. 39, 2018

      AbstractPreview

      Abstract

      The difference in difference (DID) design is a quasi-experimental research design that researchers often use to study causal relationships in public health settings where randomized controlled trials (RCTs) are infeasible or unethical. However, causal ...Read More

      • Full Text HTML
      • Download PDF

      Public Health and Online Misinformation: Challenges and Recommendations

      Briony Swire-Thompson and David Lazer
      Vol. 41, 2020

      Abstract - FiguresPreview

      Abstract

      The internet has become a popular resource to learn about health and to investigate one's own health condition. However, given the large amount of inaccurate information online, people can easily become misinformed. Individuals have always obtained ...Read More

      • Full Text HTML
      • Download PDF
      • Figures
      image

      Figure 1: User ratings of apricot kernels receive a 4.60 out of 5 efficacy score for cancer on WebMD (130).

      image

      Figure 2: Survival of patients with colorectal cancers receiving alternative medicine (blue solid line) versus conventional cancer treatment (orange dashed line). Figure adapted with permission from J...

      image

      Figure 3: Percentage of US adults who say they have a great deal of confidence in the people in the scientific community, medicine, and the press between 1972 and 2018. Figure adapted with permission ...


      The Role of Media Violence in Violent Behavior

      L. Rowell Huesmann and Laramie D. Taylor
      Vol. 27, 2006

      Abstract - FiguresPreview

      Abstract

      ▪ Abstract Media violence poses a threat to public health inasmuch as it leads to an increase in real-world violence and aggression. Research shows that fictional television and film violence contribute to both a short-term and a long-term increase in ...Read More

      • Full Text HTML
      • Download PDF
      • Figures
      image

      Figure 1 : The relative strength of known public health threats.


      See More
      • © Copyright 2022
      • Contact Us
      • Email Preferences
      • Annual Reviews Directory
      • Multimedia
      • Supplemental Materials
      • FAQs
      • Privacy Policy
      Back to Top