High-Dimensional Statistics with a View Toward Applications in Biology

Annual Review of Statistics and Its Application

Vol. 1:255-278 (Volume publication date January 2014)
https://doi.org/10.1146/annurev-statistics-022513-115545

Abstract

We review statistical methods for high-dimensional data analysis and pay particular attention to recent developments for assessing uncertainties in terms of controlling false positive statements (type I error) and p-values. The main focus is on regression models, but we also discuss graphical modeling and causal inference based on observational data. We illustrate the concepts and methods with various packages from the statistical software using a high-throughput genomic data set about riboflavin production with Bacillus subtilis, which we make publicly available for the first time.

Keywords