
Full text loading...
We review statistical methods for high-dimensional data analysis and pay particular attention to recent developments for assessing uncertainties in terms of controlling false positive statements (type I error) and p-values. The main focus is on regression models, but we also discuss graphical modeling and causal inference based on observational data. We illustrate the concepts and methods with various packages from the statistical software using a high-throughput genomic data set about riboflavin production with Bacillus subtilis, which we make publicly available for the first time.
Article metrics loading...
Full text loading...
Data & Media loading...
Download Supplemental Text (PDF) Download data sets:
riboflavin (CSV)
riboflavingrouped (CSV)
riboflavingrouped_structure (CSV)
riboflavinv100 (CSV)