Skip to main content

Showing 1–47 of 47 results for author: Bühlmann, P

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.04715  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Causality Pursuit from Heterogeneous Environments via Neural Adversarial Invariance Learning

    Authors: Yihong Gu, Cong Fang, Peter Bühlmann, Jianqing Fan

    Abstract: Pursuing causality from data is a fundamental problem in scientific discovery, treatment intervention, and transfer learning. This paper introduces a novel algorithmic method for addressing nonparametric invariance and causality learning in regression models across multiple environments, where the joint distribution of response variables and covariates varies, but the conditional expectations of o… ▽ More

    Submitted 30 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 48 pages, 7 figures with appendix

    MSC Class: 62G08

  2. arXiv:2402.09758  [pdf, other

    stat.ME math.ST stat.ML

    Extrapolation-Aware Nonparametric Statistical Inference

    Authors: Niklas Pfister, Peter Bühlmann

    Abstract: We define extrapolation as any type of statistical inference on a conditional function (e.g., a conditional expectation or conditional quantile) evaluated outside of the support of the conditioning variable. This type of extrapolation occurs in many data analysis applications and can invalidate the resulting conclusions if not taken into account. While extrapolating is straightforward in parametri… ▽ More

    Submitted 12 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  3. arXiv:2312.08485  [pdf, ps, other

    math.ST

    Distributional Robustness and Transfer Learning Through Empirical Bayes

    Authors: Michael Law, Peter Bühlmann, Ya'acov Ritov

    Abstract: We consider the problem of statistical inference on parameters of a target population when auxiliary observations are available from related populations. We propose a flexible empirical Bayes approach that can be applied on top of any asymptotically linear estimator to incorporate information from related populations when constructing confidence regions. The proposed methodology is valid regardles… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  4. arXiv:2308.10375  [pdf, other

    stat.ME math.ST

    Model Selection over Partially Ordered Sets

    Authors: Armeen Taeb, Peter Bühlmann, Venkat Chandrasekaran

    Abstract: In problems such as variable selection and graph estimation, models are characterized by Boolean logical structure such as presence or absence of a variable or an edge. Consequently, false positive error or false negative error can be specified as the number of variables/edges that are incorrectly included or excluded in an estimated model. However, there are several other problems such as ranking… ▽ More

    Submitted 15 April, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: added an acknowledgement section that was missing in v1 and updated a figure and made some minor updates

    Journal ref: Proceedings of National Academy of Sciences, 2024

  5. arXiv:2302.05761  [pdf, other

    math.ST stat.ML

    Confidence and Uncertainty Assessment for Distributional Random Forests

    Authors: Jeffrey Näf, Corinne Emmenegger, Peter Bühlmann, Nicolai Meinshausen

    Abstract: The Distributional Random Forest (DRF) is a recently introduced Random Forest algorithm to estimate multivariate conditional distributions. Due to its general estimation procedure, it can be employed to estimate a wide range of targets such as conditional average treatment effects, conditional quantiles, and conditional correlations. However, only results about the consistency and convergence rate… ▽ More

    Submitted 19 December, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  6. arXiv:2206.14591  [pdf, other

    stat.ME math.ST stat.ML

    Treatment Effect Estimation with Observational Network Data using Machine Learning

    Authors: Corinne Emmenegger, Meta-Lina Spohn, Timon Elmer, Peter Bühlmann

    Abstract: Causal inference methods for treatment effect estimation usually assume independent units. However, this assumption is often questionable because units may interact, resulting in spillover effects between units. We develop augmented inverse probability weighting (AIPW) for estimation and inference of the direct effect of the treatment with observational data from a single (social) network with spi… ▽ More

    Submitted 4 September, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

  7. arXiv:2205.08925  [pdf, other

    stat.ME math.ST

    Ancestor regression in linear structural equation models

    Authors: Christoph Schultheiss, Peter Bühlmann

    Abstract: We present a new method for causal discovery in linear structural equation models. We propose a simple ``trick'' based on statistical testing in linear models that can distinguish between ancestors and non-ancestors of any given variable. Naturally, this can then be extended to estimating the causal order among all variables. We provide explicit error control for false causal discovery, at least a… ▽ More

    Submitted 14 March, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

  8. arXiv:2203.12808  [pdf, other

    stat.ME math.ST stat.ML

    Robustness Against Weak or Invalid Instruments: Exploring Nonlinear Treatment Models with Machine Learning

    Authors: Zijian Guo, Mengchu Zheng, Peter Bühlmann

    Abstract: We discuss causal inference for observational studies with possibly invalid instrumental variables. We propose a novel methodology called two-stage curvature identification (TSCI) by exploring the nonlinear treatment model with machine learning. {The first-stage machine learning enables improving the instrumental variable's strength and adjusting for different forms of violating the instrumental v… ▽ More

    Submitted 4 January, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

  9. arXiv:2111.14969  [pdf, other

    math.ST stat.ME stat.ML

    A Fast Non-parametric Approach for Local Causal Structure Learning

    Authors: Mona Azadkia, Armeen Taeb, Peter Bühlmann

    Abstract: We study the problem of causal structure learning with essentially no assumptions on the functional relationships and noise. We develop DAG-FOCI, a computationally fast algorithm for this setting that is based on the FOCI variable selection algorithm in~\cite{azadkia2021simple}. DAG-FOCI outputs the set of parents of a response variable of interest. We provide theoretical guarantees of our procedu… ▽ More

    Submitted 18 March, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 27 pages

    MSC Class: 62D20

  10. arXiv:2108.13657  [pdf, other

    stat.ME math.ST stat.ML

    Double Machine Learning for Partially Linear Mixed-Effects Models with Repeated Measurements

    Authors: Corinne Emmenegger, Peter Bühlmann

    Abstract: Traditionally, spline or kernel approaches in combination with parametric estimation are used to infer the linear coefficient (fixed effects) in a partially linear mixed-effects model for repeated measurements. Using machine learning algorithms allows us to incorporate complex interaction structures and high-dimensional variables. We employ double machine learning to cope with the nonparametric pa… ▽ More

    Submitted 3 February, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

  11. arXiv:2101.12525  [pdf, other

    stat.ME math.ST

    Regularizing Double Machine Learning in Partially Linear Endogenous Models

    Authors: Corinne Emmenegger, Peter Bühlmann

    Abstract: The linear coefficient in a partially linear model with confounding variables can be estimated using double machine learning (DML). However, this DML estimator has a two-stage least squares (TSLS) interpretation and may produce overly wide confidence intervals. To address this issue, we propose a regularization and selection scheme, regsDML, which leads to narrower confidence intervals. It selects… ▽ More

    Submitted 19 September, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: new content and revised text

  12. arXiv:2101.06950  [pdf, other

    stat.ME math.ST

    Learning and scoring Gaussian latent variable causal models with unknown additive interventions

    Authors: Armeen Taeb, Juan L. Gamella, Christina Heinze-Deml, Peter Bühlmann

    Abstract: With observational data alone, causal structure learning is a challenging problem. The task becomes easier when having access to data collected from perturbations of the underlying system, even when the nature of these is unknown. Existing methods either do not allow for the presence of latent variables or assume that these remain unperturbed. However, these assumptions are hard to justify if the… ▽ More

    Submitted 7 October, 2023; v1 submitted 18 January, 2021; originally announced January 2021.

  13. arXiv:2010.15764  [pdf, other

    stat.ML cs.LG math.ST

    Domain adaptation under structural causal models

    Authors: Yuansi Chen, Peter Bühlmann

    Abstract: Domain adaptation (DA) arises as an important problem in statistical machine learning when the source data used to train a model is different from the target data used to test the model. Recent advances in DA have mainly been application-driven and have largely relied on the idea of a common subspace for source and target data. To understand the empirical successes and failures of DA methods, we p… ▽ More

    Submitted 23 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: 80 pages, 22 figures, accepted in JMLR

  14. arXiv:2010.10194  [pdf, other

    stat.ME cs.LG math.ST stat.CO stat.ML

    Optimistic search: Change point estimation for large-scale data via adaptive logarithmic queries

    Authors: Solt Kovács, Housen Li, Lorenz Haubner, Axel Munk, Peter Bühlmann

    Abstract: Change point estimation is often formulated as a search for the maximum of a gain function describing improved fits when segmenting the data. Searching through all candidates requires $O(n)$ evaluations of the gain function for an interval with $n$ observations. If each evaluation is computationally demanding (e.g. in high-dimensional models), this can become infeasible. Instead, we propose optimi… ▽ More

    Submitted 29 November, 2022; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Generalize the univariate theory to Gaussian mean changes of general dimension, including high-dimensional scenarios

  15. arXiv:2004.03758  [pdf, other

    stat.ME math.ST

    Doubly Debiased Lasso: High-Dimensional Inference under Hidden Confounding

    Authors: Zijian Guo, Domagoj Ćevid, Peter Bühlmann

    Abstract: Inferring causal relationships or related associations from observational data can be invalidated by the existence of hidden confounding. We focus on a high-dimensional linear regression setting, where the measured covariates are affected by hidden confounding and propose the {\em Doubly Debiased Lasso} estimator for individual components of the regression coefficient vector. Our advocated method… ▽ More

    Submitted 20 July, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

  16. arXiv:1909.10828  [pdf, other

    math.ST

    Double-estimation-friendly inference for high-dimensional misspecified models

    Authors: Rajen D. Shah, Peter Bühlmann

    Abstract: All models may be wrong -- but that is not necessarily a problem for inference. Consider the standard $t$-test for the significance of a variable $X$ for predicting response $Y$ whilst controlling for $p$ other covariates $Z$ in a random design linear model. This yields correct asymptotic type~I error control for the null hypothesis that $X$ is conditionally independent of $Y$ given $Z$ under an \… ▽ More

    Submitted 19 May, 2022; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: To appear in Statistical Science

  17. arXiv:1908.03606  [pdf, other

    stat.ME math.ST

    Goodness-of-fit testing in high-dimensional generalized linear models

    Authors: Jana Janková, Rajen D. Shah, Peter Bühlmann, Richard J. Samworth

    Abstract: We propose a family of tests to assess the goodness-of-fit of a high-dimensional generalized linear model. Our framework is flexible and may be used to construct an omnibus test or directed against testing specific non-linearities and interaction effects, or for testing the significance of groups of variables. The methodology is based on extracting left-over signal in the residuals from an initial… ▽ More

    Submitted 12 November, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

    Comments: 40 pages, 4 figures

  18. arXiv:1706.08058  [pdf, ps, other

    math.ST stat.AP stat.ME

    Invariant Causal Prediction for Sequential Data

    Authors: Niklas Pfister, Peter Bühlmann, Jonas Peters

    Abstract: We investigate the problem of inferring the causal predictors of a response $Y$ from a set of $d$ explanatory variables $(X^1,\dots,X^d)$. Classical ordinary least squares regression includes all predictors that reduce the variance of $Y$. Using only the causal predictors instead leads to models that have the advantage of remaining invariant under interventions, loosely speaking they lead to invar… ▽ More

    Submitted 28 May, 2018; v1 submitted 25 June, 2017; originally announced June 2017.

    Comments: 55 pages

    MSC Class: 62L05; 62P20; 63J05 ACM Class: G.3

  19. arXiv:1607.05980  [pdf, other

    math.ST

    Causal inference in partially linear structural equation models

    Authors: Dominik Rothenhäusler, Jan Ernest, Peter Bühlmann

    Abstract: We consider identifiability of partially linear additive structural equation models with Gaussian noise (PLSEMs) and estimation of distributionally equivalent models to a given PLSEM. Thereby, we also include robustness results for errors in the neighborhood of Gaussian distributions. Existing identifiability results in the framework of additive SEMs with Gaussian noise are limited to linear and n… ▽ More

    Submitted 14 December, 2017; v1 submitted 20 July, 2016; originally announced July 2016.

    Comments: D.R. and J.E. contributed equally to this work

    MSC Class: 62G99; 62H99; 68T99

  20. arXiv:1603.00285  [pdf, ps, other

    math.ST stat.ML

    Kernel-based Tests for Joint Independence

    Authors: Niklas Pfister, Peter Bühlmann, Bernhard Schölkopf, Jonas Peters

    Abstract: We investigate the problem of testing whether $d$ random variables, which may or may not be continuous, are jointly (or mutually) independent. Our method builds on ideas of the two variable Hilbert-Schmidt independence criterion (HSIC) but allows for an arbitrary number of variables. We embed the $d$-dimensional joint distribution and the product of the marginals into a reproducing kernel Hilbert… ▽ More

    Submitted 4 November, 2016; v1 submitted 1 March, 2016; originally announced March 2016.

    Comments: 67 pages

  21. arXiv:1601.03704  [pdf, other

    stat.ME math.ST stat.CO

    Computationally efficient change point detection for high-dimensional regression

    Authors: Florencia Leonardi, Peter Bühlmann

    Abstract: Large-scale sequential data is often exposed to some degree of inhomogeneity in the form of sudden changes in the parameters of the data-generating process. We consider the problem of detecting such structural changes in a high-dimensional regression setting. We propose a joint estimator of the number and the locations of the change points and of the parameters in the corresponding segments. The e… ▽ More

    Submitted 14 January, 2016; originally announced January 2016.

  22. arXiv:1511.03334  [pdf, other

    stat.ME math.ST

    Goodness of fit tests for high-dimensional linear models

    Authors: Rajen D. Shah, Peter Bühlmann

    Abstract: In this work we propose a framework for constructing goodness of fit tests in both low and high-dimensional linear models. We advocate applying regression methods to the scaled residuals following either an ordinary least squares or Lasso fit to the data, and using some proxy for prediction error as the final test statistic. We call this family Residual Prediction (RP) tests. We show that simulati… ▽ More

    Submitted 8 April, 2017; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: 42 pages, 12 figures

  23. arXiv:1502.03300  [pdf, other

    math.ST

    A sequential rejection testing method for high-dimensional regression with correlated variables

    Authors: Jacopo Mandozzi, Peter Bühlmann

    Abstract: We propose a general, modular method for significance testing of groups (or clusters) of variables in a high-dimensional linear model. In presence of high correlations among the covariables, due to serious problems of identifiability, it is indispensable to focus on detecting groups of variables rather than singletons. We propose an inference method which allows to build in hierarchical structures… ▽ More

    Submitted 11 February, 2015; originally announced February 2015.

  24. Discussion: "A significance test for the lasso"

    Authors: Peter Bühlmann, Lukas Meier, Sara van de Geer

    Abstract: Discussion of "A significance test for the lasso" by Richard Lockhart, Jonathan Taylor, Ryan J. Tibshirani, Robert Tibshirani [arXiv:1301.7161].

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1175A the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1175A

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 2, 469-477

  25. arXiv:1312.5556  [pdf, other

    math.ST

    Hierarchical Testing in the High-Dimensional Setting with Correlated Variables

    Authors: Jacopo Mandozzi, Peter Bühlmann

    Abstract: We propose a method for testing whether hierarchically ordered groups of potentially correlated variables are significant for explaining a response in a high-dimensional linear model. In presence of highly correlated variables, as is very common in high-dimensional data, it seems indispensable to go beyond an approach of inferring individual regression coefficients, and we show that detecting smal… ▽ More

    Submitted 3 September, 2014; v1 submitted 19 December, 2013; originally announced December 2013.

  26. arXiv:1311.3492  [pdf, ps, other

    stat.ML math.ST

    High-dimensional learning of linear causal networks via inverse covariance estimation

    Authors: Po-Ling Loh, Peter Bühlmann

    Abstract: We establish a new framework for statistical estimation of directed acyclic graphs (DAGs) when data are generated from a linear, possibly non-Gaussian structural equation model. Our framework consists of two parts: (1) inferring the moralized graph from the support of the inverse covariance matrix; and (2) selecting the best-scoring graph amongst DAGs that are consistent with the moralized graph.… ▽ More

    Submitted 14 November, 2013; originally announced November 2013.

    Comments: 41 pages, 7 figures

    MSC Class: 62F12

  27. arXiv:1303.3216  [pdf, other

    math.ST stat.ME

    Jointly interventional and observational data: estimation of interventional Markov equivalence classes of directed acyclic graphs

    Authors: Alain Hauser, Peter Bühlmann

    Abstract: In many applications we have both observational and (randomized) interventional data. We propose a Gaussian likelihood framework for joint modeling of such different data-types, based on global parameters consisting of a directed acyclic graph (DAG) and correponding edge weights and error variances. Thanks to the global nature of the parameters, maximum likelihood estimation is reasonable with onl… ▽ More

    Submitted 13 March, 2013; originally announced March 2013.

  28. On asymptotically optimal confidence regions and tests for high-dimensional models

    Authors: Sara van de Geer, Peter Bühlmann, Ya'acov Ritov, Ruben Dezeure

    Abstract: We propose a general method for constructing confidence intervals and statistical tests for single or low-dimensional components of a large parameter vector in a high-dimensional model. It can be easily adjusted for multiplicity taking dependence among tests into account. For linear models, our method is essentially the same as in Zhang and Zhang [J. R. Stat. Soc. Ser. B Stat. Methodol. 76 (2014)… ▽ More

    Submitted 23 June, 2014; v1 submitted 3 March, 2013; originally announced March 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOS1221 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1221

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 3, 1166-1202

  29. Correlated variables in regression: clustering and sparse estimation

    Authors: Peter Bühlmann, Philipp Rütimann, Sara van de Geer, Cun-Hui Zhang

    Abstract: We consider estimation in a high-dimensional linear model with strongly correlated variables. We propose to cluster the variables first and do subsequent sparse estimation such as the Lasso for cluster-representatives or the group Lasso based on the structure from the clusters. Regarding the first step, we present a novel and bottom-up agglomerative clustering algorithm based on canonical correlat… ▽ More

    Submitted 26 September, 2012; originally announced September 2012.

    Comments: 40 pages, 6 figures

    MSC Class: 62J07; 62H30

    Journal ref: Journal of Statistical Planning and Inference 2013, Vol. 143, 1835-1858

  30. Hypersurfaces and their singularities in partial correlation testing

    Authors: Shaowei Lin, Caroline Uhler, Bernd Sturmfels, Peter Bühlmann

    Abstract: An asymptotic theory is developed for computing volumes of regions in the parameter space of a directed Gaussian graphical model that are obtained by bounding partial correlations. We study these volumes using the method of real log canonical thresholds from algebraic geometry. Our analysis involves the computation of the singular loci of correlation hypersurfaces. Statistical applications include… ▽ More

    Submitted 2 December, 2013; v1 submitted 3 September, 2012; originally announced September 2012.

  31. Geometry of the faithfulness assumption in causal inference

    Authors: Caroline Uhler, Garvesh Raskutti, Peter Bühlmann, Bin Yu

    Abstract: Many algorithms for inferring causality rely heavily on the faithfulness assumption. The main justification for imposing this assumption is that the set of unfaithful distributions has Lebesgue measure zero, since it can be seen as a collection of hypersurfaces in a hypercube. However, due to sampling error the faithfulness condition alone is not sufficient for statistical estimation, and strong-f… ▽ More

    Submitted 22 April, 2013; v1 submitted 2 July, 2012; originally announced July 2012.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOS1080 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1080

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 2, 436-463

  32. $\ell_0$-penalized maximum likelihood for sparse directed acyclic graphs

    Authors: Sara van de Geer, Peter Bühlmann

    Abstract: We consider the problem of regularized maximum likelihood estimation for the structure and parameters of a high-dimensional, sparse directed acyclic graphical (DAG) model with Gaussian distribution, or equivalently, of a Gaussian structural equation model. We show that the $\ell_0$-penalized maximum likelihood estimator of a DAG has about the same number of edges as the minimal-edge I-MAP (a DAG w… ▽ More

    Submitted 9 May, 2013; v1 submitted 24 May, 2012; originally announced May 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1085 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1085

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 2, 536-567

  33. arXiv:1205.2536  [pdf, other

    stat.ML math.ST

    Identifiability of Gaussian structural equation models with equal error variances

    Authors: Jonas Peters, Peter Bühlmann

    Abstract: We consider structural equation models in which variables can be written as a function of their parents and noise terms, which are assumed to be jointly independent. Corresponding to each structural equation model, there is a directed acyclic graph describing the relationships between the variables. In Gaussian structural equation models with linear functions, the graph can be identified from the… ▽ More

    Submitted 28 August, 2013; v1 submitted 11 May, 2012; originally announced May 2012.

    Journal ref: Biometrika 2014, Vol. 101, No. 1, 219-228

  34. Introduction to the Lehmann special section

    Authors: Peter Bühlmann, Tony Cai

    Abstract: The current Special Issue of The Annals of Statistics contains three invited articles. Javier Rojo discusses Erich's scientific achievements and provides complete lists of his scientific writings and his former Ph.D. students. Willem van Zwet describes aspects of Erich's life and work, enriched with personal and interesting anecdotes of Erich's long and productive scientific journey. Finally, Pete… ▽ More

    Submitted 23 February, 2012; originally announced February 2012.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOS928 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS928

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2243-2243

  35. arXiv:1202.1377  [pdf, ps, other

    stat.ME math.ST

    Statistical significance in high-dimensional linear models

    Authors: Peter Bühlmann

    Abstract: We propose a method for constructing p-values for general hypotheses in a high-dimensional linear model. The hypotheses can be local for testing a single regression parameter or they may be more global involving several up to all parameters. Furthermore, when considering many hypotheses, we show how to adjust for multiple testing taking dependence among the p-values into account. Our technique is… ▽ More

    Submitted 11 October, 2013; v1 submitted 7 February, 2012; originally announced February 2012.

    Comments: Published in at http://dx.doi.org/10.3150/12-BEJSP11 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJSP11

    Journal ref: Bernoulli 2013, Vol. 19, No. 4, 1212-1242

  36. GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using L1-Penalization

    Authors: Jürg Schelldorfer, Lukas Meier, Peter Bühlmann

    Abstract: We propose an L1-penalized algorithm for fitting high-dimensional generalized linear mixed models. Generalized linear mixed models (GLMMs) can be viewed as an extension of generalized linear models for clustered observations. This Lasso-type approach for GLMMs should be mainly used as variable screening method to reduce the number of variables below the sample size. We then suggest a refitting by… ▽ More

    Submitted 20 November, 2012; v1 submitted 19 September, 2011; originally announced September 2011.

    Journal ref: Journal of Computational and Graphical Statistics. Volume 23, Issue 2, 2014, pages 460-477

  37. Asymptotic optimality of the Westfall--Young permutation procedure for multiple testing under dependence

    Authors: Nicolai Meinshausen, Marloes H. Maathuis, Peter Bühlmann

    Abstract: Test statistics are often strongly dependent in large-scale multiple testing applications. Most corrections for multiplicity are unduly conservative for correlated test statistics, resulting in a loss of power to detect true positives. We show that the Westfall--Young permutation method has asymptotically optimal power for a broad class of testing problems with a block-dependence and sparsity stru… ▽ More

    Submitted 19 March, 2012; v1 submitted 10 June, 2011; originally announced June 2011.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOS946 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS946

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 6, 3369-3391

  38. arXiv:1104.2808  [pdf, ps, other

    stat.ME cs.DM math.ST

    Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs

    Authors: Alain Hauser, Peter Bühlmann

    Abstract: The investigation of directed acyclic graphs (DAGs) encoding the same Markov property, that is the same conditional independence relations of multivariate observational distributions, has a long tradition; many algorithms exist for model selection and structure learning in Markov equivalence classes. In this paper, we extend the notion of Markov equivalence of DAGs to the case of interventional di… ▽ More

    Submitted 26 September, 2012; v1 submitted 14 April, 2011; originally announced April 2011.

    Journal ref: Journal of Machine Learning Research, 13:2409-2464, 2012

  39. arXiv:1009.0530  [pdf, ps, other

    stat.ML math.ST

    High-dimensional covariance estimation based on Gaussian graphical models

    Authors: Shuheng Zhou, Philipp Rutimann, Min Xu, Peter Buhlmann

    Abstract: Undirected graphs are often used to describe high dimensional distributions. Under sparsity conditions, the graph can be estimated using $\ell_1$-penalization methods. We propose and study the following method. We combine a multiple regression approach with ideas of thresholding and refitting: first we infer a sparse undirected graphical model structure via thresholding of each among many… ▽ More

    Submitted 22 June, 2011; v1 submitted 2 September, 2010; originally announced September 2010.

    Comments: 50 Pages, 6 figures. Major revision

    Report number: University of Michigan, Department of Statistics Technical Report 512

    Journal ref: Journal of Machine Learning Research, Volume 12, pp 2975-3026, 2011

  40. arXiv:1001.5176  [pdf, ps, other

    math.ST

    The adaptive and the thresholded Lasso for potentially misspecified models

    Authors: Sara van de Geer, Peter Buhlmann, Shuheng Zhou

    Abstract: We revisit the adaptive Lasso as well as the thresholded Lasso with refitting, in a high-dimensional linear model, and study prediction error, $\ell_q$-error ($q \in \{1, 2 \} $), and number of false positive selections. Our theoretical results for the two methods are, at a rather fine scale, comparable. The differences only show up in terms of the (minimal) restricted and sparse eigenvalues, favo… ▽ More

    Submitted 15 July, 2010; v1 submitted 28 January, 2010; originally announced January 2010.

    Comments: 45 pages

    MSC Class: 62J07 62G08

    Journal ref: The Electronic Journal of Statistics 5 (2011) 688-749

  41. arXiv:0910.0722  [pdf, other

    math.ST stat.ML

    On the conditions used to prove oracle results for the Lasso

    Authors: Sara A. van de Geer, Peter Bühlmann

    Abstract: Oracle inequalities and variable selection properties for the Lasso in linear models have been established under a variety of different assumptions on the design matrix. We show in this paper how the different conditions and concepts relate to each other. The restricted eigenvalue condition (Bickel et al., 2009) or the slightly weaker compatibility condition (van de Geer, 2007) are sufficient fo… ▽ More

    Submitted 5 October, 2009; originally announced October 2009.

    Comments: 33 pages, 1 figure

    Journal ref: Electronic Journal of Statistics, 3, (2009), 1360-1392

  42. arXiv:0903.2515  [pdf, ps, other

    math.ST stat.ML

    Adaptive Lasso for High Dimensional Regression and Gaussian Graphical Modeling

    Authors: Shuheng Zhou, Sara van de Geer, Peter Bühlmann

    Abstract: We show that the two-stage adaptive Lasso procedure (Zou, 2006) is consistent for high-dimensional model selection in linear and Gaussian graphical models. Our conditions for consistency cover more general situations than those accomplished in previous work: we prove that restricted eigenvalue conditions (Bickel et al., 2008) are also sufficient for sparse structure estimation.

    Submitted 13 March, 2009; originally announced March 2009.

    Comments: 30 pages

  43. Discussion: One-step sparse estimates in nonconcave penalized likelihood models

    Authors: Peter Bühlmann, Lukas Meier

    Abstract: Discussion of ``One-step sparse estimates in nonconcave penalized likelihood models'' [arXiv:0808.1012]

    Submitted 7 August, 2008; originally announced August 2008.

    Comments: Published in at http://dx.doi.org/10.1214/07-AOS0316A the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0316A

    Journal ref: Annals of Statistics 2008, Vol. 36, No. 4, 1534-1541

  44. Smoothing $\ell_1$-penalized estimators for high-dimensional time-course data

    Authors: Lukas Meier, Peter Bühlmann

    Abstract: When a series of (related) linear models has to be estimated it is often appropriate to combine the different data-sets to construct more efficient estimators. We use $\ell_1$-penalized estimators like the Lasso or the Adaptive Lasso which can simultaneously do parameter estimation and model selection. We show that for a time-course of high-dimensional linear models the convergence rates of the… ▽ More

    Submitted 11 December, 2007; originally announced December 2007.

    Comments: Published in at http://dx.doi.org/10.1214/07-EJS103 the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-EJS-EJS_2007_103 MSC Class: 62J07 (Primary); 62J99; 62H12 (Secondary)

    Journal ref: Electronic Journal of Statistics 2007, Vol. 1, 597-615

  45. High-dimensional graphs and variable selection with the Lasso

    Authors: Nicolai Meinshausen, Peter Bühlmann

    Abstract: The pattern of zero entries in the inverse covariance matrix of a multivariate normal distribution corresponds to conditional independence restrictions between variables. Covariance selection aims at estimating those structural zeros from data. We show that neighborhood selection with the Lasso is a computationally attractive alternative to standard covariance selection for sparse high-dimension… ▽ More

    Submitted 1 August, 2006; originally announced August 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000281 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0163 MSC Class: 62J07 (Primary) 62H20; 62F12 (Secondary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 3, 1436-1462

  46. Boosting for high-dimensional linear models

    Authors: Peter Bühlmann

    Abstract: We prove that boosting with the squared error loss, $L_2$Boosting, is consistent for very high-dimensional linear models, where the number of predictor variables is allowed to grow essentially as fast as $O$(exp(sample size)), assuming that the true underlying regression function is sparse in terms of the $\ell_1$-norm of the regression coefficients. In the language of signal processing, this me… ▽ More

    Submitted 30 June, 2006; originally announced June 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000092 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0121 MSC Class: 62J05; 62J07 (Primary) 49M15; 62P10; 68Q32 (Secondary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 2, 559-583

  47. arXiv:math/0510436  [pdf, ps, other

    math.ST

    Estimating high-dimensional directed acyclic graphs with the PC-algorithm

    Authors: Markus Kalisch, Peter Buehlmann

    Abstract: We consider the PC-algorithm Spirtes et. al. (2000) for estimating the skeleton of a very high-dimensional acyclic directed graph (DAG) with corresponding Gaussian distribution. The PC-algorithm is computationally feasible for sparse problems with many nodes, i.e. variables, and it has the attractive property to automatically achieve high computational efficiency as a function of sparseness of t… ▽ More

    Submitted 20 October, 2005; originally announced October 2005.

    MSC Class: 62H20; 62H12 (Primary); 68Q32 (Secondary)