Skip to main content

Showing 1–50 of 61 results for author: Lederer, J

.
  1. arXiv:2407.04526  [pdf, other

    physics.comp-ph

    Peering inside the black box: Learning the relevance of many-body functions in Neural Network potentials

    Authors: Klara Bonneau, Jonas Lederer, Clark Templeton, David Rosenberger, Klaus-Robert Müller, Cecilia Clementi

    Abstract: Machine learned potentials are becoming a popular tool to define an effective energy model for complex systems, either incorporating electronic structure effects at the atomistic resolution, or effectively renormalizing part of the atomistic degrees of freedom at a coarse-grained resolution. One of the main criticisms to machine learned potentials is that the energy inferred by the network is not… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2405.16696  [pdf, other

    math.ST stat.ML

    How many samples are needed to train a deep neural network?

    Authors: Pegah Golestaneh, Mahsa Taheri, Johannes Lederer

    Abstract: Neural networks have become standard tools in many areas, yet many important statistical questions remain open. This paper studies the question of how much data are needed to train a ReLU feed-forward neural network. Our theoretical and empirical results suggest that the generalization error of ReLU feed-forward neural networks scales at the rate $1/\sqrt{n}$ in the sample size $n$ rather than the… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2405.14529  [pdf, other

    cs.CV

    AnomalyDINO: Boosting Patch-based Few-shot Anomaly Detection with DINOv2

    Authors: Simon Damm, Mike Laszkiewicz, Johannes Lederer, Asja Fischer

    Abstract: Recent advances in multimodal foundation models have set new standards in few-shot anomaly detection. This paper explores whether high-quality visual features alone are sufficient to rival existing state-of-the-art vision-language models. We affirm this by adapting DINOv2 for one-shot and few-shot anomaly detection, with a focus on industrial applications. We show that this approach does not only… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2401.13555  [pdf, other

    cs.CV cs.AI cs.LG

    Benchmarking the Fairness of Image Upsampling Methods

    Authors: Mike Laszkiewicz, Imant Daunhawer, Julia E. Vogt, Asja Fischer, Johannes Lederer

    Abstract: Recent years have witnessed a rapid development of deep generative models for creating synthetic media, such as images and videos. While the practical applications of these models in everyday tasks are enticing, it is crucial to assess the inherent risks regarding their fairness. In this work, we introduce a comprehensive framework for benchmarking the performance and fairness of conditional gener… ▽ More

    Submitted 29 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

  5. arXiv:2311.09245  [pdf, other

    cs.LG math.ST stat.ML

    Affine Invariance in Continuous-Domain Convolutional Neural Networks

    Authors: Ali Mohaddes, Johannes Lederer

    Abstract: The notion of group invariance helps neural networks in recognizing patterns and features under geometric transformations. Indeed, it has been shown that group invariance can largely improve deep learning performances in practice, where such transformations are very common. This research studies affine invariance on continuous-domain convolutional neural networks. Despite other research considerin… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  6. arXiv:2307.15067  [pdf, ps, other

    cs.CV cs.CR cs.LG

    Set-Membership Inference Attacks using Data Watermarking

    Authors: Mike Laszkiewicz, Denis Lukovnikov, Johannes Lederer, Asja Fischer

    Abstract: In this work, we propose a set-membership inference attack for generative models using deep image watermarking techniques. In particular, we demonstrate how conditional sampling from a generative model can reveal the watermark that was injected into parts of the training data. Our empirical results demonstrate that the proposed watermarking technique is a principled approach for detecting the non-… ▽ More

    Submitted 22 June, 2023; originally announced July 2023.

    Comments: Preliminary work

  7. arXiv:2306.06210  [pdf, other

    cs.CV cs.LG

    Single-Model Attribution of Generative Models Through Final-Layer Inversion

    Authors: Mike Laszkiewicz, Jonas Ricker, Johannes Lederer, Asja Fischer

    Abstract: Recent breakthroughs in generative modeling have sparked interest in practical single-model attribution. Such methods predict whether a sample was generated by a specific generator or not, for instance, to prove intellectual property theft. However, previous works are either limited to the closed-world setting or require undesirable changes to the generative model. We address these shortcomings by… ▽ More

    Submitted 26 June, 2024; v1 submitted 26 May, 2023; originally announced June 2023.

    Comments: Accepted at the Forty-first International Conference on Machine Learning [ICML2024]

  8. arXiv:2303.04258  [pdf, ps, other

    stat.ME math.ST

    Extremes in High Dimensions: Methods and Scalable Algorithms

    Authors: Johannes Lederer, Marco Oesting

    Abstract: Extreme-value theory has been explored in considerable detail for univariate and low-dimensional observations, but the field is still in an early stage regarding high-dimensional multivariate observations. In this paper, we focus on Hüsler-Reiss models and their domain of attraction, a popular class of models for multivariate extremes that exhibit some similarities to multivariate Gaussian distrib… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 26 pages

  9. arXiv:2303.02114  [pdf, ps, other

    math.ST cs.LG stat.ML

    Lag selection and estimation of stable parameters for multiple autoregressive processes through convex programming

    Authors: Somnath Chakraborty, Johannes Lederer, Rainer von Sachs

    Abstract: Motivated by a variety of applications, high-dimensional time series have become an active topic of research. In particular, several methods and finite-sample theories for individual stable autoregressive processes with known lag have become available very recently. We, instead, consider multiple stable autoregressive processes that share an unknown lag. We use information across the different pro… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  10. arXiv:2302.11241  [pdf, other

    cs.LG cs.AI stat.ME

    The DeepCAR Method: Forecasting Time-Series Data That Have Change Points

    Authors: Ayla Jungbluth, Johannes Lederer

    Abstract: Many methods for time-series forecasting are known in classical statistics, such as autoregression, moving averages, and exponential smoothing. The DeepAR framework is a novel, recent approach for time-series forecasting based on deep learning. DeepAR has shown very promising results already. However, time series often have change points, which can degrade the DeepAR's prediction performance subst… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  11. arXiv:2302.08235  [pdf, other

    math.ST

    Reducing Computational and Statistical Complexity in Machine Learning Through Cardinality Sparsity

    Authors: Ali Mohades, Johannes Lederer

    Abstract: High-dimensional data has become ubiquitous across the sciences but causes computational and statistical challenges. A common approach for dealing with these challenges is sparsity. In this paper, we introduce a new concept of sparsity, called cardinality sparsity. Broadly speaking, we call a tensor sparse if it contains only a small number of unique values. We show that cardinality sparsity can i… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  12. arXiv:2212.05517  [pdf, other

    physics.chem-ph stat.ML

    SchNetPack 2.0: A neural network toolbox for atomistic machine learning

    Authors: Kristof T. Schütt, Stefaan S. P. Hessmann, Niklas W. A. Gebauer, Jonas Lederer, Michael Gastegger

    Abstract: SchNetPack is a versatile neural networks toolbox that addresses both the requirements of method development and application of atomistic machine learning. Version 2.0 comes with an improved data pipeline, modules for equivariant neural networks as well as a PyTorch implementation of molecular dynamics. An optional integration with PyTorch Lightning and the Hydra configuration framework powers a f… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  13. arXiv:2212.05427  [pdf, ps, other

    cs.LG cs.AI math.ST stat.ML

    Statistical guarantees for sparse deep learning

    Authors: Johannes Lederer

    Abstract: Neural networks are becoming increasingly popular in applications, but our mathematical understanding of their potential and limitations is still limited. In this paper, we further this understanding by develo** statistical guarantees for sparse deep learning. In contrast to previous work, we consider different types of sparsity, such as few active connections, few active nodes, and other norm-b… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  14. arXiv:2206.10311  [pdf, other

    cs.LG math.ST stat.ML

    Marginal Tail-Adaptive Normalizing Flows

    Authors: Mike Laszkiewicz, Johannes Lederer, Asja Fischer

    Abstract: Learning the tail behavior of a distribution is a notoriously difficult problem. By definition, the number of samples from the tail is small, and deep generative models, such as normalizing flows, tend to concentrate on learning the body of the distribution. In this paper, we focus on improving the ability of normalizing flows to correctly capture the tail behavior and, thus, form more accurate mo… ▽ More

    Submitted 27 June, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML2022 Thirty-ninth International Conference on Machine Learning

  15. arXiv:2205.04491  [pdf, other

    cs.LG math.ST

    Statistical Guarantees for Approximate Stationary Points of Simple Neural Networks

    Authors: Mahsa Taheri, Fang Xie, Johannes Lederer

    Abstract: Since statistical guarantees for neural networks are usually restricted to global optima of intricate objective functions, it is not clear whether these theories really explain the performances of actual outputs of neural-network pipelines. The goal of this paper is, therefore, to bring statistical theory closer to practice. We develop statistical guarantees for simple neural networks that coincid… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  16. arXiv:2203.16205  [pdf, other

    physics.chem-ph cs.LG

    Automatic Identification of Chemical Moieties

    Authors: Jonas Lederer, Michael Gastegger, Kristof T. Schütt, Michael Kampffmeyer, Klaus-Robert Müller, Oliver T. Unke

    Abstract: In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representati… ▽ More

    Submitted 27 April, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

  17. arXiv:2202.00975  [pdf, other

    stat.ML cs.LG stat.ME

    VC-PCR: A Prediction Method based on Supervised Variable Selection and Clustering

    Authors: Rebecca Marion, Johannes Lederer, Bernadette Govaerts, Rainer von Sachs

    Abstract: Sparse linear prediction methods suffer from decreased prediction accuracy when the predictor variables have cluster structure (e.g. there are highly correlated groups of variables). To improve prediction accuracy, various methods have been proposed to identify variable clusters from the data and integrate cluster information into a sparse modeling process. But none of these methods achieve satisf… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  18. arXiv:2201.05055  [pdf, other

    q-bio.GN q-bio.QM stat.ME

    Depth Normalization of Small RNA Sequencing: Using Data and Biology to Select a Suitable Method

    Authors: Yannick Düren, Johannes Lederer, Li-Xuan Qin

    Abstract: Deep sequencing has become one of the most popular tools for transcriptome profiling in biomedical studies. While an abundance of computational methods exists for "normalizing" sequencing data to remove unwanted between-sample variations due to experimental handling, there is no consensus on which normalization is the most suitable for a given data set. To address this problem, we developed "DANA"… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 16 pages, 6 figures

  19. arXiv:2112.11407  [pdf, other

    cs.LG cs.AI stat.ML

    Toward Explainable AI for Regression Models

    Authors: Simon Letzgus, Patrick Wagner, Jonas Lederer, Wojciech Samek, Klaus-Robert Müller, Gregoire Montavon

    Abstract: In addition to the impressive predictive power of machine learning (ML) models, more recently, explanation methods have emerged that enable an interpretation of complex non-linear learning models such as deep neural networks. Gaining a better understanding is especially important e.g. for safety-critical ML applications or medical diagnostics etc. While such Explainable AI (XAI) techniques have re… ▽ More

    Submitted 17 January, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 17 pages, 10 figures, published; changes: 1. references to code and xai-regression.org added (p. 1/2, end of introduction), 2. adjustment of sign-error in restructuring section (p. 8, just above Fig. 4)

    Journal ref: IEEE Signal Processing Magazine (Volume: 39, Issue: 4, July 2022) 40-58

  20. arXiv:2107.07352  [pdf, other

    cs.LG cs.AI stat.ML

    Copula-Based Normalizing Flows

    Authors: Mike Laszkiewicz, Johannes Lederer, Asja Fischer

    Abstract: Normalizing flows, which learn a distribution by transforming the data to samples from a Gaussian base distribution, have proven powerful density approximations. But their expressive power is limited by this choice of the base distribution. We, therefore, propose to generalize the base distribution to a more elaborate copula distribution to capture the properties of the target distribution more ac… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted for presentation at the ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models (INNF+ 2021)

  21. arXiv:2106.02260  [pdf, other

    cs.LG cs.AI stat.ML

    Regularization and Reparameterization Avoid Vanishing Gradients in Sigmoid-Type Networks

    Authors: Leni Ven, Johannes Lederer

    Abstract: Deep learning requires several design choices, such as the nodes' activation functions and the widths, types, and arrangements of the layers. One consideration when making these choices is the vanishing-gradient problem, which is the phenomenon of algorithms getting stuck at suboptimal points due to small gradients. In this paper, we revisit the vanishing-gradient problem in the context of sigmoid… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  22. arXiv:2105.14052  [pdf, other

    cs.LG stat.CO

    Targeted Deep Learning: Framework, Methods, and Applications

    Authors: Shih-Ting Huang, Johannes Lederer

    Abstract: Deep learning systems are typically designed to perform for a wide range of test inputs. For example, deep learning systems in autonomous cars are supposed to deal with traffic situations for which they were not specifically trained. In general, the ability to cope with a broad spectrum of unseen test inputs is called generalization. Generalization is definitely important in applications where the… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  23. arXiv:2105.14035  [pdf, other

    stat.ML cs.LG stat.CO

    DeepMoM: Robust Deep Learning With Median-of-Means

    Authors: Shih-Ting Huang, Johannes Lederer

    Abstract: Data used in deep learning is notoriously problematic. For example, data are usually combined from diverse sources, rarely cleaned and vetted thoroughly, and sometimes corrupted on purpose. Intentional corruption that targets the weak spots of algorithms has been studied extensively under the label of "adversarial attacks." In contrast, the arguably much more common case of corruption that reflect… ▽ More

    Submitted 8 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  24. arXiv:2101.09957  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Activation Functions in Artificial Neural Networks: A Systematic Overview

    Authors: Johannes Lederer

    Abstract: Activation functions shape the outputs of artificial neurons and, therefore, are integral parts of neural networks in general and deep learning in particular. Some activation functions, such as logistic and relu, have been used for many decades. But with deep learning becoming a mainstream research topic, new activation functions have mushroomed, leading to confusion in both theory and practice. T… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  25. arXiv:2010.00885  [pdf, other

    cs.LG stat.ML

    Optimization Landscapes of Wide Deep Neural Networks Are Benign

    Authors: Johannes Lederer

    Abstract: We analyze the optimization landscapes of deep learning with wide networks. We highlight the importance of constraints for such networks and show that constraint -- as well as unconstraint -- empirical-risk minimization over such networks has no confined points, that is, suboptimal parameters that are difficult to escape from. Hence, our theories substantiate the common belief that wide neural net… ▽ More

    Submitted 13 January, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

  26. arXiv:2009.09070  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Is there a role for statistics in artificial intelligence?

    Authors: Sarah Friedrich, Gerd Antes, Sigrid Behr, Harald Binder, Werner Brannath, Florian Dumpert, Katja Ickstadt, Hans Kestler, Johannes Lederer, Heinz Leitgöb, Markus Pauly, Ansgar Steland, Adalbert Wilhelm, Tim Friede

    Abstract: The research on and application of artificial intelligence (AI) has triggered a comprehensive scientific, economic, social and political discussion. Here we argue that statistics, as an interdisciplinary scientific field, plays a substantial role both for the theoretical and practical understanding of AI and for its future development. Statistics might even be considered a core element of AI. With… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

  27. arXiv:2009.06202  [pdf, ps, other

    cs.LG cs.AI cs.NE math.ST stat.ML

    Risk Bounds for Robust Deep Learning

    Authors: Johannes Lederer

    Abstract: It has been observed that certain loss functions can render deep-learning pipelines robust against flaws in the data. In this paper, we support these empirical findings with statistical theory. We especially show that empirical-risk minimization with unbounded, Lipschitz-continuous loss functions, such as the least-absolute deviation loss, Huber loss, Cauchy loss, and Tukey's biweight loss, can pr… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  28. arXiv:2006.15604  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Layer Sparsity in Neural Networks

    Authors: Mohamed Hebiri, Johannes Lederer

    Abstract: Sparsity has become popular in machine learning, because it can save computational resources, facilitate interpretations, and prevent overfitting. In this paper, we discuss sparsity in the framework of neural networks. In particular, we formulate a new notion of sparsity that concerns the networks' layers and, therefore, aligns particularly well with the current trend toward deep networks. We call… ▽ More

    Submitted 28 June, 2020; originally announced June 2020.

  29. arXiv:2006.12296  [pdf, ps, other

    econ.EM

    A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics

    Authors: Sophie-Charlotte Klose, Johannes Lederer

    Abstract: We introduce tools for controlled variable selection to economists. In particular, we apply a recently introduced aggregation scheme for false discovery rate (FDR) control to German administrative data to determine the parts of the individual employment histories that are relevant for the career outcomes of women. Our results suggest that career outcomes can be predicted based on a small set of va… ▽ More

    Submitted 23 June, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  30. arXiv:2006.03589  [pdf, other

    cs.LG cs.AI stat.ML

    Higher-Order Explanations of Graph Neural Networks via Relevant Walks

    Authors: Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T. Schütt, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by ide… ▽ More

    Submitted 26 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 14 pages + 6 pages supplement

  31. arXiv:2006.00294  [pdf, ps, other

    cs.LG cs.NE math.ST stat.ME stat.ML

    Statistical Guarantees for Regularized Neural Networks

    Authors: Mahsa Taheri, Fang Xie, Johannes Lederer

    Abstract: Neural networks have become standard tools in the analysis of data, but they lack comprehensive mathematical theories. For example, there are very few statistical guarantees for learning neural networks from data, especially for classes of estimators that are used in practice or at least similar to such. In this paper, we develop a general statistical guarantee for estimators that consist of a lea… ▽ More

    Submitted 11 November, 2020; v1 submitted 30 May, 2020; originally announced June 2020.

  32. arXiv:2005.00466  [pdf, other

    stat.ML cs.LG stat.ME

    Thresholded Adaptive Validation: Tuning the Graphical Lasso for Graph Recovery

    Authors: Mike Laszkiewicz, Asja Fischer, Johannes Lederer

    Abstract: Many Machine Learning algorithms are formulated as regularized optimization problems, but their performance hinges on a regularization parameter that needs to be calibrated to each application at hand. In this paper, we propose a general calibration scheme for regularized optimization problems and apply it to the graphical lasso, which is a method for Gaussian graphical modeling. The scheme is equ… ▽ More

    Submitted 30 March, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: To appear in the proceedings of Artificial Intelligence and Statistics (AISTATS) 2021

  33. arXiv:2004.11554  [pdf, other

    stat.ME

    Estimating the Lasso's Effective Noise

    Authors: Johannes Lederer, Michael Vogt

    Abstract: Much of the theory for the lasso in the linear model $Y = X β^* + \varepsilon$ hinges on the quantity $2 \| X^\top \varepsilon \|_{\infty} / n$, which we call the lasso's effective noise. Among other things, the effective noise plays an important role in finite-sample bounds for the lasso, the calibration of the lasso's tuning parameter, and inference on the parameter vector $β^*$. In this paper,… ▽ More

    Submitted 21 January, 2022; v1 submitted 24 April, 2020; originally announced April 2020.

    MSC Class: 62J07; 62F03; 62F40

    Journal ref: Journal of Machine Learning Research 2021, Vol. 22, 1-32

  34. arXiv:2002.11916  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    Tuning-free ridge estimators for high-dimensional generalized linear models

    Authors: Shih-Ting Huang, Fang Xie, Johannes Lederer

    Abstract: Ridge estimators regularize the squared Euclidean lengths of parameters. Such estimators are mathematically and computationally attractive but involve tuning parameters that can be difficult to calibrate. In this paper, we show that ridge estimators can be modified such that tuning parameters can be avoided altogether. We also show that these modified versions can improve on the empirical predicti… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  35. arXiv:1909.10635  [pdf, other

    stat.AP stat.CO stat.ME stat.ML

    Tuning parameter calibration for prediction in personalized medicine

    Authors: Shih-Ting Huang, Yannick Düren, Kristoffer H. Hellton, Johannes Lederer

    Abstract: Personalized medicine has become an important part of medicine, for instance predicting individual drug responses based on genomic information. However, many current statistical methods are not tailored to this task, because they overlook the individual heterogeneity of patients. In this paper, we look at personalized medicine from a linear regression standpoint. We introduce an alternative versio… ▽ More

    Submitted 2 October, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

  36. arXiv:1907.03808  [pdf, other

    stat.ME q-bio.QM stat.AP

    False Discovery Rates in Biological Networks

    Authors: Lu Yu, Tobias Kaufmann, Johannes Lederer

    Abstract: The increasing availability of data has generated unprecedented prospects for network analyses in many biological fields, such as neuroscience (e.g., brain networks), genomics (e.g., gene-gene interaction networks), and ecology (e.g., species interaction networks). A powerful statistical framework for estimating such networks is Gaussian graphical models, but standard estimators for the correspond… ▽ More

    Submitted 4 February, 2021; v1 submitted 8 July, 2019; originally announced July 2019.

  37. arXiv:1907.03807  [pdf, other

    stat.ME q-bio.QM stat.AP

    Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data

    Authors: Fang Xie, Johannes Lederer

    Abstract: Recent discoveries suggest that our gut microbiome plays an important role in our health and wellbeing. However, the gut microbiome data are intricate; for example, the microbial diversity in the gut makes the data high-dimensional. While there are dedicated high-dimensional methods, such as the lasso estimator, they always come with the risk of false discoveries. Knockoffs are a recent approach t… ▽ More

    Submitted 1 March, 2021; v1 submitted 8 July, 2019; originally announced July 2019.

    Journal ref: Entropy 23(2021) 230

  38. arXiv:1812.07691  [pdf, other

    stat.AP stat.ME

    Efficiency in Lung Transplant Allocation Strategies

    Authors: **g**g Zou, David J. Lederer, Daniel Rabinowitz

    Abstract: Currently in the United States, lung transplantations are allocated to candidates according to the candidates' Lung Allocation Score (LAS). The LAS is an ad-hoc ranking system for patients' priorities of transplantation. The goal of this study is to develop a framework for improving patients' life expectancy over the LAS based on a comprehensive modeling of the lung transplantation waiting list. P… ▽ More

    Submitted 16 April, 2020; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: 36 pages of main text, 10 figures

  39. arXiv:1801.01394  [pdf, ps, other

    math.ST

    Prediction Error Bounds for Linear Regression With the TREX

    Authors: Jacob Bien, Irina Gaynanova, Johannes Lederer, Christian Müller

    Abstract: The TREX is a recently introduced approach to sparse linear regression. In contrast to most well-known approaches to penalized regression, the TREX can be formulated without the use of tuning parameters. In this paper, we establish the first known prediction error bounds for the TREX. Additionally, we introduce extensions of the TREX to a more general class of penalties, and we provide a bound on… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

  40. arXiv:1710.02950  [pdf, ps, other

    stat.ML math.ST

    Maximum Regularized Likelihood Estimators: A General Prediction Theory and Applications

    Authors: Rui Zhuang, Johannes Lederer

    Abstract: Maximum regularized likelihood estimators (MRLEs) are arguably the most established class of estimators in high-dimensional statistics. In this paper, we derive guarantees for MRLEs in Kullback-Leibler divergence, a general measure of prediction accuracy. We assume only that the densities have a convex parametrization and that the regularization is definite and positive homogenous. The results thu… ▽ More

    Submitted 17 October, 2018; v1 submitted 9 October, 2017; originally announced October 2017.

  41. arXiv:1708.05499  [pdf, ps, other

    math.ST

    Inference for high-dimensional instrumental variables regression

    Authors: David Gold, Johannes Lederer, **g Tao

    Abstract: This paper concerns statistical inference for the components of a high-dimensional regression parameter despite possible endogeneity of each regressor. Given a first-stage linear model for the endogenous regressors and a second-stage linear model for the dependent variable, we develop a novel adaptation of the parametric one-step update to a generic second-stage estimator. We provide conditions un… ▽ More

    Submitted 21 November, 2019; v1 submitted 17 August, 2017; originally announced August 2017.

    Comments: 53 pages

  42. arXiv:1704.02739  [pdf, other

    stat.ML stat.AP stat.ME

    Integrating Additional Knowledge Into Estimation of Graphical Models

    Authors: Yunqi Bu, Johannes Lederer

    Abstract: In applications of graphical models, we typically have more information than just the samples themselves. A prime example is the estimation of brain connectivity networks based on fMRI data, where in addition to the samples themselves, the spatial positions of the measurements are readily available. With particular regard for this application, we are thus interested in ways to incorporate addition… ▽ More

    Submitted 20 April, 2017; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: 16 pages, 4 figures, 1 table

  43. arXiv:1610.00207  [pdf, other

    stat.ME math.ST stat.ML

    Tuning parameter calibration for $\ell_1$-regularized logistic regression

    Authors: Wei Li, Johannes Lederer

    Abstract: Feature selection is a standard approach to understanding and modeling high-dimensional classification data, but the corresponding statistical methods hinge on tuning parameters that are difficult to calibrate. In particular, existing calibration schemes in the logistic regression framework lack any finite sample guarantees. In this paper, we introduce a novel calibration scheme for $\ell_1$-penal… ▽ More

    Submitted 28 February, 2019; v1 submitted 1 October, 2016; originally announced October 2016.

  44. arXiv:1609.07195  [pdf, ps, other

    stat.ME stat.CO stat.ML

    Balancing Statistical and Computational Precision: A General Theory and Applications to Sparse Regression

    Authors: Mahsa Taheri, Néhémy Lim, Johannes Lederer

    Abstract: Modern technologies are generating ever-increasing amounts of data. Making use of these data requires methods that are both statistically sound and computationally efficient. Typically, the statistical and computational aspects are treated separately. In this paper, we propose an approach to entangle these two aspects in the context of regularized estimation. Applying our approach to sparse and gr… ▽ More

    Submitted 14 September, 2022; v1 submitted 22 September, 2016; originally announced September 2016.

  45. arXiv:1609.05551  [pdf, other

    math.ST stat.OT

    Graphical Models for Discrete and Continuous Data

    Authors: Rui Zhuang, Noah Simon, Johannes Lederer

    Abstract: We introduce a general framework for undirected graphical models. It generalizes Gaussian graphical models to a wide range of continuous, discrete, and combinations of different types of data. The models in the framework, called exponential trace models, are amenable to estimation based on maximum likelihood. We introduce a sampling-based approximation algorithm for computing the maximum likelihoo… ▽ More

    Submitted 15 June, 2019; v1 submitted 18 September, 2016; originally announced September 2016.

  46. arXiv:1608.00624  [pdf, ps, other

    math.ST stat.ML

    Oracle Inequalities for High-dimensional Prediction

    Authors: Johannes Lederer, Lu Yu, Irina Gaynanova

    Abstract: The abundance of high-dimensional data in the modern sciences has generated tremendous interest in penalized estimators such as the lasso, scaled lasso, square-root lasso, elastic net, and many others. In this paper, we establish a general oracle inequality for prediction in high-dimensional linear regression with such methods. Since the proof relies only on convexity and continuity arguments, the… ▽ More

    Submitted 13 March, 2018; v1 submitted 1 August, 2016; originally announced August 2016.

  47. arXiv:1604.06815  [pdf, other

    stat.ML cs.OH stat.CO stat.ME

    Non-convex Global Minimization and False Discovery Rate Control for the TREX

    Authors: Jacob Bien, Irina Gaynanova, Johannes Lederer, Christian Müller

    Abstract: The TREX is a recently introduced method for performing sparse high-dimensional regression. Despite its statistical promise as an alternative to the lasso, square-root lasso, and scaled lasso, the TREX is computationally challenging in that it requires solving a non-convex optimization problem. This paper shows a remarkable result: despite the non-convexity of the TREX problem, there exists a poly… ▽ More

    Submitted 20 September, 2016; v1 submitted 22 April, 2016; originally announced April 2016.

    Journal ref: Journal of Computational and Graphical Statistics 2017, Vol. 27, No. 1, 23-33

  48. arXiv:1410.7279  [pdf, other

    stat.ML stat.ME

    Topology Adaptive Graph Estimation in High Dimensions

    Authors: Johannes Lederer, Christian Müller

    Abstract: We introduce Graphical TREX (GTREX), a novel method for graph estimation in high-dimensional Gaussian graphical models. By conducting neighborhood selection with TREX, GTREX avoids tuning parameters and is adaptive to the graph topology. We compare GTREX with standard methods on a new simulation set-up that is designed to assess accurately the strengths and shortcomings of different methods. These… ▽ More

    Submitted 27 October, 2014; originally announced October 2014.

  49. arXiv:1410.5014  [pdf, other

    stat.ME math.ST

    Optimal Two-Step Prediction in Regression

    Authors: Didier Chételat, Johannes Lederer, Joseph Salmon

    Abstract: High-dimensional prediction typically comprises two steps: variable selection and subsequent least-squares refitting on the selected variables. However, the standard variable selection procedures, such as the lasso, hinge on tuning parameters that need to be calibrated. Cross-validation, the most popular calibration scheme, is computationally costly and lacks finite sample guarantees. In this pape… ▽ More

    Submitted 5 June, 2017; v1 submitted 18 October, 2014; originally announced October 2014.

  50. arXiv:1410.0247  [pdf, ps, other

    stat.ME math.ST

    A Practical Scheme and Fast Algorithm to Tune the Lasso With Optimality Guarantees

    Authors: Michaël Chichignoud, Johannes Lederer, Martin Wainwright

    Abstract: We introduce a novel scheme for choosing the regularization parameter in high-dimensional linear regression with Lasso. This scheme, inspired by Lepski's method for bandwidth selection in non-parametric regression, is equipped with both optimal finite-sample guarantees and a fast algorithm. In particular, for any design matrix such that the Lasso has low sup-norm error under an "oracle choice" of… ▽ More

    Submitted 8 November, 2016; v1 submitted 1 October, 2014; originally announced October 2014.