
Showing 1–42 of 42 results for author: Mueller, J

Searching in archive stat.
  1. arXiv:2406.16629  [pdf]

    cs.IR stat.AP

    Meta-experiments: Improving experimentation through experimentation

    Authors: Melanie J. I. Müller

    Abstract: A/B testing is widely used in the industry to optimize customer facing websites. Many companies employ experimentation specialists to facilitate and improve the process of A/B testing. Here, we present the application of A/B testing to this improvement effort itself, by running experiments on the experimentation process, which we call 'meta-experiments'. We discuss the challenges of this approach…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures, 1 table

  2. arXiv:2403.19448  [pdf, other]

    math.OC cs.LG eess.SY math.NA stat.ML

    Fisher-Rao Gradient Flows of Linear Programs and State-Action Natural Policy Gradients

    Authors: Johannes Müller, Semih Çaycı, Guido Montúfar

    Abstract: Kakade's natural policy gradient method has been studied extensively in the last years showing linear convergence with and without regularization. We study another natural gradient method which is based on the Fisher information matrix of the state-action distributions and has received little attention from the theoretical side. Here, the state-action distributions follow the Fisher-Rao gradient f…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 4 figures, under review

    MSC Class: 65K05; 90C05; 90C08; 90C40; 90C53

  3. arXiv:2312.03654  [pdf, other]

    cs.CE cs.AI cs.LG cs.NE stat.ML

    Efficient Inverse Design Optimization through Multi-fidelity Simulations, Machine Learning, and Search Space Reduction Strategies

    Authors: Luka Grbcic, Juliane Müller, Wibe Albert de Jong

    Abstract: This paper introduces a methodology designed to augment the inverse design optimization process in scenarios constrained by limited compute, through the strategic synergy of multi-fidelity evaluations, machine learning models, and optimization algorithms. The proposed methodology is analyzed on two distinct engineering inverse design problems: airfoil inverse design and the scalar field reconstruc…

    Submitted 3 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  4. arXiv:2311.00553  [pdf, other]

    stat.ME stat.CO stat.ML

    Polynomial Chaos Surrogate Construction for Random Fields with Parametric Uncertainty

    Authors: Joy N. Mueller, Khachik Sargsyan, Craig J. Daniels, Habib N. Najm

    Abstract: Engineering and applied science rely on computational experiments to rigorously study physical systems. The mathematical models used to probe these systems are highly complex, and sampling-intensive studies often require prohibitively many simulations for acceptable accuracy. Surrogate models provide a means of circumventing the high computational expense of sampling such complex models. In partic…

    Submitted 17 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    MSC Class: 60G99; 65C20; 33C45; 62G07; 62J02

  5. arXiv:2307.08370  [pdf, other]

    math.DS q-bio.PE stat.ME

    Parameter estimation for contact tracing in graph-based models

    Authors: Augustine Okolie, Johannes Müller, Mirjam Kretzschmar

    Abstract: We adopt a maximum-likelihood framework to estimate parameters of a stochastic susceptible-infected-recovered (SIR) model with contact tracing on a rooted random tree. Given the number of detectees per index case, our estimator allows to determine the degree distribution of the random tree as well as the tracing probability. Since we do not discover all infectees via contact tracing, this estimati…

    Submitted 22 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 24 pages, 8 figures, 3 tables

    MSC Class: 92D30

    Journal ref: Royal Society Interface 2023

  6. arXiv:2306.13520  [pdf, other]

    cs.LG stat.ML

    On the Convergence Rate of Gaussianization with Random Rotations

    Authors: Felix Draxler, Lars Kühmichel, Armand Rousselot, Jens Müller, Christoph Schnörr, Ullrich Köthe

    Abstract: Gaussianization is a simple generative model that can be trained without backpropagation. It has shown compelling performance on low dimensional data. As the dimension increases, however, it has been observed that the convergence speed slows down. We show analytically that the number of required layers scales linearly with the dimension for Gaussian input. We argue that this is because the model i…

    Submitted 23 June, 2023; originally announced June 2023.

  7. arXiv:2305.16583  [pdf, other]

    stat.ML cs.LG

    Detecting Errors in a Numerical Response via any Regression Model

    Authors: Hang Zhou, Jonas Mueller, Mayank Kumar, Jane-Ling Wang, Jing Lei

    Abstract: Noise plagues many numerical datasets, where the recorded values in the data may fail to match the true underlying values due to reasons including: erroneous sensors, data entry/processing mistakes, or imperfect human estimates. We consider general regression settings with covariates and a potentially corrupted response whose observed values may contain errors. By accounting for various uncertaint…

    Submitted 12 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2303.18022  [pdf, other]

    cs.CV stat.ML

    The Topology-Overlap Trade-Off in Retinal Arteriole-Venule Segmentation

    Authors: Angel Victor Juanco Muller, Joao F. C. Mota, Keith A. Goatman, Corne Hoogendoorn

    Abstract: Retinal fundus images can be an invaluable diagnosis tool for screening epidemic diseases like hypertension or diabetes. And they become especially useful when the arterioles and venules they depict are clearly identified and annotated. However, manual annotation of these vessels is extremely time demanding and taxing, which calls for automatic segmentation. Although convolutional neural networks…

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: To be published in proceedings of SPIE Medical Imaging 2023 Image Processing

  9. arXiv:2303.09989  [pdf, other]

    cs.LG stat.ML

    Finding Competence Regions in Domain Generalization

    Authors: Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Köthe

    Abstract: We investigate a "learning to reject" framework to address the problem of silent failures in Domain Generalization (DG), where the test distribution differs from the training distribution. Assuming a mild distribution shift, we wish to accept out-of-distribution (OOD) data from a new domain whenever a model's estimated competence foresees trustworthy responses, instead of rejecting OOD data outrig…

    Submitted 21 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: The paper has been published at TMLR (see https://openreview.net/forum?id=TSy0vuwQFN)

    Journal ref: Transactions on Machine Learning Research (06/2023)

  10. arXiv:2301.11856  [pdf, other]

    cs.LG stat.ML

    ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

    Authors: Hui Wen Goh, Jonas Mueller

    Abstract: In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming to train an accurate classifier by collecting a dataset with the fewest total annotations. Here we propose ActiveLab, a practical method to decide what to label…

    Submitted 27 January, 2023; originally announced January 2023.

  11. arXiv:2210.06812  [pdf, other]

    cs.LG cs.HC stat.ML

    CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

    Authors: Hui Wen Goh, Ulyana Tkachenko, Jonas Mueller

    Abstract: Real-world data for classification is often labeled by multiple annotators. For analyzing such data, we introduce CROWDLAB, a straightforward approach to utilize any trained classifier to estimate: (1) A consensus label for each example that aggregates the available annotations; (2) A confidence score for how likely each consensus label is correct; (3) A rating for each annotator quantifying the o…

    Submitted 27 January, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS 2022 Human in the Loop Learning Workshop

  12. arXiv:2207.03061  [pdf, other]

    cs.LG cs.CV stat.ML

    Back to the Basics: Revisiting Out-of-Distribution Detection Baselines

    Authors: Johnson Kuan, Jonas Mueller

    Abstract: We study simple methods for out-of-distribution (OOD) image detection that are compatible with any already trained classifier, relying on only its predictions or learned representations. Evaluating the OOD detection performance of various methods when utilized with ResNet-50 and Swin Transformer models, we find methods that solely consider the model's predictions can be easily outperformed by also…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: ICML Workshop on Principles of Distribution Shift 2022

  13. arXiv:2207.01279  [pdf, other]

    stat.ME

    Joint lifetime modelling with matrix distributions

    Authors: Albrecher Hansjörg, Bladt Martin, Alaric J. A Müller

    Abstract: Acyclic phase-type (PH) distributions have been a popular tool in survival analysis, thanks to their natural interpretation in terms of ageing towards its inevitable absorption. In this paper, we consider an extension to the bivariate setting for the modelling of joint lifetimes. In contrast to previous models in the literature that were based on a separate estimation of the marginal behavior and…

    Submitted 3 October, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  14. arXiv:2206.07449  [pdf, other]

    eess.SP cs.RO eess.SY math.PR stat.AP

    Self-Assessment for Single-Object Tracking in Clutter Using Subjective Logic

    Authors: Thomas Griebel, Johannes Müller, Paul Geisler, Charlotte Hermann, Martin Herrmann, Michael Buchholz, Klaus Dietmayer

    Abstract: Reliable tracking algorithms are essential for automated driving. However, the existing consistency measures are not sufficient to meet the increasing safety demands in the automotive sector. Therefore, this work presents a novel method for self-assessment of single-object tracking in clutter based on Kalman filtering and subjective logic. A key feature of the approach is that it additionally prov…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted for presentation at the 2022 IEEE 25th International Conference on Information Fusion (FUSION), July 4 - 7, 2022, Linköping, Sweden

  15. arXiv:2203.09438  [pdf, other]

    cs.LG stat.ML

    An Explainable Stacked Ensemble Model for Static Route-Free Estimation of Time of Arrival

    Authors: Sören Schleibaum, Jörg P. Müller, Monika Sester

    Abstract: To compare alternative taxi schedules and to compute them, as well as to provide insights into an upcoming taxi trip to drivers and passengers, the duration of a trip or its Estimated Time of Arrival (ETA) is predicted. To reach a high prediction precision, machine learning models for ETA are state of the art. One yet unexploited option to further increase prediction precision is to combine multip…

    Submitted 11 January, 2024; v1 submitted 17 March, 2022; originally announced March 2022.

  16. arXiv:2202.12441  [pdf, other]

    cs.LG math.OC stat.AP

    Long-Term Missing Value Imputation for Time Series Data Using Deep Neural Networks

    Authors: Jangho Park, Juliane Muller, Bhavna Arora, Boris Faybishenko, Gilberto Pastorello, Charuleka Varadharajan, Reetik Sahu, Deborah Agarwal

    Abstract: We present an approach that uses a deep learning model, in particular, a MultiLayer Perceptron (MLP), for estimating the missing values of a variable in multivariate time series data. We focus on filling a long continuous gap (e.g., multiple months of missing daily observations) rather than on individual randomly missing observations. Our proposed gap filling algorithm uses an automated method for…

    Submitted 24 February, 2022; originally announced February 2022.

  17. arXiv:2111.02705  [pdf, other]

    cs.LG cs.CL stat.ML

    Benchmarking Multimodal AutoML for Tabular Data with Text Fields

    Authors: Xingjian Shi, Jonas Mueller, Nick Erickson, Mu Li, Alexander J. Smola

    Abstract: We consider the use of automated supervised learning systems for data tables that not only contain numeric/categorical columns, but one or more text fields as well. Here we assemble 18 multimodal data tables that each contain some text fields and stem from a real business application. Our publicly-available benchmark enables researchers to comprehensively evaluate their own methods for supervised…

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: Proceedings of the Neural Information Processing Systems (NeurIPS) Track on Datasets and Benchmarks 2021

  18. arXiv:2106.10414  [pdf, other]

    stat.ML cs.LG

    Deep Learning for Functional Data Analysis with Adaptive Basis Layers

    Authors: Junwen Yao, Jonas Mueller, Jane-Ling Wang

    Abstract: Despite their widespread success, the application of deep neural networks to functional data remains scarce today. The infinite dimensionality of functional data means standard learning algorithms can be applied only after appropriate dimension reduction, typically achieved via basis expansions. Currently, these bases are chosen a priori without the information for the task at hand and thus may no…

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  19. arXiv:2105.05334  [pdf, other]

    q-bio.MN stat.CO

    Coupling from the Past for the Stochastic Simulation of Chemical Reaction Networks

    Authors: J. N. Mueller, J. N. Corcoran

    Abstract: Chemical reaction networks (CRNs) are fundamental computational models used to study the behavior of chemical reactions in well-mixed solutions. They have been used extensively to model a broad range of biological systems, and are primarily used when the more traditional model of deterministic continuous mass action kinetics is invalid due to small molecular counts. We present a perfect sampling a…

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: 27 pages, 30 figures

    MSC Class: 60J27; 60J28; 60K30

    ACM Class: I.6.3; I.6.8

  20. arXiv:2103.14749  [pdf, other]

    stat.ML cs.AI cs.LG

    Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks

    Authors: Curtis G. Northcutt, Anish Athalye, Jonas Mueller

    Abstract: We identify label errors in the test sets of 10 of the most commonly-used computer vision, natural language, and audio datasets, and subsequently study the potential for these label errors to affect benchmark results. Errors in test sets are numerous and widespread: we estimate an average of at least 3.3% errors across the 10 datasets, where for example label errors comprise at least 6% of the Ima…

    Submitted 7 November, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Demo available at https://labelerrors.com/ and source code available at https://github.com/cleanlab/label-errors

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  21. arXiv:2103.00083  [pdf, other]

    stat.ML cs.LG

    Flexible Model Aggregation for Quantile Regression

    Authors: Rasool Fakoor, Taesup Kim, Jonas Mueller, Alexander J. Smola, Ryan J. Tibshirani

    Abstract: Quantile regression is a fundamental problem in statistical learning motivated by a need to quantify uncertainty in predictions, or to model a diverse population without being overly reductive. For instance, epidemiological forecasts, cost estimates, and revenue predictions all benefit from being able to quantify the range of possible values accurately. As such, many models have been developed for…

    Submitted 15 April, 2023; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted at JMLR 2023

  22. arXiv:2102.09225  [pdf, other]

    cs.LG stat.ML

    Continuous Doubly Constrained Batch Reinforcement Learning

    Authors: Rasool Fakoor, Jonas Mueller, Kavosh Asadi, Pratik Chaudhari, Alexander J. Smola

    Abstract: Reliant on too many experiments to learn good actions, current Reinforcement Learning (RL) algorithms have limited applicability in real-world settings, which can be too expensive to allow exploration. We propose an algorithm for batch RL, where effective policies are learned using only a fixed offline dataset instead of online interactions with the environment. The limited data in batch RL produc…

    Submitted 6 December, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021 conference paper

  23. arXiv:2010.07167  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Robust Models Using The Principle of Independent Causal Mechanisms

    Authors: Jens Müller, Robert Schmier, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Standard supervised learning breaks down under data distribution shift. However, the principle of independent causal mechanisms (ICM, Peters et al. (2017)) can turn this weakness into an opportunity: one can take advantage of distribution shift between different environments during training in order to obtain more robust models. We propose a new gradient-based learning framework whose objective fu…

    Submitted 8 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  24. arXiv:2006.14284  [pdf, other]

    cs.LG stat.ML

    Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation

    Authors: Rasool Fakoor, Jonas Mueller, Nick Erickson, Pratik Chaudhari, Alexander J. Smola

    Abstract: Automated machine learning (AutoML) can produce complex model ensembles by stacking, bagging, and boosting many individual models like trees, deep networks, and nearest neighbor estimators. While highly accurate, the resulting predictors are large, slow, and opaque as compared to their constituents. To improve the deployment of AutoML on tabular data, we propose FAST-DAD to distill arbitrarily com…

    Submitted 25 June, 2020; originally announced June 2020.

    Journal ref: NeurIPS 2020

  25. arXiv:2004.02441  [pdf, other]

    cs.LG stat.ML

    TraDE: Transformers for Density Estimation

    Authors: Rasool Fakoor, Pratik Chaudhari, Jonas Mueller, Alexander J. Smola

    Abstract: We present TraDE, a self-attention-based architecture for auto-regressive density estimation with continuous and discrete valued data. Our model is trained using a penalized maximum likelihood objective, which ensures that samples from the density estimate resemble the training data distribution. The use of self-attention means that the model need not retain conditional sufficient statistics durin…

    Submitted 14 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  26. arXiv:2003.08907  [pdf, other]

    cs.LG cs.CV stat.ML

    Overinterpretation reveals image classification model pathologies

    Authors: Brandon Carter, Siddhartha Jain, Jonas Mueller, David Gifford

    Abstract: Image classifiers are typically scored on their test set accuracy, but high accuracy can mask a subtle type of model failure. We find that high scoring convolutional neural networks (CNNs) on popular benchmarks exhibit troubling pathologies that allow them to display high accuracy even in the absence of semantically salient features. When a model provides a high-confidence decision without salient…

    Submitted 7 December, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2021

  27. arXiv:2003.06505  [pdf, other]

    stat.ML cs.LG

    AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

    Authors: Nick Erickson, Jonas Mueller, Alexander Shirkov, Hang Zhang, Pedro Larroy, Mu Li, Alexander Smola

    Abstract: We introduce AutoGluon-Tabular, an open-source AutoML framework that requires only a single line of Python to train highly accurate machine learning models on an unprocessed tabular dataset such as a CSV file. Unlike existing AutoML frameworks that primarily focus on model/hyperparameter selection, AutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers. Exper…

    Submitted 13 March, 2020; originally announced March 2020.

  28. arXiv:1911.13060  [pdf, other]

    cs.LG eess.IV stat.ML

    Orthogonal Wasserstein GANs

    Authors: Jan Müller, Reinhard Klein, Michael Weinmann

    Abstract: Wasserstein-GANs have been introduced to address the deficiencies of generative adversarial networks (GANs) regarding the problems of vanishing gradients and mode collapse during the training, leading to improved convergence behaviour and improved image quality. However, Wasserstein-GANs require the discriminator to be Lipschitz continuous. In current state-of-the-art Wasserstein-GANs this constra…

    Submitted 14 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: Correction of the formatting of the appendix

    MSC Class: I.2.6

    ACM Class: I.2.6

  29. arXiv:1910.09599  [pdf, ps, other]

    cs.LG cs.NE math.NA stat.ML

    On the space-time expressivity of ResNets

    Authors: Johannes Müller

    Abstract: Residual networks (ResNets) are a deep learning architecture that substantially improved the state of the art performance in certain supervised learning tasks. Since then, they have received continuously growing attention. ResNets have a recursive structure $x_{k+1} = x_k + R_k(x_k)$ where $R_k$ is a neural network called a residual block. This structure can be seen as the Euler discretisation of…

    Submitted 27 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Extended abstract of master's thesis; presented at the ICLR 2020 Workshop on Integration of Deep Neural Models and Differential Equations; full version of the thesis available under https://freidok.uni-freiburg.de/data/151788

  30. arXiv:1909.04844  [pdf, other]

    cs.LG cs.DB stat.ML

    Recognizing Variables from their Data via Deep Embeddings of Distributions

    Authors: Jonas Mueller, Alex Smola

    Abstract: A key obstacle in automated analytics and meta-learning is the inability to recognize when different datasets contain measurements of the same variable. Because provided attribute labels are often uninformative in practice, this task may be more robustly addressed by leveraging the data values themselves rather than just relying on their arbitrarily selected variable names. Here, we present a comp…

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: IEEE International Conference on Data Mining (ICDM), 2019

  31. arXiv:1908.10947  [pdf, other]

    stat.ML cs.LG math.OC

    Surrogate Optimization of Deep Neural Networks for Groundwater Predictions

    Authors: Juliane Mueller, Jangho Park, Reetik Sahu, Charuleka Varadharajan, Bhavna Arora, Boris Faybishenko, Deborah Agarwal

    Abstract: Sustainable management of groundwater resources under changing climatic conditions require an application of reliable and accurate predictions of groundwater levels. Mechanistic multi-scale, multi-physics simulation models are often too hard to use for this purpose, especially for groundwater managers who do not have access to the complex compute resources and data. Therefore, we analyzed the appl…

    Submitted 3 February, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: submitted to Journal of Global Optimization; main paper: 25 pages, 19 figures, 1 table; online supplement: 11 pages, 18 figures, 3 tables

    Report number: LBNL-2001234

  32. arXiv:1906.07380  [pdf, other]

    cs.LG stat.ML

    Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles

    Authors: Siddhartha Jain, Ge Liu, Jonas Mueller, David Gifford

    Abstract: The inaccuracy of neural network models on inputs that do not stem from the training data distribution is both problematic and at times unrecognized. Model uncertainty estimation can address this issue, where uncertainty estimates are often based on the variation in predictions produced by a diverse ensemble of models applied to the same input. Here we describe Maximize Overall Diversity (MOD), a…

    Submitted 12 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: 10 pages, 3 figures

  33. arXiv:1905.12777  [pdf, other]

    cs.LG cs.CL stat.ML

    Educating Text Autoencoders: Latent Representation Guidance via Denoising

    Authors: Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

    Abstract: Generative autoencoders offer a promising approach for controllable text generation by leveraging their latent sentence representations. However, current models struggle to maintain coherent latent spaces required to perform meaningful text manipulations via latent vector operations. Specifically, we demonstrate by example that neural encoders do not necessarily map similar sentences to nearby lat…

    Submitted 7 July, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: ICML 2020 camera-ready

  34. arXiv:1811.00915  [pdf, ps, other]

    cs.LG q-bio.NC stat.ML

    Convolutional Neural Networks for Epileptic Seizure Prediction

    Authors: Matthias Eberlein, Raphael Hildebrand, Ronald Tetzlaff, Nico Hoffmann, Levin Kuhlmann, Benjamin Brinkmann, Jens Müller

    Abstract: Epilepsy is the most common neurological disorder and an accurate forecast of seizures would help to overcome the patient's uncertainty and helplessness. In this contribution, we present and discuss a novel methodology for the classification of intracranial electroencephalography (iEEG) for seizure prediction. Contrary to previous approaches, we categorically refrain from an extraction of hand-cra…

    Submitted 11 April, 2023; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: accepted for MLESP 2018

    Journal ref: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

  35. arXiv:1810.03805  [pdf, other]

    cs.LG stat.ML

    What made you do this? Understanding black-box decisions with sufficient input subsets

    Authors: Brandon Carter, Jonas Mueller, Siddhartha Jain, David Gifford

    Abstract: Local explanation frameworks aim to rationalize particular decisions made by a black-box prediction model. Existing techniques are often restricted to a specific type of predictor or based on input saliency, which may be undesirably sensitive to factors unrelated to the model's decision making process. We instead propose sufficient input subsets that identify minimal subsets of features whose obse…

    Submitted 8 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Published in AISTATS 2019; Equal contribution by first two authors

  36. arXiv:1809.10784  [pdf, other]

    stat.ML cs.LG

    Adaptive Gaussian process surrogates for Bayesian inference

    Authors: Timur Takhtaganov, Juliane Müller

    Abstract: We present an adaptive approach to the construction of Gaussian process surrogates for Bayesian inference with expensive-to-evaluate forward models. Our method relies on the fully Bayesian approach to training Gaussian process models and utilizes the expected improvement idea from Bayesian global optimization. We adaptively construct training designs by maximizing the expected improvement in fit o…

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: 38 pages, submitted to the SIAM/ASA Journal on Uncertainty Quantification

    MSC Class: 62F15; 60G15; 62G08; 62K20; 62K86

  37. arXiv:1806.00050  [pdf, other]

    cs.LG cs.AI stat.ML

    Interpretable Set Functions

    Authors: Andrew Cotter, Maya Gupta, Heinrich Jiang, James Muller, Taman Narayan, Serena Wang, Tao Zhu

    Abstract: We propose learning flexible but interpretable functions that aggregate a variable-length set of permutation-invariant feature vectors to predict a label. We use a deep lattice network model so we can architect the model structure to enhance interpretability, and add monotonicity constraints between inputs-and-outputs. We then use the proposed set function to automate the engineering of dense, int…

    Submitted 31 May, 2018; originally announced June 2018.

  38. arXiv:1801.10242  [pdf, other]

    cs.LG stat.ML

    Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing

    Authors: Jonas Mueller, Vasilis Syrgkanis, Matt Taddy

    Abstract: We consider dynamic pricing with many products under an evolving but low-dimensional demand model. Assuming the temporal variation in cross-elasticities exhibits low-rank structure based on fixed (latent) features of the products, we show that the revenue maximization problem reduces to an online bandit convex optimization with side information given by the observed demands. We design dynamic pric…

    Submitted 10 September, 2019; v1 submitted 30 January, 2018; originally announced January 2018.

    Comments: NeurIPS 2019

  39. arXiv:1606.05027  [pdf, other]

    stat.ML cs.LG

    Learning Optimal Interventions

    Authors: Jonas Mueller, David N. Reshef, George Du, Tommi Jaakkola

    Abstract: Our goal is to identify beneficial interventions from observational data. We consider interventions that are narrowly focused (impacting few covariates) and may be tailored to each individual or globally enacted over a population. For applications where harmful intervention is drastically worse than proposing no change, we propose a conservative definition of the optimal intervention. Assuming the…

    Submitted 22 February, 2017; v1 submitted 15 June, 2016; originally announced June 2016.

    Comments: AISTATS 2017

    Journal ref: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR 54:1039-1047, 2017

  40. arXiv:1511.04486  [pdf, other]

    stat.ME q-bio.GN q-bio.QM

    Modeling Persistent Trends in Distributions

    Authors: Jonas Mueller, Tommi Jaakkola, David Gifford

    Abstract: We present a nonparametric framework to model a short sequence of probability distributions that vary both due to underlying effects of sequential progression and confounding noise. To distinguish between these two types of variation and estimate the sequential-progression effects, our approach leverages an assumption that these effects follow a persistent trend. This work is motivated by the rece…

    Submitted 24 May, 2017; v1 submitted 13 November, 2015; originally announced November 2015.

    Comments: To appear in: Journal of the American Statistical Association

    Journal ref: Journal of the American Statistical Association, 113(523):1296-1310, 2018

  41. arXiv:1510.08956  [pdf, other]

    stat.ML cs.LG stat.ME

    Principal Differences Analysis: Interpretable Characterization of Differences between Distributions

    Authors: Jonas Mueller, Tommi Jaakkola

    Abstract: We introduce principal differences analysis (PDA) for analyzing differences between high-dimensional distributions. The method operates by finding the projection that maximizes the Wasserstein divergence between the resulting univariate populations. Relying on the Cramer-Wold device, it requires no assumptions about the form of the underlying distributions, nor the nature of their inter-class diff…

    Submitted 29 October, 2015; originally announced October 2015.

    Comments: Advances in Neural Information Processing Systems 28 (NIPS 2015)

    Journal ref: Advances in Neural Information Processing Systems 28: 1702-1710, 2015

  42. arXiv:1110.4531  [pdf, other]

    stat.ML

    Regression for sets of polynomial equations

    Authors: Franz Johannes Király, Paul von Bünau, Jan Saputra Müller, Duncan Blythe, Frank Meinecke, Klaus-Robert Müller

    Abstract: We propose a method called ideal regression for approximating an arbitrary system of polynomial equations by a system of a particular type. Using techniques from approximate computational algebraic geometry, we show how we can solve ideal regression directly without resorting to numerical optimization. Ideal regression is useful whenever the solution to a learning problem can be described by a sys…

    Submitted 25 March, 2013; v1 submitted 20 October, 2011; originally announced October 2011.

    Comments: arXiv admin note: substantial text overlap with arXiv:1108.1483

    Journal ref: Journal of Machine Learning Research Workshop and Conference Proceedings Vol.22: Proceedings on the Fifteenth International Conference on Artificial Intelligence and Statistics, 22:628-637. 2012