Showing 1–15 of 15 results for author: Liu, J Z

Searching in archive cs.
  1. arXiv:2305.07618  [pdf]

    cs.CV cs.LG eess.IV

    Uncertainty Estimation and Out-of-Distribution Detection for Deep Learning-Based Image Reconstruction using the Local Lipschitz

    Authors: Danyal F. Bhutto, Bo Zhu, Jeremiah Z. Liu, Neha Koonjoo, Hongwei B. Li, Bruce R. Rosen, Matthew S. Rosen

    Abstract: Accurate image reconstruction is at the heart of diagnostics in medical imaging. Supervised deep learning-based approaches have been investigated for solving inverse problems including image reconstruction. However, these trained models encounter unseen data distributions that are widely shifted from training data during deployment. Therefore, it is essential to assess whether a given input falls…

    Submitted 1 December, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

  2. arXiv:2302.06235  [pdf, other]

    cs.LG cs.CV stat.ML

    A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models

    Authors: James Urquhart Allingham, Jie Ren, Michael W Dusenberry, Xiuye Gu, Yin Cui, Dustin Tran, Jeremiah Zhe Liu, Balaji Lakshminarayanan

    Abstract: Contrastively trained text-image models have the remarkable ability to perform zero-shot classification, that is, classifying previously unseen images into categories that the model has never been explicitly trained to identify. However, these zero-shot classifiers need prompt engineering to achieve high accuracy. Prompt engineering typically requires hand-crafting a set of prompts for individual…

    Submitted 15 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023. 23 pages, 10 tables, 3 figures

  3. arXiv:2302.05807  [pdf, other]

    cs.LG stat.ML

    Pushing the Accuracy-Group Robustness Frontier with Introspective Self-play

    Authors: Jeremiah Zhe Liu, Krishnamurthy Dj Dvijotham, Jihyeon Lee, Quan Yuan, Martin Strobel, Balaji Lakshminarayanan, Deepak Ramachandran

    Abstract: Standard empirical risk minimization (ERM) training can produce deep neural network (DNN) models that are accurate on average but under-perform in under-represented population subgroups, especially when there are imbalanced group distributions in the long-tailed training data. Therefore, approaches that improve the accuracy-group robustness trade-off frontier of a DNN model (i.e. improving worst-g…

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted to ICLR 2023. Included additional contribution from Martin Strobel

  4. arXiv:2205.00403  [pdf, other]

    cs.LG stat.ML

    A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness

    Authors: Jeremiah Zhe Liu, Shreyas Padhy, Jie Ren, Zi Lin, Yeming Wen, Ghassen Jerfel, Zack Nado, Jasper Snoek, Dustin Tran, Balaji Lakshminarayanan

    Abstract: Accurate uncertainty quantification is a major challenge in deep learning, as neural networks can make overconfident errors and assign high confidence predictions to out-of-distribution (OOD) inputs. The most popular approaches to estimate predictive uncertainty in deep learning are methods that combine predictions from multiple neural networks, such as Bayesian neural networks (BNNs) and deep ens…

    Submitted 30 December, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2006.10108

  5. arXiv:2204.07293  [pdf, other]

    stat.ML cs.LG

    Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees

    Authors: Wenying Deng, Beau Coker, Rajarshi Mukherjee, Jeremiah Zhe Liu, Brent A. Coull

    Abstract: We develop a simple and unified framework for nonlinear variable selection that incorporates uncertainty in the prediction function and is compatible with a wide range of machine learning models (e.g., tree ensembles, kernel methods, neural networks, etc). In particular, for a learned nonlinear model $f(\mathbf{x})$, we consider quantifying the importance of an input variable $\mathbf{x}^j$ using…

    Submitted 27 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 50 pages, 16 figures, 11 tables

  6. arXiv:2010.06610  [pdf, other]

    cs.LG cs.CV stat.ML

    Training independent subnetworks for robust prediction

    Authors: Marton Havasi, Rodolphe Jenatton, Stanislav Fort, Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew M. Dai, Dustin Tran

    Abstract: Recent approaches to efficiently ensemble neural networks have shown that strong robustness and uncertainty performance can be achieved with a negligible gain in parameters over the original network. However, these methods still require multiple forward passes for prediction, leading to a significant computational cost. In this work, we show a surprising result: the benefits of using multiple pred…

    Submitted 4 August, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Updated to the ICLR camera ready version, added reference to Soflaei et al. 2020

  7. arXiv:2010.01791  [pdf, other]

    cs.CL

    Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior

    Authors: Zi Lin, Jeremiah Zhe Liu, Zi Yang, Nan Hua, Dan Roth

    Abstract: Traditional (unstructured) pruning methods for a Transformer model focus on regularizing the individual weights by penalizing them toward zero. In this work, we explore spectral-normalized identity priors (SNIP), a structured pruning approach that penalizes an entire residual module in a Transformer model toward an identity mapping. Our method identifies and discards unimportant non-linear mapping…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  8. arXiv:2006.16829  [pdf, other]

    cs.CV

    You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network

    Authors: Boyun Li, Yuanbiao Gou, Shuhang Gu, Jerry Zitao Liu, Joey Tianyi Zhou, Xi Peng

    Abstract: In this paper, we study two challenging and less-touched problems in single image dehazing, namely, how to make deep learning achieve image dehazing without training on the ground-truth clean image (unsupervised) and an image collection (untrained). An unsupervised neural network will avoid the intensive labor collection of hazy-clean image pairs, and an untrained model is a "real" single image d…

    Submitted 30 June, 2020; originally announced June 2020.

  9. arXiv:2006.10108  [pdf, other]

    cs.LG stat.ML

    Simple and Principled Uncertainty Estimation with Deterministic Deep Learning via Distance Awareness

    Authors: Jeremiah Zhe Liu, Zi Lin, Shreyas Padhy, Dustin Tran, Tania Bedrax-Weiss, Balaji Lakshminarayanan

    Abstract: Bayesian neural networks (BNN) and deep ensembles are principled approaches to estimate the predictive uncertainty of a deep learning model. However their practicality in real-time, industrial-scale applications are limited due to their heavy memory and inference cost. This motivates us to study principled approaches to high-quality uncertainty estimation that require only a single deep neural net…

    Submitted 25 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  10. arXiv:1912.01189  [pdf, other]

    stat.ML cs.LG

    Variable Selection with Rigorous Uncertainty Quantification using Deep Bayesian Neural Networks: Posterior Concentration and Bernstein-von Mises Phenomenon

    Authors: Jeremiah Zhe Liu

    Abstract: This work develops rigorous theoretical basis for the fact that deep Bayesian neural network (BNN) is an effective tool for high-dimensional variable selection with rigorous uncertainty quantification. We develop new Bayesian non-parametric theorems to show that a properly configured deep BNN (1) learns the variable importance effectively in high dimensions, and its learning rate can sometimes "br…

    Submitted 2 December, 2019; originally announced December 2019.

  11. arXiv:1911.04061  [pdf, other]

    cs.LG stat.ML

    Accurate Uncertainty Estimation and Decomposition in Ensemble Learning

    Authors: Jeremiah Zhe Liu, John Paisley, Marianthi-Anna Kioumourtzoglou, Brent Coull

    Abstract: Ensemble learning is a standard approach to building machine learning systems that capture complex phenomena in real-world data. An important aspect of these systems is the complete and valid quantification of model uncertainty. We introduce a Bayesian nonparametric ensemble (BNE) approach that augments an existing ensemble model to account for different sources of model uncertainty. BNE augments…

    Submitted 10 November, 2019; originally announced November 2019.

  12. arXiv:1904.09632  [pdf, ps, other]

    math.ST cs.LG stat.ME

    Gaussian Process Regression and Classification under Mathematical Constraints with Learning Guarantees

    Authors: Jeremiah Zhe Liu

    Abstract: We introduce constrained Gaussian process (CGP), a Gaussian process model for random functions that allows easy placement of mathematical constraints (e.g., non-negativity, monotonicity, etc) on its sample functions. CGP comes with closed-form probability density function (PDF), and has the attractive feature that its posterior distributions for regression and classification are again CGPs with clo…

    Submitted 21 April, 2019; originally announced April 2019.

  13. arXiv:1812.03350  [pdf, other]

    cs.LG stat.ML

    Adaptive and Calibrated Ensemble Learning with Dependent Tail-free Process

    Authors: Jeremiah Zhe Liu, John Paisley, Marianthi-Anna Kioumourtzoglou, Brent A. Coull

    Abstract: Ensemble learning is a mainstay in modern data science practice. Conventional ensemble algorithms assign to base models a set of deterministic, constant model weights that (1) do not fully account for variations in base model accuracy across subgroups, nor (2) provide uncertainty estimates for the ensemble prediction, which could result in mis-calibrated (i.e. precise but biased) predictions that…

    Submitted 19 December, 2018; v1 submitted 8 December, 2018; originally announced December 2018.

    Comments: Work-in-progress manuscript appeared at Bayesian Nonparametrics Workshop, Neural Information Processing Systems 2018

  14. arXiv:1711.07601  [pdf, other]

    cs.IR cs.LG cs.PF cs.SI

    Pixie: A System for Recommending 3+ Billion Items to 200+ Million Users in Real-Time

    Authors: Chantat Eksombatchai, Pranav Jindal, Jerry Zitao Liu, Yuchen Liu, Rahul Sharma, Charles Sugnet, Mark Ulrich, Jure Leskovec

    Abstract: User experience in modern content discovery applications critically depends on high-quality personalized recommendations. However, building systems that provide such recommendations presents a major challenge due to a massive pool of items, a large number of users, and requirements for recommendations to be responsive to user actions and generated on demand in real-time. Here we present Pixie, a s…

    Submitted 20 November, 2017; originally announced November 2017.

  15. Image reconstruction by domain transform manifold learning

    Authors: Bo Zhu, Jeremiah Z. Liu, Bruce R. Rosen, Matthew S. Rosen

    Abstract: Image reconstruction plays a critical role in the implementation of all contemporary imaging modalities across the physical and life sciences including optical, MRI, CT, PET, and radio astronomy. During an image acquisition, the sensor encodes an intermediate representation of an object in the sensor domain, which is subsequently reconstructed into an image by an inversion of the encoding function…

    Submitted 28 April, 2017; originally announced April 2017.

    Comments: 18 pages, 4 figures