Skip to main content

Showing 1–15 of 15 results for author: Talbott, W

.
  1. arXiv:2402.00251  [pdf, other

    cs.LG cs.AI cs.CL

    Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

    Authors: Yao-Hung Hubert Tsai, Walter Talbott, Jian Zhang

    Abstract: Step-by-step decision planning with large language models (LLMs) is gaining attention in AI agent development. This paper focuses on decision planning with uncertainty estimation to address the hallucination problem in language models. Existing approaches are either white-box or computationally demanding, limiting use of black-box proprietary LLMs within budgets. The paper's first contribution is… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  2. arXiv:2310.17722  [pdf, other

    cs.LG cs.AI cs.CL

    Large Language Models as Generalizable Policies for Embodied Tasks

    Authors: Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, Devon Hjelm, Alexander Toshev

    Abstract: We show that large language models (LLMs) can be adapted to be generalizable policies for embodied visual tasks. Our approach, called Large LAnguage model Reinforcement Learning Policy (LLaRP), adapts a pre-trained frozen LLM to take as input text instructions and visual egocentric observations and output actions directly in the environment. Using reinforcement learning, we train LLaRP to see and… ▽ More

    Submitted 16 April, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  3. arXiv:2306.07290  [pdf, other

    cs.LG cs.AI

    Value function estimation using conditional diffusion models for control

    Authors: Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind

    Abstract: A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative to address, sooner than later, the potential problem of running out of high-quality demonstrations. In this case, instead of collecting only new data via costly… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  4. arXiv:2303.04248  [pdf, other

    cs.LG cs.CV

    TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

    Authors: David Berthelot, Arnaud Autef, Jierui Lin, Dian Ang Yap, Shuangfei Zhai, Siyuan Hu, Daniel Zheng, Walter Talbott, Eric Gu

    Abstract: Denoising Diffusion models have demonstrated their proficiency for generative sampling. However, generating good samples often requires many iterations. Consequently, techniques such as binary time-distillation (BTD) have been proposed to reduce the number of network calls for a fixed architecture. In this paper, we introduce TRAnsitive Closure Time-distillation (TRACT), a new method that extends… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  5. arXiv:2207.13751  [pdf, other

    cs.CV cs.GR cs.LG

    GAUDI: A Neural Architect for Immersive 3D Scene Generation

    Authors: Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Josh Susskind

    Abstract: We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle this challenging problem with a scalable yet powerful approach, where we first optimize a latent representation that disentangles radiance fields and camera poses. This latent representation is then used to learn a generati… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: Project webpage: https://github.com/apple/ml-gaudi

  6. arXiv:2207.07611  [pdf, other

    cs.LG cs.CV cs.SD eess.AS

    Position Prediction as an Effective Pretraining Strategy

    Authors: Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

    Abstract: Transformers have gained increasing popularity in a wide range of applications, including Natural Language Processing (NLP), Computer Vision and Speech Recognition, because of their powerful representational capacity. However, harnessing this representational capacity effectively requires a large amount of data, strong regularization, or both, to mitigate overfitting. Recently, the power of the Tr… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML 2022

  7. arXiv:2207.01844  [pdf, other

    cs.LG cs.CV

    Efficient Representation Learning via Adaptive Context Pooling

    Authors: Chen Huang, Walter Talbott, Navdeep Jaitly, Josh Susskind

    Abstract: Self-attention mechanisms model long-range context by using pairwise attention between all input tokens. In doing so, they assume a fixed attention granularity defined by the individual tokens (e.g., text characters or image pixels), which may not be optimal for modeling complex dependencies at higher levels. In this paper, we propose ContextPool to address this problem by adapting the attention g… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  8. arXiv:2201.12300  [pdf, other

    cs.LG stat.ML

    Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation

    Authors: Martin Bertran, Walter Talbott, Nitish Srivastava, Joshua Susskind

    Abstract: Learning generalizeable policies from visual input in the presence of visual distractions is a challenging problem in reinforcement learning. Recently, there has been renewed interest in bisimulation metrics as a tool to address this issue; these metrics can be used to learn representations that are, in principle, invariant to irrelevant distractions by measuring behavioural similarity between sta… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  9. arXiv:2112.01163  [pdf, other

    cs.LG cs.AI cs.RO

    Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

    Authors: Nitish Srivastava, Walter Talbott, Martin Bertran Lopez, Shuangfei Zhai, Josh Susskind

    Abstract: Modeling the world can benefit robot learning by providing a rich training signal for sha** an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as images is challenging. One source of difficulty is the presence of irrelevant but hard-to-model background distractions, and unimportant visual details of task-relev… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: NeurIPS Deep Reinforcement Learning Workshop 2021. Code can be found at https://github.com/apple/ml-core

  10. arXiv:2109.08249  [pdf, other

    cs.CL

    Regularized Training of Nearest Neighbor Language Models

    Authors: Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Josh Susskind

    Abstract: Including memory banks in a natural language processing architecture increases model capacity by equip** it with additional data at inference time. In this paper, we build upon $k$NN-LM \citep{khandelwal20generalization}, which uses a pre-trained language model together with an exhaustive $k$NN search through the training data (memory bank) to achieve state-of-the-art results. We investigate whe… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  11. arXiv:2105.14103  [pdf, other

    cs.LG cs.CL cs.CV

    An Attention Free Transformer

    Authors: Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Josh Susskind

    Abstract: We introduce Attention Free Transformer (AFT), an efficient variant of Transformers that eliminates the need for dot product self attention. In an AFT layer, the key and value are first combined with a set of learned position biases, the result of which is multiplied with the query in an element-wise fashion. This new operation has a memory complexity linear w.r.t. both the context size and the di… ▽ More

    Submitted 21 September, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  12. arXiv:2006.15427  [pdf, other

    cs.CV

    On the generalization of learning-based 3D reconstruction

    Authors: Miguel Angel Bautista, Walter Talbott, Shuangfei Zhai, Nitish Srivastava, Joshua M Susskind

    Abstract: State-of-the-art learning-based monocular 3D reconstruction methods learn priors over object categories on the training set, and as a result struggle to achieve reasonable generalization to object categories unseen during training. In this paper we study the inductive biases encoded in the model architecture that impact the generalization of learning-based 3D reconstruction methods. We find that 3… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

  13. arXiv:2006.10705  [pdf, other

    cs.LG cs.CV stat.ML

    Set Distribution Networks: a Generative Model for Sets of Images

    Authors: Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Carlos Guestrin, Josh M. Susskind

    Abstract: Images with shared characteristics naturally form sets. For example, in a face verification benchmark, images of the same identity form sets. For generative models, the standard way of dealing with sets is to represent each as a one hot vector, and learn a conditional generative model $p(\mathbf{x}|\mathbf{y})$. This representation assumes that the number of sets is limited and known, such that th… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  14. arXiv:1910.13101  [pdf, other

    cs.LG stat.ML

    Adversarial Fisher Vectors for Unsupervised Representation Learning

    Authors: Shuangfei Zhai, Walter Talbott, Carlos Guestrin, Joshua M. Susskind

    Abstract: We examine Generative Adversarial Networks (GANs) through the lens of deep Energy Based Models (EBMs), with the goal of exploiting the density model that follows from this formulation. In contrast to a traditional view where the discriminator learns a constant function when reaching convergence, here we show that it can provide useful information for downstream tasks, e.g., feature extraction for… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted as spotlight presentation to NeurIPS 2019

  15. arXiv:1905.05895  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

    Authors: Chen Huang, Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Shih-Yu Sun, Carlos Guestrin, Josh Susskind

    Abstract: In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to directly optimize the evaluation metric. We propose a sample efficient reinforcement learning approach for adapting the loss dynamically during training. We empir… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML 2019