Showing 1–15 of 15 results for author: Li, A C

Searching in archive cs.
  1. arXiv:2406.00120  [pdf, other]

    cs.LG cs.AI cs.FL

    Reward Machines for Deep RL in Noisy and Uncertain Environments

    Authors: Andrew C. Li, Zizhao Chen, Toryn Q. Klassen, Pashootan Vaezipoor, Rodrigo Toro Icarte, Sheila A. McIlraith

    Abstract: Reward Machines provide an automata-inspired structure for specifying instructions, safety constraints, and other temporally extended reward-worthy behaviour. By exposing complex reward function structure, they enable counterfactual learning updates that have resulted in impressive sample efficiency gains. While Reward Machines have been employed in both tabular and deep RL settings, they have typ…

    Submitted 17 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

    ACM Class: I.2.0; I.2.6; I.2.4; F.4.3

  2. arXiv:2405.17247  [pdf, other]

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie, et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol…

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2311.16102  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback

    Authors: Mihir Prabhudesai, Tsung-Wei Ke, Alexander C. Li, Deepak Pathak, Katerina Fragkiadaki

    Abstract: The advancements in generative modeling, particularly the advent of diffusion models, have sparked a fundamental question: how can these models be effectively used for discriminative tasks? In this work, we find that generative models can be great test-time adapters for discriminative models. Our method, Diffusion-TTA, adapts pre-trained discriminative models such as image classifiers, segmenters…

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023. Webpage with code: https://diffusion-tta.github.io/

  4. arXiv:2303.16203  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Your Diffusion Model is Secretly a Zero-Shot Classifier

    Authors: Alexander C. Li, Mihir Prabhudesai, Shivam Duggal, Ellis Brown, Deepak Pathak

    Abstract: The recent wave of large-scale text-to-image diffusion models has dramatically increased our text-based image generation abilities. These models can generate realistic images for a staggering variety of prompts and exhibit impressive compositional generalization abilities. Almost all use cases thus far have solely focused on sampling; however, diffusion models can also provide conditional density…

    Submitted 12 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: In ICCV 2023. Website at https://diffusion-classifier.github.io/

  5. arXiv:2302.14051  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Internet Explorer: Targeted Representation Learning on the Open Web

    Authors: Alexander C. Li, Ellis Brown, Alexei A. Efros, Deepak Pathak

    Abstract: Modern vision models typically rely on fine-tuning general-purpose models pre-trained on large, static datasets. These general-purpose models only capture the knowledge within their pre-training datasets, which are tiny, out-of-date snapshots of the Internet -- where billions of images are uploaded each day. We suggest an alternate approach: rather than hoping our static datasets transfer to our d…

    Submitted 6 September, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: In ICML 2023. Website at https://internet-explorer-ssl.github.io/

  6. arXiv:2301.02952  [pdf, other]

    cs.LG cs.AI

    Learning Symbolic Representations for Reinforcement Learning of Non-Markovian Behavior

    Authors: Phillip J. K. Christoffersen, Andrew C. Li, Rodrigo Toro Icarte, Sheila A. McIlraith

    Abstract: Many real-world reinforcement learning (RL) problems necessitate learning complex, temporally extended behavior that may only receive reward signal when the behavior is completed. If the reward-worthy behavior is known, it can be specified in terms of a non-Markovian reward function - a function that depends on aspects of the state-action history, rather than just the current state and action. Suc…

    Submitted 7 January, 2023; originally announced January 2023.

    Comments: 7 pages, 2 figures, presented at KR2ML workshop at NeurIPS 2020

  7. arXiv:2211.10902  [pdf, other]

    cs.LG cs.AI cs.FL

    Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

    Authors: Andrew C. Li, Zizhao Chen, Pashootan Vaezipoor, Toryn Q. Klassen, Rodrigo Toro Icarte, Sheila A. McIlraith

    Abstract: Natural and formal languages provide an effective mechanism for humans to specify instructions and reward functions. We investigate how to generate policies via RL when reward functions are specified in a symbolic language captured by Reward Machines, an increasingly popular automaton-inspired structure. We are interested in the case where the mapping of environment state to a symbolic (here, Rewa…

    Submitted 23 November, 2022; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: NeurIPS Deep Reinforcement Learning Workshop 2022

  8. arXiv:2211.04591  [pdf, other]

    cs.LG cs.AI cs.CL

    Learning to Follow Instructions in Text-Based Games

    Authors: Mathieu Tuli, Andrew C. Li, Pashootan Vaezipoor, Toryn Q. Klassen, Scott Sanner, Sheila A. McIlraith

    Abstract: Text-based games present a unique class of sequential decision making problem in which agents interact with a partially observable, simulated environment via actions and observations conveyed through natural language. Such observations typically include instructions that, in a reinforcement learning (RL) setting, can directly or indirectly guide a player towards completing reward-worthy tasks. In…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  9. arXiv:2209.15007  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Understanding Collapse in Non-Contrastive Siamese Representation Learning

    Authors: Alexander C. Li, Alexei A. Efros, Deepak Pathak

    Abstract: Contrastive methods have led a recent surge in the performance of self-supervised representation learning (SSL). Recent methods like BYOL or SimSiam purportedly distill these contrastive methods down to their essence, removing bells and whistles, including the negative examples, that do not contribute to downstream performance. These "non-contrastive" methods work surprisingly well without using n…

    Submitted 2 November, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Published at ECCV 2022. Project page at https://alexanderli.com/noncontrastive-ssl/

  10. arXiv:2206.01812  [pdf, other]

    cs.LG cs.AI cs.RO

    Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks

    Authors: Andrew C. Li, Pashootan Vaezipoor, Rodrigo Toro Icarte, Sheila A. McIlraith

    Abstract: Deep reinforcement learning has shown promise in discrete domains requiring complex reasoning, including games such as Chess, Go, and Hanabi. However, this type of reasoning is less often observed in long-horizon, continuous domains with high-dimensional observations, where instead RL research has predominantly focused on problems with simple high-level structure (e.g. opening a drawer or moving a…

    Submitted 3 June, 2022; originally announced June 2022.

  11. arXiv:2112.03257  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Functional Regularization for Reinforcement Learning via Learned Fourier Features

    Authors: Alexander C. Li, Deepak Pathak

    Abstract: We propose a simple architecture for deep reinforcement learning by embedding inputs into a learned Fourier basis and show that it improves the sample efficiency of both state-based and image-based RL. We perform infinite-width analysis of our architecture using the Neural Tangent Kernel and theoretically show that tuning the initial variance of the Fourier basis is equivalent to functional regula…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted at NeurIPS 2021. Website at https://alexanderli.com/learned-fourier-features

  12. arXiv:2010.02819  [pdf, other]

    cs.LG cs.AI

    Interpretable Sequence Classification via Discrete Optimization

    Authors: Maayan Shvo, Andrew C. Li, Rodrigo Toro Icarte, Sheila A. McIlraith

    Abstract: Sequence classification is the task of predicting a class label given a sequence of observations. In many applications such as healthcare monitoring or intrusion detection, early classification is crucial to prompt intervention. In this work, we learn sequence classifiers that favour early classification from an evolving observation trace. While many state-of-the-art sequence classifiers are neura…

    Submitted 6 October, 2020; originally announced October 2020.

  13. arXiv:2002.11708  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Generalized Hindsight for Reinforcement Learning

    Authors: Alexander C. Li, Lerrel Pinto, Pieter Abbeel

    Abstract: One of the key reasons for the high sample complexity in reinforcement learning (RL) is the inability to transfer knowledge from one task to another. In standard multi-task RL settings, low-reward data collected while trying to solve one task provides little to no signal for solving that particular task and is hence effectively wasted. However, we argue that this data, which is uninformative for o…

    Submitted 26 February, 2020; originally announced February 2020.

  14. arXiv:1910.07737  [pdf, other]

    cs.LG stat.ML

    Autoregressive Models: What Are They Good For?

    Authors: Murtaza Dalal, Alexander C. Li, Rohan Taori

    Abstract: Autoregressive (AR) models have become a popular tool for unsupervised learning, achieving state-of-the-art log likelihood estimates. We investigate the use of AR models as density estimators in two settings -- as a learning signal for image translation, and as an outlier detector -- and find that these density estimates are much less reliable than previously thought. We examine the underlying opt…

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted for the Information Theory and Machine Learning workshop at NeurIPS 2019

  15. arXiv:1906.05862  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Sub-policy Adaptation for Hierarchical Reinforcement Learning

    Authors: Alexander C. Li, Carlos Florensa, Ignasi Clavera, Pieter Abbeel

    Abstract: Hierarchical reinforcement learning is a promising approach to tackle long-horizon decision-making problems with sparse rewards. Unfortunately, most methods still decouple the lower-level skill acquisition process and the training of a higher level that controls the skills in a new task. Leaving the skills fixed can lead to significant sub-optimality in the transfer setting. In this work, we propo…

    Submitted 13 May, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: ICLR 2020