Skip to main content

Showing 1–10 of 10 results for author: Myers, V

.
  1. arXiv:2406.17098  [pdf, other

    cs.LG cs.AI

    Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making

    Authors: Vivek Myers, Chongyi Zheng, Anca Dragan, Sergey Levine, Benjamin Eysenbach

    Abstract: Temporal distances lie at the heart of many algorithms for planning, control, and reinforcement learning that involve reaching goals, allowing one to estimate the transit time between two states. However, prior attempts to define such temporal distances in stochastic settings have been stymied by an important limitation: these prior approaches do not satisfy the triangle inequality. This is not me… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2406.06714  [pdf, other

    cs.LG cs.AI cs.HC

    Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation

    Authors: Michelle Pan, Mariah Schrum, Vivek Myers, Erdem Bıyık, Anca Dragan

    Abstract: Adaptive brain stimulation can treat neurological conditions such as Parkinson's disease and post-stroke motor deficits by influencing abnormal neural activity. Because of patient heterogeneity, each patient requires a unique stimulation policy to achieve optimal neural responses. Model-free reinforcement learning (MFRL) holds promise in learning effective policies for a variety of similar control… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  3. arXiv:2403.04082  [pdf, other

    cs.LG stat.ML

    Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

    Authors: Benjamin Eysenbach, Vivek Myers, Ruslan Salakhutdinov, Sergey Levine

    Abstract: Given time series data, how can we answer questions like "what will happen in the future?" and "how did we get here?" These sorts of probabilistic inference questions are challenging when observations are high-dimensional. In this paper, we show how these questions can have compact, closed form solutions in terms of learned representations. The key idea is to apply a variant of contrastive learnin… ▽ More

    Submitted 18 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Code: https://github.com/vivekmyers/contrastive_planning

  4. arXiv:2308.12952  [pdf, other

    cs.RO cs.LG

    BridgeData V2: A Dataset for Robot Learning at Scale

    Authors: Homer Walke, Kevin Black, Abraham Lee, Moo ** Kim, Max Du, Chongyi Zheng, Tony Zhao, Philippe Hansen-Estruch, Quan Vuong, Andre He, Vivek Myers, Kuan Fang, Chelsea Finn, Sergey Levine

    Abstract: We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research on scalable robot learning. BridgeData V2 contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot. BridgeData V2 provides extensive task and environment variability, leading to skills that can generalize across environments, domains,… ▽ More

    Submitted 17 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 9 pages

  5. arXiv:2307.00117  [pdf, other

    cs.RO cs.LG

    Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

    Authors: Vivek Myers, Andre He, Kuan Fang, Homer Walke, Philippe Hansen-Estruch, Ching-An Cheng, Mihai Jalobeanu, Andrey Kolobov, Anca Dragan, Sergey Levine

    Abstract: Our goal is for robots to follow natural language instructions like "put the towel next to the microwave." But getting large amounts of labeled data, i.e. data that contains demonstrations of tasks labeled with the language instruction, is prohibitive. In contrast, obtaining policies that respond to image goals is much easier, because any autonomous trial or demonstration can be labeled in hindsig… ▽ More

    Submitted 17 August, 2023; v1 submitted 30 June, 2023; originally announced July 2023.

    Comments: 15 pages, 5 figures

  6. arXiv:2306.08651  [pdf, other

    cs.RO cs.AI

    Toward Grounded Commonsense Reasoning

    Authors: Minae Kwon, Hengyuan Hu, Vivek Myers, Siddharth Karamcheti, Anca Dragan, Dorsa Sadigh

    Abstract: Consider a robot tasked with tidying a desk with a meticulously constructed Lego sports car. A human may recognize that it is not appropriate to disassemble the sports car and put it away as part of the "tidying." How can a robot reach that conclusion? Although large language models (LLMs) have recently been used to enable commonsense reasoning, grounding this reasoning in the real world has been… ▽ More

    Submitted 18 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: IEEE International Conference on Robotics and Automation 2024

  7. arXiv:2302.13507  [pdf, other

    cs.LG cs.AI cs.RO

    Active Reward Learning from Online Preferences

    Authors: Vivek Myers, Erdem Bıyık, Dorsa Sadigh

    Abstract: Robot policies need to adapt to human preferences and/or new environments. Human experts may have the domain knowledge required to help robots achieve this adaptation. However, existing works often require costly offline re-training on human feedback, and those feedback usually need to be frequent and too complex for the humans to reliably provide. To avoid placing undue burden on human experts an… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: 11 pages, 8 figures, 1 table. Published in the 2023 IEEE International Conference on Robotics and Automation (ICRA)

  8. arXiv:2110.11044  [pdf, other

    cs.LG stat.ML

    Bayesian Meta-Learning Through Variational Gaussian Processes

    Authors: Vivek Myers, Nikhil Sardana

    Abstract: Recent advances in the field of meta-learning have tackled domains consisting of large numbers of small ("few-shot") supervised learning tasks. Meta-learning algorithms must be able to rapidly adapt to any individual few-shot task, fitting to a small support set within a task and using it to predict the labels of the task's query set. This problem setting can be extended to the Bayesian context, w… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  9. arXiv:2109.12750  [pdf, other

    cs.LG cs.AI cs.RO

    Learning Multimodal Rewards from Rankings

    Authors: Vivek Myers, Erdem Bıyık, Nima Anari, Dorsa Sadigh

    Abstract: Learning from human feedback has shown to be a useful approach in acquiring robot reward functions. However, expert feedback is often assumed to be drawn from an underlying unimodal reward function. This assumption does not always hold including in settings where multiple experts provide data or when a single expert provides data for different tasks -- we thus go beyond learning a unimodal reward… ▽ More

    Submitted 18 October, 2021; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: 17 pages, 12 figures, 2 tables. Published at Conference on Robot Learning (CoRL) 2021

  10. arXiv:2007.10263  [pdf, other

    cs.LG q-bio.QM stat.ML

    A Hierarchical Approach to Scaling Batch Active Search Over Structured Data

    Authors: Vivek Myers, Peyton Greenside

    Abstract: Active search is the process of identifying high-value data points in a large and often high-dimensional parameter space that can be expensive to evaluate. Traditional active search techniques like Bayesian optimization trade off exploration and exploitation over consecutive evaluations, and have historically focused on single or small (<5) numbers of examples evaluated per round. As modern data s… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: Presented at the 2020 ICML Workshop on Real World Experiment Design and Active Learning