Skip to main content

Showing 1–13 of 13 results for author: Burns, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10282  [pdf, other

    cs.LG cs.CV

    Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning

    Authors: Kyle Hsu, Jubayer Ibn Hamid, Kaylee Burns, Chelsea Finn, Jiajun Wu

    Abstract: Inductive biases are crucial in disentangled representation learning for narrowing down an underspecified solution set. In this work, we consider endowing a neural network autoencoder with three select inductive biases from the literature: data compression into a grid-like latent space via quantization, collective independence amongst latents, and minimal functional influence of any latent on how… ▽ More

    Submitted 24 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: ICML 2024 camera-ready. 22 pages, 10 figures, code available at https://github.com/kylehkhsu/tripod

  2. arXiv:2404.06645  [pdf, other

    cs.RO cs.AI

    GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks

    Authors: Kaylee Burns, A**kya Jain, Keegan Go, Fei Xia, Michael Stark, Stefan Schaal, Karol Hausman

    Abstract: Large Language Models (LLMs) have been successful at generating robot policy code, but so far these results have been limited to high-level tasks that do not require precise movement. It is an open question how well such approaches work for tasks that require reasoning over contact forces and working within tight success tolerances. We find that, with the right action space, LLMs are capable of su… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 pages, 12 figures

    ACM Class: I.2.9

  3. arXiv:2312.12444  [pdf, other

    cs.CV cs.AI cs.RO

    What Makes Pre-Trained Visual Representations Successful for Robust Manipulation?

    Authors: Kaylee Burns, Zach Witzel, Jubayer Ibn Hamid, Tianhe Yu, Chelsea Finn, Karol Hausman

    Abstract: Inspired by the success of transfer learning in computer vision, roboticists have investigated visual pre-training as a means to improve the learning efficiency and generalization ability of policies learned from pixels. To that end, past work has favored large object interaction datasets, such as first-person videos of humans completing diverse tasks, in pursuit of manipulation-relevant features.… ▽ More

    Submitted 3 November, 2023; originally announced December 2023.

    Comments: 20 pages, 12 figures

  4. arXiv:2305.13546  [pdf, other

    cs.LG cs.AI

    Neural Functional Transformers

    Authors: Allan Zhou, Kaien Yang, Yiding Jiang, Kaylee Burns, Winnie Xu, Samuel Sokota, J. Zico Kolter, Chelsea Finn

    Abstract: The recent success of neural networks as implicit representation of data has driven growing interest in neural functionals: models that can process other neural networks as input by operating directly over their weight spaces. Nevertheless, constructing expressive and efficient neural functional architectures that can handle high-dimensional weight-space objects remains challenging. This paper use… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  5. arXiv:2302.14040  [pdf, other

    cs.LG cs.AI

    Permutation Equivariant Neural Functionals

    Authors: Allan Zhou, Kaien Yang, Kaylee Burns, Adriano Cardace, Yiding Jiang, Samuel Sokota, J. Zico Kolter, Chelsea Finn

    Abstract: This work studies the design of neural networks that can process the weights or gradients of other neural networks, which we refer to as neural functional networks (NFNs). Despite a wide range of potential applications, including learned optimization, processing implicit neural representations, network editing, and policy evaluation, there are few unifying principles for designing effective archit… ▽ More

    Submitted 26 September, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: To appear in Neural Information Processing Systems (NeurIPS), 2023

  6. arXiv:2207.13082  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Reinforcement Learning at Multiple Frequencies

    Authors: Kaylee Burns, Tianhe Yu, Chelsea Finn, Karol Hausman

    Abstract: Leveraging many sources of offline robot data requires grappling with the heterogeneity of such data. In this paper, we focus on one particular aspect of heterogeneity: learning from offline data collected at different control frequencies. Across labs, the discretization of controllers, sampling rates of sensors, and demands of a task of interest may differ, giving rise to a mixture of frequencies… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Project website: https://sites.google.com/stanford.edu/adaptive-nstep-returns/

  7. arXiv:2203.01983  [pdf, other

    cs.RO

    Implicit Kinematic Policies: Unifying Joint and Cartesian Action Spaces in End-to-End Robot Learning

    Authors: Aditya Ganapathi, Pete Florence, Jake Varley, Kaylee Burns, Ken Goldberg, Andy Zeng

    Abstract: Action representation is an important yet often overlooked aspect in end-to-end robot learning with deep networks. Choosing one action space over another (e.g. target joint positions, or Cartesian end-effector poses) can result in surprisingly stark performance differences between various downstream tasks -- and as a result, considerable research has been devoted to finding the right action space… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: International Conference on Robotics and Automation (ICRA) 2022

  8. arXiv:2107.09285  [pdf, other

    cs.CL cs.AI

    Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning

    Authors: Kaylee Burns, Christopher D. Manning, Li Fei-Fei

    Abstract: Although virtual agents are increasingly situated in environments where natural language is the most effective mode of interaction with humans, these exchanges are rarely used as an opportunity for learning. Leveraging language interactions effectively requires addressing limitations in the two most common approaches to language grounding: semantic parsers built on top of fixed object categories a… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 17 pages, 10 figures

    ACM Class: I.2.7

  9. arXiv:1812.00929  [pdf, other

    cs.CV

    SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection

    Authors: Eric Tzeng, Kaylee Burns, Kate Saenko, Trevor Darrell

    Abstract: Domain adaptation of visual detectors is a critical challenge, yet existing methods have overlooked pixel appearance transformations, focusing instead on bootstrap** and/or domain confusion losses. We propose a Semantic Pixel-Level Adaptation Transform (SPLAT) approach to detector adaptation that efficiently generates cross-domain image pairs. Our model uses aligned-pair and/or pseudo-label loss… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  10. arXiv:1809.02156  [pdf, other

    cs.CL cs.CV

    Object Hallucination in Image Captioning

    Authors: Anna Rohrbach, Lisa Anne Hendricks, Kaylee Burns, Trevor Darrell, Kate Saenko

    Abstract: Despite continuously improving performance, contemporary image captioning models are prone to "hallucinating" objects that are not actually in a scene. One problem is that standard metrics only measure similarity to ground truth captions and may not fully capture image relevance. In this work, we propose a new image relevance metric to evaluate current models with veridical visual labels and asses… ▽ More

    Submitted 29 March, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Rohrbach and Hendricks contributed equally; accepted to EMNLP 2018

  11. arXiv:1808.09352  [pdf, other

    cs.CL

    Evaluating Theory of Mind in Question Answering

    Authors: Aida Nematzadeh, Kaylee Burns, Erin Grant, Alison Gopnik, Thomas L. Griffiths

    Abstract: We propose a new dataset for evaluating question answering models with respect to their capacity to reason about beliefs. Our tasks are inspired by theory-of-mind experiments that examine whether children are able to reason about the beliefs of others, in particular when those beliefs differ from reality. We evaluate a number of recent neural models with memory augmentation. We find that all fail… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

  12. arXiv:1807.00517  [pdf, other

    cs.CV

    Women also Snowboard: Overcoming Bias in Captioning Models (Extended Abstract)

    Authors: Lisa Anne Hendricks, Kaylee Burns, Kate Saenko, Trevor Darrell, Anna Rohrbach

    Abstract: Most machine learning methods are known to capture and exploit biases of the training data. While some biases are beneficial for learning, others are harmful. Specifically, image captioning models tend to exaggerate biases present in training data. This can lead to incorrect captions in domains where unbiased captions are desired, or required, due to over reliance on the learned prior and image co… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: Burns and Hendricks contributed equally. 2018 ICML Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2018)

  13. arXiv:1803.09797  [pdf, other

    cs.CV

    Women also Snowboard: Overcoming Bias in Captioning Models

    Authors: Kaylee Burns, Lisa Anne Hendricks, Kate Saenko, Trevor Darrell, Anna Rohrbach

    Abstract: Most machine learning methods are known to capture and exploit biases of the training data. While some biases are beneficial for learning, others are harmful. Specifically, image captioning models tend to exaggerate biases present in training data (e.g., if a word is present in 60% of training sentences, it might be predicted in 70% of sentences at test time). This can lead to incorrect captions i… ▽ More

    Submitted 13 March, 2019; v1 submitted 26 March, 2018; originally announced March 2018.

    Comments: 22 pages, 6 figures, Burns and Hendricks contributed equally