Skip to main content

Showing 1–10 of 10 results for author: Walsman, A

.
  1. arXiv:2405.11656  [pdf, other

    cs.RO cs.AI

    URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images

    Authors: Zoey Chen, Aaron Walsman, Marius Memmel, Kaichun Mo, Alex Fang, Karthikeya Vemuri, Alan Wu, Dieter Fox, Abhishek Gupta

    Abstract: Constructing simulation scenes that are both visually and physically realistic is a problem of practical interest in domains ranging from robotics to computer vision. This problem has become even more relevant as researchers wielding large data-hungry learning methods seek new sources of training data for physical decision-making systems. However, building simulation models is often still done by… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted at RSS2024

  2. arXiv:2304.02639  [pdf, other

    cs.CV cs.RO

    ENTL: Embodied Navigation Trajectory Learner

    Authors: Klemen Kotar, Aaron Walsman, Roozbeh Mottaghi

    Abstract: We propose Embodied Navigation Trajectory Learner (ENTL), a method for extracting long sequence representations for embodied navigation. Our approach unifies world modeling, localization and imitation learning into a single sequence prediction task. We train our model using vector-quantized predictions of future states conditioned on current states and actions. ENTL's generic architecture enables… ▽ More

    Submitted 29 September, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  3. arXiv:2207.13738  [pdf, other

    cs.CV cs.AI

    Break and Make: Interactive Structural Understanding Using LEGO Bricks

    Authors: Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox

    Abstract: Visual understanding of geometric structures with complex spatial relationships is a fundamental component of human intelligence. As children, we learn how to reason about structure not only from observation, but also by interacting with the world around us -- by taking things apart and putting them back together again. The ability to reason about structure and compositionality allows us to not on… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: ECCV 2022. LTRON simulator and environment page: https://github.com/aaronwalsman/ltron. Training examples: https://github.com/aaronwalsman/ltron-torch-eccv22

  4. arXiv:2009.13146  [pdf, other

    cs.RO cs.CV cs.LG

    Amodal 3D Reconstruction for Robotic Manipulation via Stability and Connectivity

    Authors: William Agnew, Christopher Xie, Aaron Walsman, Octavian Murad, Caelen Wang, Pedro Domingos, Siddhartha Srinivasa

    Abstract: Learning-based 3D object reconstruction enables single- or few-shot estimation of 3D object models. For robotics, this holds the potential to allow model-based methods to rapidly adapt to novel objects and scenes. Existing 3D reconstruction techniques optimize for visual reconstruction fidelity, typically measured by chamfer distance or voxel IOU. We find that when applied to realistic, cluttered… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  5. arXiv:2007.02519  [pdf, other

    cs.CV cs.LG

    FLUID: A Unified Evaluation Framework for Flexible Sequential Data

    Authors: Matthew Wallingford, Aditya Kusupati, Keivan Alizadeh-Vahid, Aaron Walsman, Aniruddha Kembhavi, Ali Farhadi

    Abstract: Modern ML methods excel when training data is IID, large-scale, and well labeled. Learning in less ideal conditions remains an open challenge. The sub-fields of few-shot, continual, transfer, and representation learning have made substantial strides in learning under adverse conditions; each affording distinct advantages through methods and insights. These methods address different challenges such… ▽ More

    Submitted 10 April, 2023; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 27 pages, 6 figures. Project page: https://raivn.cs.washington.edu/projects/FLUID/

    Journal ref: Transactions on Machine Learning Research 2023

  6. arXiv:1908.01504  [pdf, other

    cs.CV cs.RO

    Part Segmentation for Highly Accurate Deformable Tracking in Occlusions via Fully Convolutional Neural Networks

    Authors: Weilin Wan, Aaron Walsman, Dieter Fox

    Abstract: Successfully tracking the human body is an important perceptual challenge for robots that must work around people. Existing methods fall into two broad categories: geometric tracking and direct pose estimation using machine learning. While recent work has shown direct estimation techniques can be quite powerful, geometric tracking methods using point clouds can provide a very high level of 3D accu… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

    Journal ref: IEEE International Conference on Robotics and Automation 2019

  7. arXiv:1811.08824  [pdf, other

    cs.CV cs.RO

    Early Fusion for Goal Directed Robotic Vision

    Authors: Aaron Walsman, Yonatan Bisk, Saadia Gabriel, Dipendra Misra, Yoav Artzi, Ye** Choi, Dieter Fox

    Abstract: Building perceptual systems for robotics which perform well under tight computational budgets requires novel architectures which rethink the traditional computer vision pipeline. Modern vision architectures require the agent to build a summary representation of the entire scene, even if most of the input is irrelevant to the agent's current goal. In this work, we flip this paradigm, by introducing… ▽ More

    Submitted 7 August, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

  8. arXiv:1801.07357  [pdf, other

    cs.AI

    CHALET: Cornell House Agent Learning Environment

    Authors: Claudia Yan, Dipendra Misra, Andrew Bennnett, Aaron Walsman, Yonatan Bisk, Yoav Artzi

    Abstract: We present CHALET, a 3D house simulator with support for navigation and manipulation. CHALET includes 58 rooms and 10 house configuration, and allows to easily create new house and room layouts. CHALET supports a range of common household activities, including moving objects, toggling appliances, and placing objects inside closeable containers. The environment and actions available are designed to… ▽ More

    Submitted 16 September, 2019; v1 submitted 22 January, 2018; originally announced January 2018.

  9. arXiv:1711.07999  [pdf, other

    cs.CV

    Dynamic High Resolution Deformable Articulated Tracking

    Authors: Aaron Walsman, Weilin Wan, Tanner Schmidt, Dieter Fox

    Abstract: The last several years have seen significant progress in using depth cameras for tracking articulated objects such as human bodies, hands, and robotic manipulators. Most approaches focus on tracking skeletal parameters of a fixed shape model, which makes them insufficient for applications that require accurate estimates of deformable object surfaces. To overcome this limitation, we present a 3D mo… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 10 pages, 8 figures, Presented at 3DV 2017

  10. Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols

    Authors: Berk Calli, Aaron Walsman, Arjun Singh, Siddhartha Srinivasa, Pieter Abbeel, Aaron M. Dollar

    Abstract: In this paper we present the Yale-CMU-Berkeley (YCB) Object and Model set, intended to be used to facilitate benchmarking in robotic manipulation, prosthetic design and rehabilitation research. The objects in the set are designed to cover a wide range of aspects of the manipulation problem; it includes objects of daily life with different shapes, sizes, textures, weight and rigidity, as well as so… ▽ More

    Submitted 10 February, 2015; originally announced February 2015.

    Comments: Submitted to Robotics and Automation Magazine (RAM) Special Issue on Replicable and Measurable Robotics Research. 35 Pages

    Journal ref: IEEE Robotics & Automation Magazine, 22 (2015) 36 - 52