Showing 1–10 of 10 results for author: Lee, M A

  1. arXiv:2212.03858  [pdf, other]

    cs.RO cs.CV

    See, Hear, and Feel: Smart Sensory Fusion for Robotic Manipulation

    Authors: Hao Li, Yizhi Zhang, Junzhe Zhu, Shaoxiong Wang, Michelle A Lee, Huazhe Xu, Edward Adelson, Li Fei-Fei, Ruohan Gao, Jiajun Wu

    Abstract: Humans use all of their senses to accomplish different tasks in everyday activities. In contrast, existing work on robotic manipulation mostly relies on one, or occasionally two modalities, such as vision and touch. In this work, we systematically study how visual, auditory, and tactile perception can jointly help robots to solve complex manipulation tasks. We build a robot system that can see wit…

    Submitted 8 December, 2022; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: In CoRL 2022. Li and Zhang equal contribution; Gao and Wu equal advising. Project page: https://ai.stanford.edu/~rhgao/see_hear_feel/

  2. arXiv:2107.07502  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.MM

    MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

    Authors: Paul Pu Liang, Yiwei Lyu, Xiang Fan, Zetian Wu, Yun Cheng, Jason Wu, Leslie Chen, Peter Wu, Michelle A. Lee, Yuke Zhu, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Learning multimodal representations involves integrating information from multiple heterogeneous sources of data. It is a challenging yet crucial area with numerous real-world applications in multimedia, affective computing, robotics, finance, human-computer interaction, and healthcare. Unfortunately, multimodal research has seen limited resources to study (1) generalization across domains and mod…

    Submitted 10 November, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021 Datasets and Benchmarks Track. Code: https://github.com/pliang279/MultiBench and Website: https://cmu-multicomp-lab.github.io/multibench/

  3. arXiv:2105.08257  [pdf, other]

    cs.RO

    Differentiable Factor Graph Optimization for Learning Smoothers

    Authors: Brent Yi, Michelle A. Lee, Alina Kloss, Roberto Martín-Martín, Jeannette Bohg

    Abstract: A recent line of work has shown that end-to-end optimization of Bayesian filters can be used to learn state estimators for systems whose underlying models are difficult to hand-design or tune, while retaining the core advantages of probabilistic state estimation. As an alternative approach for state estimation in these settings, we present an end-to-end approach for learning state estimators model…

    Submitted 23 August, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: IROS 2021. 9 pages with references and appendix

  4. arXiv:2101.02725  [pdf, other]

    cs.RO

    Interpreting Contact Interactions to Overcome Failure in Robot Assembly Tasks

    Authors: Peter A. Zachares, Michelle A. Lee, Wenzhao Lian, Jeannette Bohg

    Abstract: A key challenge towards the goal of multi-part assembly tasks is finding robust sensorimotor control methods in the presence of uncertainty. In contrast to previous works that rely on a priori knowledge on whether two parts match, we aim to learn this through physical interaction. We propose a hierarchical approach that enables a robot to autonomously assemble parts while being uncertain about par…

    Submitted 11 May, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

  5. arXiv:2012.00201  [pdf, other]

    cs.RO cs.AI cs.LG

    Detect, Reject, Correct: Crossmodal Compensation of Corrupted Sensors

    Authors: Michelle A. Lee, Matthew Tan, Yuke Zhu, Jeannette Bohg

    Abstract: Using sensor data from multiple modalities presents an opportunity to encode redundant and complementary features that can be useful when one modality is corrupted or noisy. Humans do this every day, relying on touch and proprioceptive feedback in visually challenging environments. However, robots might not always know when their sensors are corrupted, as even broken sensors can return valid values…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: 8 pages, 5 figures

  6. arXiv:2010.13021  [pdf, other]

    cs.RO

    Multimodal Sensor Fusion with Differentiable Filters

    Authors: Michelle A. Lee, Brent Yi, Roberto Martín-Martín, Silvio Savarese, Jeannette Bohg

    Abstract: Leveraging multimodal information with recursive Bayesian filters improves performance and robustness of state estimation, as recursive filters can combine different modalities according to their uncertainties. Prior work has studied how to optimally fuse different sensor modalities with analytical state estimation algorithms. However, deriving the dynamics and measurement models along with their…

    Submitted 23 December, 2020; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Published in IROS 2020. Updated sponsors, fixed Kalman gain typo

  7. arXiv:2005.10872  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning

    Authors: Michelle A. Lee, Carlos Florensa, Jonathan Tremblay, Nathan Ratliff, Animesh Garg, Fabio Ramos, Dieter Fox

    Abstract: Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state. On the other hand, reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle. In this w…

    Submitted 26 May, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Journal ref: International Conference on Robotics and Automation 2020

  8. arXiv:1907.13098  [pdf, other]

    cs.RO cs.LG

    Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks

    Authors: Michelle A. Lee, Yuke Zhu, Peter Zachares, Matthew Tan, Krishnan Srinivasan, Silvio Savarese, Li Fei-Fei, Animesh Garg, Jeannette Bohg

    Abstract: Contact-rich manipulation tasks in unstructured environments often require both haptic and visual feedback. It is non-trivial to manually design a robot controller that combines these modalities which have very different characteristics. While deep reinforcement learning has shown success in learning control policies for high-dimensional inputs, these algorithms are generally intractable to deploy…

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.10191

  9. arXiv:1906.08880  [pdf, other]

    cs.RO cs.AI cs.LG

    Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks

    Authors: Roberto Martín-Martín, Michelle A. Lee, Rachel Gardner, Silvio Savarese, Jeannette Bohg, Animesh Garg

    Abstract: Reinforcement Learning (RL) of contact-rich manipulation tasks has yielded impressive results in recent years. While many studies in RL focus on varying the observation space or reward model, few efforts focused on the choice of action space (e.g. joint or end-effector space, position, velocity, etc.). However, studies in robot motion control indicate that choosing an action space that conforms to…

    Submitted 2 August, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: IROS19

  10. arXiv:1810.10191  [pdf, other]

    cs.RO cs.AI cs.LG

    Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

    Authors: Michelle A. Lee, Yuke Zhu, Krishnan Srinivasan, Parth Shah, Silvio Savarese, Li Fei-Fei, Animesh Garg, Jeannette Bohg

    Abstract: Contact-rich manipulation tasks in unstructured environments often require both haptic and visual feedback. However, it is non-trivial to manually design a robot controller that combines modalities with very different characteristics. While deep reinforcement learning has shown success in learning control policies for high-dimensional inputs, these algorithms are generally intractable to deploy on…

    Submitted 7 March, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: ICRA 2019