Showing 1–50 of 85 results for author: Pinto, L

Searching in archive cs.
  1. arXiv:2406.07539  [pdf, other]

    cs.RO

    BAKU: An Efficient Transformer for Multi-Task Policy Learning

    Authors: Siddhant Haldar, Zhuoran Peng, Lerrel Pinto

    Abstract: Training generalist agents capable of solving diverse tasks is challenging, often requiring large datasets of expert demonstrations. This is particularly problematic in robotics, where each data point requires physical execution of actions in the real world. Thus, there is a pressing need for architectures that can effectively leverage the available training data. In this work, we present BAKU, a…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.04318  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction

    Authors: Chen-Yu Yen, Raghav Singhal, Umang Sharma, Rajesh Ranganath, Sumit Chopra, Lerrel Pinto

    Abstract: Magnetic Resonance (MR) imaging, despite its proven diagnostic utility, remains an inaccessible imaging modality for disease surveillance at the population level. A major factor rendering MR inaccessible is lengthy scan times. An MR scanner collects measurements associated with the underlying anatomy in the Fourier space, also known as the k-space. Creating a high-fidelity image requires collectin…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Project website at https://adaptive-sampling-mr.github.io

  3. arXiv:2403.07870  [pdf, other]

    cs.RO

    OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation

    Authors: Aadhithya Iyer, Zhuoran Peng, Yinlong Dai, Irmak Guzey, Siddhant Haldar, Soumith Chintala, Lerrel Pinto

    Abstract: Open-sourced, user-friendly tools form the bedrock of scientific advancement across disciplines. The widespread adoption of data-driven learning has led to remarkable progress in multi-fingered dexterity, bimanual manipulation, and applications ranging from logistics to home robotics. However, existing data collection platforms are often proprietary, costly, or tailored to specific robotic morphol…

    Submitted 12 March, 2024; originally announced March 2024.

  4. arXiv:2403.03181  [pdf, other]

    cs.LG cs.AI cs.RO

    Behavior Generation with Latent Actions

    Authors: Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Generative modeling of complex behaviors from labeled datasets has been a longstanding problem in decision making. Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction. A recent class of models calle…

    Submitted 28 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Github repo: https://github.com/jayLEE0301/vq_bet_official

  5. arXiv:2402.10211  [pdf, other]

    cs.LG cs.RO eess.SP

    Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

    Authors: Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto

    Abstract: Reasoning from sequences of raw sensory data is a ubiquitous problem across fields ranging from medical devices to robotics. These problems often involve using long sequences of raw sensor data (e.g. magnetometers, piezoresistors) to predict sequences of desirable physical quantities (e.g. force, inertial measurements). While classical approaches are powerful for locally-linear prediction problems…

    Submitted 15 February, 2024; originally announced February 2024.

  6. arXiv:2401.12202  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

    Authors: Peiqi Liu, Yaswanth Orru, Jay Vakil, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Remarkable progress has been made in recent years in the fields of vision, language, and robotics. We now have vision models capable of recognizing objects based on language queries, navigation systems that can effectively control mobile systems, and grasping models that can handle a wide range of objects. Despite these advancements, general-purpose applications of robotics still lag behind, even…

    Submitted 29 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Github repo: https://github.com/ok-robot/ok-robot

  7. arXiv:2401.09252  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey

    Authors: Thiago Lopes Trugillo da Silveira, Paulo Gamarra Lessa Pinto, Jeffri Erwin Murrugarra Llerena, Claudio Rosito Jung

    Abstract: This paper provides a comprehensive survey on pioneer and state-of-the-art 3D scene geometry estimation methodologies based on single, two, or multiple images captured under the omnidirectional optics. We first revisit the basic concepts of the spherical camera model, and review the most common acquisition technologies and representation formats suitable for omnidirectional (also called 360…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Published in ACM Computing Surveys

    Journal ref: ACM Comput. Surv. 55, 4, Article 68, 2023

  8. arXiv:2312.17261  [pdf, other]

    cs.CV cs.LG

    Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing

    Authors: Juliano Pinto, Georg Hess, Yuxuan Xia, Henk Wymeersch, Lennart Svensson

    Abstract: Multi-object tracking (MOT) is the task of estimating the state trajectories of an unknown and time-varying number of objects over a certain time window. Several algorithms have been proposed to tackle the multi-object smoothing task, where object detections can be conditioned on all the measurements in the time window. However, the best-performing methods suffer from intractable computational com…

    Submitted 22 December, 2023; originally announced December 2023.

  9. arXiv:2312.07540  [pdf, other]

    cs.AI cs.CL cs.LG

    diff History for Neural Language Agents

    Authors: Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

    Abstract: Neural Language Models (LMs) offer an exciting solution for general-purpose embodied control. However, a key technical issue arises when using an LM-based controller: environment observations must be converted to text, which coupled with history, results in long and verbose textual prompts. As a result, prior work in LM agents is limited to restricted domains with small observation size as well as…

    Submitted 11 June, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: ICML 2024 version

  10. arXiv:2311.16098  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    On Bringing Robots Home

    Authors: Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

    Abstract: Throughout history, we have successfully integrated various machines into our homes. Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent examples. However, these machines excel at performing only a single task effectively. The concept of a "generalist machine" in homes - a domestic assistant that can adapt and learn from our needs, all while remaining cost-effective - h…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Project website and videos are available at https://dobb-e.com, technical documentation for getting started is available at https://docs.dobb-e.com, and code is released at https://github.com/notmahi/dobb-e

  11. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  12. arXiv:2310.08573  [pdf, other]

    cs.RO

    PolyTask: Learning Unified Policies through Behavior Distillation

    Authors: Siddhant Haldar, Lerrel Pinto

    Abstract: Unified models capable of solving a wide variety of tasks have gained traction in vision and NLP due to their ability to share regularities and structures across tasks, which improves individual task performance and reduces computational footprint. However, the impact of such models remains limited in embodied learning problems, which present unique challenges due to interactivity, sample ineffici…

    Submitted 12 October, 2023; originally announced October 2023.

  13. arXiv:2309.12300  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    See to Touch: Learning Tactile Dexterity through Visual Incentives

    Authors: Irmak Guzey, Yinlong Dai, Ben Evans, Soumith Chintala, Lerrel Pinto

    Abstract: Equipping multi-fingered robots with tactile sensing is crucial for achieving the precise, contact-rich, and dexterous manipulation that humans excel at. However, relying solely on tactile sensing fails to provide adequate cues for reasoning about objects' spatial configurations, limiting the ability to correct errors and adapt to changing situations. In this paper, we present Tactile Adaptation f…

    Submitted 21 September, 2023; originally announced September 2023.

  14. arXiv:2306.12554  [pdf, other]

    cs.LG cs.AI

    Improving Long-Horizon Imitation Through Instruction Prediction

    Authors: Joey Hejna, Pieter Abbeel, Lerrel Pinto

    Abstract: Complex, long-horizon planning and its combinatorial nature pose steep challenges for learning-based agents. Difficulties in such settings are exacerbated in low data regimes where over-fitting stifles generalization and compounding errors hurt accuracy. In this work, we explore the use of an often unused source of auxiliary supervision: language. Inspired by recent advances in transformer-based m…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Published at AAAI 2023

  15. arXiv:2306.00942  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Train Offline, Test Online: A Real Robot Learning Benchmark

    Authors: Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta

    Abstract: Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robotic hardware for evaluating methods…

    Submitted 30 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICRA 2023

  16. arXiv:2305.19240  [pdf, other]

    cs.LG cs.AI

    NetHack is Hard to Hack

    Authors: Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

    Abstract: Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents out…

    Submitted 30 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  17. arXiv:2303.12076  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Dexterity from Touch: Self-Supervised Pre-Training of Tactile Representations with Robotic Play

    Authors: Irmak Guzey, Ben Evans, Soumith Chintala, Lerrel Pinto

    Abstract: Teaching dexterity to multi-fingered robots has been a longstanding challenge in robotics. Most prominent work in this area focuses on learning controllers or policies that either operate on visual observations or state estimates derived from vision. However, such methods perform poorly on fine-grained manipulation tasks that require reasoning about contact forces or about objects occluded by the…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Video and code can be accessed here: https://tactile-dexterity.github.io/

  18. arXiv:2303.01497  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations

    Authors: Siddhant Haldar, Jyothish Pari, Anant Rai, Lerrel Pinto

    Abstract: While imitation learning provides us with an efficient toolkit to train robots, learning skills that are robust to environment variations remains a significant challenge. Current approaches address this challenge by relying either on large amounts of demonstrations that span environment variations or on handcrafted reward functions that require state estimates. Both directions are not scalable to…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Code and robot videos are available at https://fast-imitation.github.io/

  19. arXiv:2210.10047  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

    Authors: Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging. One critical reason for this is that uncurated robot demonstration data, i.e. play data, collected from non-expert human demonstrators are often noisy, diverse, and distributionally multi-modal. Thi…

    Submitted 15 December, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Code and data available at: https://play-to-policy.github.io; (fixed metadata author name format)

  20. arXiv:2210.06463  [pdf, other]

    cs.RO cs.AI cs.CV cs.HC cs.LG

    Holo-Dex: Teaching Dexterity with Immersive Mixed Reality

    Authors: Sridhar Pandian Arunachalam, Irmak Güzey, Soumith Chintala, Lerrel Pinto

    Abstract: A fundamental challenge in teaching robots is to provide an effective interface for human teachers to demonstrate useful skills to a robot. This challenge is exacerbated in dexterous manipulation, where teaching high-dimensional, contact-rich behaviors often require esoteric teleoperation tools. In this work, we present Holo-Dex, a framework for dexterous manipulation that places a teacher in an i…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Data, code and videos are available at https://holo-dex.github.io

  21. arXiv:2210.05663  [pdf, other]

    cs.RO cs.CV

    CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

    Authors: Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

    Abstract: We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. CLIP-Fields learns a mapping from spatial locations to semantic embedding vectors. Importantly, we show that this mapping can be trained with supervision coming only from web-image and web-text trained models such…

    Submitted 22 May, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Code, video, and interactive demonstrations available at https://mahis.life/clip-fields. Accepted for publication at Robotics: Science and Systems 2023 in Daegu, Korea

  22. arXiv:2210.01116  [pdf, other]

    cs.RO cs.LG cs.SD eess.AS

    That Sounds Right: Auditory Self-Supervision for Dynamic Robot Manipulation

    Authors: Abitha Thankaraj, Lerrel Pinto

    Abstract: Learning to produce contact-rich, dynamic behaviors from raw sensory data has been a longstanding challenge in robotics. Prominent approaches primarily focus on using visual or tactile sensing, where unfortunately one fails to capture high-frequency interaction, while the other can be too delicate for large-scale data collection. In this work, we propose a data-centric approach to dynamic manipula…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Videos and audio data are best seen on our project website: audio-robot-learning.github.io

  23. arXiv:2208.02932  [pdf, other]

    cs.AI cs.HC cs.LG

    Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment

    Authors: Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C. -C. Jay Kuo, Stefanos Nikolaidis

    Abstract: Human-centered AI considers human experiences with AI performance. While abundant research has been helping AI achieve superhuman performance either by fully automatic or weak supervision learning, fewer endeavors are experimenting with how AI can tailor to humans' preferred skill level given fine-grained input. In this work, we guide the curriculum reinforcement learning results towards a preferr…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 6 pages, 7 figures

    ACM Class: I.2.6

  24. arXiv:2206.15469  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Watch and Match: Supercharging Imitation with Regularized Optimal Transport

    Authors: Siddhant Haldar, Vaibhav Mathur, Denis Yarats, Lerrel Pinto

    Abstract: Imitation learning holds tremendous promise in learning policies efficiently for complex decision making problems. Current state-of-the-art algorithms often use inverse reinforcement learning (IRL), where given a set of expert demonstrations, an agent alternatively infers a reward function and the associated optimal policy. However, such IRL approaches often require substantial online interactions…

    Submitted 20 February, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: Code and robot videos are available on https://rot-robot.github.io/

  25. arXiv:2206.11251  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Behavior Transformers: Cloning $k$ modes with one stone

    Authors: Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

    Abstract: While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance, multiple modes, and human demonstrations typically do not come with reward labels. These properties limit the applicability of current methods in Offline RL and Beha…

    Submitted 11 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Code and data available at https://github.com/notmahi/bet

  26. arXiv:2203.13251  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation

    Authors: Sridhar Pandian Arunachalam, Sneha Silwal, Ben Evans, Lerrel Pinto

    Abstract: Optimizing behaviors for dexterous manipulation has been a longstanding challenge in robotics, with a variety of methods from model-based control to model-free reinforcement learning having been previously explored in literature. Perhaps one of the most powerful techniques to learn complex manipulation strategies is imitation learning. However, collecting and learning from demonstrations in dexter…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: The first two authors contributed equally

  27. arXiv:2203.11176  [pdf, other]

    cs.LG cs.AI cs.RO

    One After Another: Learning Incremental Skills for a Changing World

    Authors: Nur Muhammad Shafiullah, Lerrel Pinto

    Abstract: Reward-free, unsupervised discovery of skills is an attractive alternative to the bottleneck of hand-designing rewards in environments where task supervision is scarce or expensive. However, current skill pre-training methods, like many RL techniques, make a fundamental assumption - stationary environments during training. Traditional methods learn all their skills simultaneously, which makes it d…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: To be published in The International Conference on Learning Representations (ICLR) 2022

  28. arXiv:2203.08098  [pdf, other]

    cs.RO

    RB2: Robotic Manipulation Benchmarking with a Twist

    Authors: Sudeep Dasari, Jianren Wang, Joyce Hong, Shikhar Bahl, Yixin Lin, Austin Wang, Abitha Thankaraj, Karanbir Chahal, Berk Calli, Saurabh Gupta, David Held, Lerrel Pinto, Deepak Pathak, Vikash Kumar, Abhinav Gupta

    Abstract: Benchmarks offer a scientific way to compare algorithms using objective performance metrics. Good benchmarks have two features: (a) they should be widely useful for many research groups; (b) and they should produce reproducible findings. In robotic manipulation research, there is a trade-off between reproducibility and broad accessibility. If the benchmark is kept restrictive (fixed hardware, obje…

    Submitted 30 October, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: accepted at the NeurIPS 2021 Datasets and Benchmarks Track

  29. arXiv:2203.05549  [pdf, other]

    cs.RO cs.AI cs.LG

    Context is Everything: Implicit Identification for Dynamics Adaptation

    Authors: Ben Evans, Abitha Thankaraj, Lerrel Pinto

    Abstract: Understanding environment dynamics is necessary for robots to act safely and optimally in the world. In realistic scenarios, dynamics are non-stationary and the causal variables such as environment parameters cannot necessarily be precisely measured or inferred, even during training. We propose Implicit Identification for Dynamics Adaptation (IIDA), a simple method to allow predictive models to ad…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted at ICRA 2022

  30. arXiv:2202.07909  [pdf, other]

    cs.LG cs.CV eess.SY

    Can Deep Learning be Applied to Model-Based Multi-Object Tracking?

    Authors: Juliano Pinto, Georg Hess, William Ljungbergh, Yuxuan Xia, Henk Wymeersch, Lennart Svensson

    Abstract: Multi-object tracking (MOT) is the problem of tracking the state of an unknown and time-varying number of objects using noisy measurements, with important applications such as autonomous driving, tracking animal behavior, defense systems, and others. In recent years, deep learning (DL) has been increasingly used in MOT for improving tracking performance, but mostly in settings where the measuremen…

    Submitted 16 February, 2022; originally announced February 2022.

  31. arXiv:2201.13425  [pdf, other]

    cs.LG cs.AI

    Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

    Authors: Denis Yarats, David Brandfonbrener, Hao Liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto

    Abstract: Recent progress in deep learning has relied on access to large and diverse datasets. Such data-driven progress has been less evident in offline reinforcement learning (RL), because offline RL data is usually collected to optimize specific target tasks limiting the data's diversity. In this work, we propose Exploratory data for Offline RL (ExORL), a data-centric approach to offline RL. ExORL first…

    Submitted 5 April, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

  32. arXiv:2112.01511  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    The Surprising Effectiveness of Representation Learning for Visual Imitation

    Authors: Jyothish Pari, Nur Muhammad Shafiullah, Sridhar Pandian Arunachalam, Lerrel Pinto

    Abstract: While visual imitation learning offers one of the most effective ways of learning from visual demonstrations, generalizing from them requires either hundreds of diverse demonstrations, task specific priors, or large, hard-to-train parametric models. One reason such complexities arise is because standard visual imitation frameworks try to solve two coupled problems at once: learning a succinct but…

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: The first two authors contributed equally

  33. arXiv:2111.08084  [pdf, ps, other]

    cs.IT

    Finding the Minimum Norm and Center Density of Cyclic Lattices via Nonlinear Systems

    Authors: William Lima da Silva Pinto, Carina Alves

    Abstract: Lattices with a circulant generator matrix represent a subclass of cyclic lattices. This subclass can be described by a basis containing a vector and its circular shifts. In this paper, we present certain conditions under which the norm expression of an arbitrary vector of this type of lattice is substantially simplified, and then investigate some of the lattices obtained under these conditions. W…

    Submitted 5 July, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: preprint, 28 pages, 1 figure

    MSC Class: 11H31; 52C17; 15A15; 15A03; 90C30

  34. arXiv:2110.15191  [pdf, other]

    cs.LG cs.AI cs.RO

    URLB: Unsupervised Reinforcement Learning Benchmark

    Authors: Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel

    Abstract: Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to new tasks remains an outstanding challenge. Recent advances in unsupervised RL have shown that pre-training RL agents with self-supervised intrinsic rewards can result in efficient adaptation. However, these algorithms…

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Code for the Unsupervised Reinforcement Learning Benchmark is available at https://github.com/rll-research/url_benchmark

  35. arXiv:2107.09645  [pdf, other]

    cs.AI cs.LG

    Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

    Authors: Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto

    Abstract: We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements that yield state-of-the-art results on the DeepMind Control Suite. Notably, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from…

    Submitted 20 July, 2021; originally announced July 2021.

  36. arXiv:2107.09046  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Playful Interactions for Representation Learning

    Authors: Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto

    Abstract: One of the key challenges in visual imitation learning is collecting large amounts of expert demonstrations for a given task. While methods for collecting human demonstrations are becoming easier with teleoperation methods and the use of low-cost assistive tools, we often still require 100-1000 demonstrations for every task to learn a visual representation and policy. To address this, we turn to a…

    Submitted 19 July, 2021; originally announced July 2021.

  37. arXiv:2106.00639  [pdf, other]

    eess.AS cs.SD eess.SP

    Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms

    Authors: Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The research direction of identifying acoustic bio-markers of respiratory diseases has received renewed interest following the onset of COVID-19 pandemic. In this paper, we design an approach to COVID-19 diagnostic using crowd-sourced multi-modal data. The data resource, consisting of acoustic signals like cough, breathing, and speech signals, along with the data of symptoms, are recorded using a…

    Submitted 5 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: The Manuscript is submitted to IEEE-EMBS Journal of Biomedical and Health Informatics on June 1, 2021

  38. arXiv:2104.02844  [pdf, other]

    eess.SY cs.AI

    GEM: Group Enhanced Model for Learning Dynamical Control Systems

    Authors: Philippe Hansen-Estruch, Wenling Shang, Lerrel Pinto, Pieter Abbeel, Stas Tiomkin

    Abstract: Learning the dynamics of a physical system wherein an autonomous agent operates is an important task. Often these systems present apparent geometric structures. For instance, the trajectories of a robotic manipulator can be broken down into a collection of its transitional and rotational motions, fully characterized by the corresponding Lie groups and Lie algebras. In this work, we take advantage…

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 14 pages, 8 figures

  39. arXiv:2104.00734  [pdf, other]

    cs.LG cs.RO

    Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning

    Authors: Juliano Pinto, Georg Hess, William Ljungbergh, Yuxuan Xia, Lennart Svensson, Henk Wymeersch

    Abstract: Multitarget Tracking (MTT) is the problem of tracking the states of an unknown number of objects using noisy measurements, with important applications to autonomous driving, surveillance, robotics, and others. In the model-based Bayesian setting, there are conjugate priors that enable us to express the multi-object posterior in closed form, which could theoretically provide Bayes-optimal estimates…

    Submitted 4 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 8 pages, 4 figures

  40. arXiv:2103.16732  [pdf, other]

    cs.RO cs.AI

    Simultaneous Navigation and Construction Benchmarking Environments

    Authors: Wenyu Han, Chen Feng, Haoran Wu, Alexander Gao, Armand Jordana, Dong Liu, Lerrel Pinto, Ludovic Righetti

    Abstract: We need intelligent robots for mobile construction, the process of navigating in an environment and modifying its structure according to a geometric design. In this task, a major robot vision and learning challenge is how to exactly achieve the design without GPS, due to the difficulty caused by the bi-directional coupling of accurate robot localization and navigation together with strategic envir…

    Submitted 30 March, 2021; originally announced March 2021.

  41. arXiv:2103.09148  [pdf, other]

    eess.AS cs.SD

    DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics

    Authors: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda

    Abstract: The DiCOVA challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This challenge is an open call for researchers to analyze a dataset of sound recordings collected from COVID-19 infected and non-COVID-19 individuals for a two-class classification. These…

    Submitted 17 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of Interspeech, 2021

  42. arXiv:2102.13192  [pdf, other]

    cs.NI

    PlaceRAN: Optimal Placement of Virtualized Network Functions in the Next-generation Radio Access Networks

    Authors: Fernando Zanferrari Morais, Gabriel Matheus de Almeida, Leizer Pinto, Kleber Vieira Cardoso, Luis M. Contreras, Rodrigo da Rosa Righi, Cristiano Bonato Both

    Abstract: The fifth-generation mobile evolution enables several transformations on Next Generation Radio Access Networks (NG-RAN). The RAN protocol stack is splitting into eight possible disaggregated options combined into three network units, i.e., Central, Distributed, and Radio. Besides that, further advances allow the RAN software to be virtualized on top of general-purpose vendor-neutral hardware, deal…

    Submitted 28 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  43. arXiv:2102.13100  [pdf, other]

    cs.LG cs.AI cs.RO

    Task-Agnostic Morphology Evolution

    Authors: Donald J. Hejna III, Pieter Abbeel, Lerrel Pinto

    Abstract: Deep reinforcement learning primarily focuses on learning behavior, usually overlooking the fact that an agent's function is largely determined by form. So, how should one go about finding a morphology fit for solving tasks in a given environment? Current approaches that co-adapt morphology and behavior use a specific task's reward as a signal for morphology optimization. However, this often requi…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: ICLR 2021

  44. arXiv:2102.11271  [pdf, other]

    cs.LG cs.AI

    Reinforcement Learning with Prototypical Representations

    Authors: Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto

    Abstract: Learning effective representations in image-based environments is crucial for sample efficient Reinforcement Learning (RL). Unfortunately, in RL, representation learning is confounded with the exploratory experience of the agent -- learning a useful representation requires diverse data, while effective exploration is only possible with coherent representations. Furthermore, we would like to learn…

    Submitted 20 July, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Journal ref: ICML 2021

  45. arXiv:2012.09811  [pdf, other]

    cs.RO cs.CV cs.LG

    Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency

    Authors: Qiang Zhang, Tete Xiao, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang

    Abstract: At the heart of many robotics problems is the challenge of learning correspondences across domains. For instance, imitation learning requires obtaining correspondence between humans and robots; sim-to-real requires correspondence between physics simulators and the real world; transfer learning requires correspondences between different robotics environments. This paper aims to learn correspondence…

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: Project page: https://sjtuzq.github.io/cycle_dynamics.html

  46. arXiv:2012.07975  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation

    Authors: Albert Zhan, Ruihan Zhao, Lerrel Pinto, Pieter Abbeel, Michael Laskin

    Abstract: Recent advances in unsupervised representation learning significantly improved the sample efficiency of training Reinforcement Learning policies in simulated environments. However, similar gains have not yet been seen for real-robot reinforcement learning. In this work, we focus on enabling data-efficient real-robot learning from pixels. We present Contrastive Pre-training and Data Augmentation fo…

    Submitted 16 October, 2022; v1 submitted 14 December, 2020; originally announced December 2020.

  47. arXiv:2011.06698  [pdf, other]

    cs.RO cs.CV cs.LG

    Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

    Authors: Bryan Chen, Alexander Sax, Gene Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

    Abstract: Vision-based robotics often separates the control loop into one module for perception and a separate module for control. It is possible to train the whole system end-to-end (e.g. with deep RL), but doing it "from scratch" comes with a high sample complexity cost and the final result is often brittle, failing unexpectedly if the test environment differs from that of training. We study the effects…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Extended version of CoRL 2020 camera ready. Supplementary released separately

  48. arXiv:2008.04899  [pdf, other]

    cs.RO cs.CV cs.LG

    Visual Imitation Made Easy

    Authors: Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter Abbeel, Lerrel Pinto

    Abstract: Visual imitation learning provides a framework for learning complex manipulation behaviors by leveraging human demonstrations. However, current interfaces for imitation such as kinesthetic teaching or teleoperation prohibitively restrict our ability to efficiently collect large-scale data in the wild. Obtaining such diverse demonstration data is paramount for the generalization of learned skills t…

    Submitted 11 August, 2020; originally announced August 2020.

  49. Coinductive proof search for polarized logic with applications to full intuitionistic propositional logic

    Authors: José Espírito Santo, Ralph Matthes, Luís Pinto

    Abstract: The approach to proof search dubbed "coinductive proof search", and previously developed by the authors for implicational intuitionistic logic, is in this paper extended to LJP, a focused sequent-calculus presentation of polarized intuitionistic logic, including an array of positive and negative connectives. As before, this includes developing a coinductive description of the search space generate…

    Submitted 30 March, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: 22 pages incl. appendices; we now stress the dependence of the results on specific proof systems (seen in the abstract, hence the change of title). LJT now comes at the end of the main text. Thm 8 (was Thm 14) evolved, and we abandon modifications in the vector of declarations in two clauses for finitary representation. There is new material on type finiteness in LJP (developed in the appendix)

  50. arXiv:2007.07333  [pdf]

    cs.SI cs.CY

    Individual Factors that Influence Effort and Contributions on Wikipedia

    Authors: Luiz F. Pinto, Carlos Denner dos Santos, Silvia Onoyama

    Abstract: In this work, we aim to analyze how attitude, self-efficacy, and altruism influence effort and active contributions on Wikipedia. We propose a new conceptual model based on the theory of planned behavior and findings from the literature on online communities. This model differs from other models that have been previously proposed by considering altruism in its various facets (identification, recip…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: Presented at AoM 2019 in Boston