Showing 1–50 of 107 results for author: Pinto, L

  1. arXiv:2406.07539  [pdf, other]

    cs.RO

    BAKU: An Efficient Transformer for Multi-Task Policy Learning

    Authors: Siddhant Haldar, Zhuoran Peng, Lerrel Pinto

    Abstract: Training generalist agents capable of solving diverse tasks is challenging, often requiring large datasets of expert demonstrations. This is particularly problematic in robotics, where each data point requires physical execution of actions in the real world. Thus, there is a pressing need for architectures that can effectively leverage the available training data. In this work, we present BAKU, a…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.04318  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction

    Authors: Chen-Yu Yen, Raghav Singhal, Umang Sharma, Rajesh Ranganath, Sumit Chopra, Lerrel Pinto

    Abstract: Magnetic Resonance (MR) imaging, despite its proven diagnostic utility, remains an inaccessible imaging modality for disease surveillance at the population level. A major factor rendering MR inaccessible is lengthy scan times. An MR scanner collects measurements associated with the underlying anatomy in the Fourier space, also known as the k-space. Creating a high-fidelity image requires collectin…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Project website at https://adaptive-sampling-mr.github.io

  3. arXiv:2403.18836  [pdf, other]

    math.RT math.CT

    Triangulated structure for Bondarenko's Categories

    Authors: Germán Benitez, Gustavo Costa, Lucas Q. Pinto

    Abstract: V. Bondarenko and Y. Drozd give a description of all indecomposable objects in a category of representations of posets, nowadays known as Bondarenko's category. This category was essential for V. Bekkert and H. Merklen to classify all indecomposable objects of the derived category of gentle algebras. In view of this connection with the derived category, which possesses a triangulated structure, an…

    Submitted 14 February, 2024; originally announced March 2024.

    Comments: 12 pages

    MSC Class: 16G20; 18G80

  4. arXiv:2403.08439  [pdf, other]

    q-bio.QM

    Characterisation of Anti-Arrhythmic Drug Effects on Cardiac Electrophysiology using Physics-Informed Neural Networks

    Authors: Ching-En Chiu, Arieh Levy Pinto, Rasheda A Chowdhury, Kim Christensen, Marta Varela

    Abstract: The ability to accurately infer cardiac electrophysiological (EP) properties is key to improving arrhythmia diagnosis and treatment. In this work, we developed a physics-informed neural networks (PINNs) framework to predict how different myocardial EP parameters are modulated by anti-arrhythmic drugs. Using $\textit{in vitro}$ optical mapping images and the 3-channel Fenton-Karma model, we estimat…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in the 21st IEEE International Symposium on Biomedical Imaging 2024

  5. arXiv:2403.07870  [pdf, other]

    cs.RO

    OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation

    Authors: Aadhithya Iyer, Zhuoran Peng, Yinlong Dai, Irmak Guzey, Siddhant Haldar, Soumith Chintala, Lerrel Pinto

    Abstract: Open-sourced, user-friendly tools form the bedrock of scientific advancement across disciplines. The widespread adoption of data-driven learning has led to remarkable progress in multi-fingered dexterity, bimanual manipulation, and applications ranging from logistics to home robotics. However, existing data collection platforms are often proprietary, costly, or tailored to specific robotic morphol…

    Submitted 12 March, 2024; originally announced March 2024.

  6. arXiv:2403.03181  [pdf, other]

    cs.LG cs.AI cs.RO

    Behavior Generation with Latent Actions

    Authors: Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Generative modeling of complex behaviors from labeled datasets has been a longstanding problem in decision making. Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction. A recent class of models calle…

    Submitted 28 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Github repo: https://github.com/jayLEE0301/vq_bet_official

  7. arXiv:2402.10211  [pdf, other]

    cs.LG cs.RO eess.SP

    Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

    Authors: Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto

    Abstract: Reasoning from sequences of raw sensory data is a ubiquitous problem across fields ranging from medical devices to robotics. These problems often involve using long sequences of raw sensor data (e.g. magnetometers, piezoresistors) to predict sequences of desirable physical quantities (e.g. force, inertial measurements). While classical approaches are powerful for locally-linear prediction problems…

    Submitted 15 February, 2024; originally announced February 2024.

  8. arXiv:2401.12202  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

    Authors: Peiqi Liu, Yaswanth Orru, Jay Vakil, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Remarkable progress has been made in recent years in the fields of vision, language, and robotics. We now have vision models capable of recognizing objects based on language queries, navigation systems that can effectively control mobile systems, and grasping models that can handle a wide range of objects. Despite these advancements, general-purpose applications of robotics still lag behind, even…

    Submitted 29 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Github repo: https://github.com/ok-robot/ok-robot

  9. arXiv:2401.09252  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey

    Authors: Thiago Lopes Trugillo da Silveira, Paulo Gamarra Lessa Pinto, Jeffri Erwin Murrugarra Llerena, Claudio Rosito Jung

    Abstract: This paper provides a comprehensive survey on pioneer and state-of-the-art 3D scene geometry estimation methodologies based on single, two, or multiple images captured under the omnidirectional optics. We first revisit the basic concepts of the spherical camera model, and review the most common acquisition technologies and representation formats suitable for omnidirectional (also called 360…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Published in ACM Computing Surveys

    Journal ref: ACM Comput. Surv. 55, 4, Article 68, 2023

  10. arXiv:2312.17261  [pdf, other]

    cs.CV cs.LG

    Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing

    Authors: Juliano Pinto, Georg Hess, Yuxuan Xia, Henk Wymeersch, Lennart Svensson

    Abstract: Multi-object tracking (MOT) is the task of estimating the state trajectories of an unknown and time-varying number of objects over a certain time window. Several algorithms have been proposed to tackle the multi-object smoothing task, where object detections can be conditioned on all the measurements in the time window. However, the best-performing methods suffer from intractable computational com…

    Submitted 22 December, 2023; originally announced December 2023.

  11. arXiv:2312.07540  [pdf, other]

    cs.AI cs.CL cs.LG

    diff History for Neural Language Agents

    Authors: Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

    Abstract: Neural Language Models (LMs) offer an exciting solution for general-purpose embodied control. However, a key technical issue arises when using an LM-based controller: environment observations must be converted to text, which coupled with history, results in long and verbose textual prompts. As a result, prior work in LM agents is limited to restricted domains with small observation size as well as…

    Submitted 11 June, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: ICML 2024 version

  12. arXiv:2311.16098  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    On Bringing Robots Home

    Authors: Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

    Abstract: Throughout history, we have successfully integrated various machines into our homes. Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent examples. However, these machines excel at performing only a single task effectively. The concept of a "generalist machine" in homes - a domestic assistant that can adapt and learn from our needs, all while remaining cost-effective - h…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Project website and videos are available at https://dobb-e.com, technical documentation for getting started is available at https://docs.dobb-e.com, and code is released at https://github.com/notmahi/dobb-e

  13. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  14. arXiv:2310.08573  [pdf, other]

    cs.RO

    PolyTask: Learning Unified Policies through Behavior Distillation

    Authors: Siddhant Haldar, Lerrel Pinto

    Abstract: Unified models capable of solving a wide variety of tasks have gained traction in vision and NLP due to their ability to share regularities and structures across tasks, which improves individual task performance and reduces computational footprint. However, the impact of such models remains limited in embodied learning problems, which present unique challenges due to interactivity, sample ineffici…

    Submitted 12 October, 2023; originally announced October 2023.

  15. arXiv:2310.08174  [pdf, other]

    physics.ins-det hep-ex physics.ao-ph physics.space-ph

    Mapping Water on the Moon and Mars using a Muon Tomograph

    Authors: Olin Lyod Pinto, Jörg Miikael Tiit

    Abstract: The search for water on the Lunar and Martian surfaces is a fundamental aspect of space exploration, contributing to the understanding of the history and evolution of these celestial bodies. However, the current understanding of the distribution, concentration, origin, and migration of water on these surfaces is limited. Moreover, there is a need for more detailed data on these aspects of Lunar an…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: The paper has been submitted to the JOURNAL OF ADVANCED INSTRUMENTATION IN SCIENCE

  16. arXiv:2309.12300  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    See to Touch: Learning Tactile Dexterity through Visual Incentives

    Authors: Irmak Guzey, Yinlong Dai, Ben Evans, Soumith Chintala, Lerrel Pinto

    Abstract: Equipping multi-fingered robots with tactile sensing is crucial for achieving the precise, contact-rich, and dexterous manipulation that humans excel at. However, relying solely on tactile sensing fails to provide adequate cues for reasoning about objects' spatial configurations, limiting the ability to correct errors and adapt to changing situations. In this paper, we present Tactile Adaptation f…

    Submitted 21 September, 2023; originally announced September 2023.

  17. arXiv:2307.12505  [pdf]

    q-bio.NC nlin.AO physics.data-an

    Optimizing parameter search for community detection in time evolving networks of complex systems

    Authors: ItaloIvo Lima Dias Pinto, Javier Omar Garcia, Kanika Bansal

    Abstract: Network representations have been effectively employed to analyze complex systems across various areas and applications, leading to the development of network science as a core tool to study systems with multiple components and complex interactions. There is a growing interest in understanding the temporal dynamics of complex networks to decode the underlying dynamic processes through the temporal…

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: 28 pages, 7 figures

  18. arXiv:2306.12554  [pdf, other]

    cs.LG cs.AI

    Improving Long-Horizon Imitation Through Instruction Prediction

    Authors: Joey Hejna, Pieter Abbeel, Lerrel Pinto

    Abstract: Complex, long-horizon planning and its combinatorial nature pose steep challenges for learning-based agents. Difficulties in such settings are exacerbated in low data regimes where over-fitting stifles generalization and compounding errors hurt accuracy. In this work, we explore the use of an often unused source of auxiliary supervision: language. Inspired by recent advances in transformer-based m…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Published at AAAI 2023

  19. arXiv:2306.00942  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Train Offline, Test Online: A Real Robot Learning Benchmark

    Authors: Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta

    Abstract: Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robotic hardware for evaluating methods…

    Submitted 30 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICRA 2023

  20. arXiv:2305.19240  [pdf, other]

    cs.LG cs.AI

    NetHack is Hard to Hack

    Authors: Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

    Abstract: Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents out…

    Submitted 30 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  21. arXiv:2303.12076  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Dexterity from Touch: Self-Supervised Pre-Training of Tactile Representations with Robotic Play

    Authors: Irmak Guzey, Ben Evans, Soumith Chintala, Lerrel Pinto

    Abstract: Teaching dexterity to multi-fingered robots has been a longstanding challenge in robotics. Most prominent work in this area focuses on learning controllers or policies that either operate on visual observations or state estimates derived from vision. However, such methods perform poorly on fine-grained manipulation tasks that require reasoning about contact forces or about objects occluded by the…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Video and code can be accessed here: https://tactile-dexterity.github.io/

  22. arXiv:2303.05647  [pdf, other]

    cond-mat.mes-hall hep-th quant-ph

    Electronic states in quantum wires on a Möbius strip

    Authors: J. J. L. R. Pinto, J. E. G. Silva, C. A. S. Almeida

    Abstract: We study the properties of a two-dimensional non-relativistic electron gas (TDEG) constrained on wires along a Möbius strip. We considered wires around the strip and along the transverse direction, across the width of the strip. For each direction, we investigate how the curvature modifies the electronic states and their corresponding energy spectrum. At the center of the strip, the wires around t…

    Submitted 6 June, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 16 pages, 11 captioned figures. Updated version to match that one published in Physica Scripta

    Journal ref: Phys. Scr. 99 (2024) 0659c2

  23. arXiv:2303.01497  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations

    Authors: Siddhant Haldar, Jyothish Pari, Anant Rai, Lerrel Pinto

    Abstract: While imitation learning provides us with an efficient toolkit to train robots, learning skills that are robust to environment variations remains a significant challenge. Current approaches address this challenge by relying either on large amounts of demonstrations that span environment variations or on handcrafted reward functions that require state estimates. Both directions are not scalable to…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Code and robot videos are available at https://fast-imitation.github.io/

  24. arXiv:2210.10047  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

    Authors: Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging. One critical reason for this is that uncurated robot demonstration data, i.e. play data, collected from non-expert human demonstrators are often noisy, diverse, and distributionally multi-modal. Thi…

    Submitted 15 December, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Code and data available at: https://play-to-policy.github.io; (fixed metadata author name format)

  25. arXiv:2210.06463  [pdf, other]

    cs.RO cs.AI cs.CV cs.HC cs.LG

    Holo-Dex: Teaching Dexterity with Immersive Mixed Reality

    Authors: Sridhar Pandian Arunachalam, Irmak Güzey, Soumith Chintala, Lerrel Pinto

    Abstract: A fundamental challenge in teaching robots is to provide an effective interface for human teachers to demonstrate useful skills to a robot. This challenge is exacerbated in dexterous manipulation, where teaching high-dimensional, contact-rich behaviors often requires esoteric teleoperation tools. In this work, we present Holo-Dex, a framework for dexterous manipulation that places a teacher in an i…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Data, code and videos are available at https://holo-dex.github.io

  26. arXiv:2210.05663  [pdf, other]

    cs.RO cs.CV

    CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

    Authors: Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

    Abstract: We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. CLIP-Fields learns a mapping from spatial locations to semantic embedding vectors. Importantly, we show that this mapping can be trained with supervision coming only from web-image and web-text trained models such…

    Submitted 22 May, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Code, video, and interactive demonstrations available at https://mahis.life/clip-fields. Accepted for publication at Robotics: Science and Systems 2023 in Daegu, Korea

  27. arXiv:2210.01116  [pdf, other]

    cs.RO cs.LG cs.SD eess.AS

    That Sounds Right: Auditory Self-Supervision for Dynamic Robot Manipulation

    Authors: Abitha Thankaraj, Lerrel Pinto

    Abstract: Learning to produce contact-rich, dynamic behaviors from raw sensory data has been a longstanding challenge in robotics. Prominent approaches primarily focus on using visual or tactile sensing, where unfortunately one fails to capture high-frequency interaction, while the other can be too delicate for large-scale data collection. In this work, we propose a data-centric approach to dynamic manipula…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Videos and audio data are best seen on our project website: audio-robot-learning.github.io

  28. arXiv:2208.02932  [pdf, other]

    cs.AI cs.HC cs.LG

    Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment

    Authors: Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C. -C. Jay Kuo, Stefanos Nikolaidis

    Abstract: Human-centered AI considers human experiences with AI performance. While abundant research has been helping AI achieve superhuman performance either by fully automatic or weak supervision learning, fewer endeavors are experimenting with how AI can tailor to humans' preferred skill level given fine-grained input. In this work, we guide the curriculum reinforcement learning results towards a preferr…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 6 pages, 7 figures

    ACM Class: I.2.6

  29. arXiv:2206.15469  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Watch and Match: Supercharging Imitation with Regularized Optimal Transport

    Authors: Siddhant Haldar, Vaibhav Mathur, Denis Yarats, Lerrel Pinto

    Abstract: Imitation learning holds tremendous promise in learning policies efficiently for complex decision making problems. Current state-of-the-art algorithms often use inverse reinforcement learning (IRL), where given a set of expert demonstrations, an agent alternately infers a reward function and the associated optimal policy. However, such IRL approaches often require substantial online interactions…

    Submitted 20 February, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: Code and robot videos are available on https://rot-robot.github.io/

  30. arXiv:2206.11251  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Behavior Transformers: Cloning $k$ modes with one stone

    Authors: Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

    Abstract: While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance, multiple modes, and human demonstrations typically do not come with reward labels. These properties limit the applicability of current methods in Offline RL and Beha…

    Submitted 11 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Code and data available at https://github.com/notmahi/bet

  31. arXiv:2203.13251  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation

    Authors: Sridhar Pandian Arunachalam, Sneha Silwal, Ben Evans, Lerrel Pinto

    Abstract: Optimizing behaviors for dexterous manipulation has been a longstanding challenge in robotics, with a variety of methods from model-based control to model-free reinforcement learning having been previously explored in literature. Perhaps one of the most powerful techniques to learn complex manipulation strategies is imitation learning. However, collecting and learning from demonstrations in dexter…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: The first two authors contributed equally

  32. arXiv:2203.11176  [pdf, other]

    cs.LG cs.AI cs.RO

    One After Another: Learning Incremental Skills for a Changing World

    Authors: Nur Muhammad Shafiullah, Lerrel Pinto

    Abstract: Reward-free, unsupervised discovery of skills is an attractive alternative to the bottleneck of hand-designing rewards in environments where task supervision is scarce or expensive. However, current skill pre-training methods, like many RL techniques, make a fundamental assumption - stationary environments during training. Traditional methods learn all their skills simultaneously, which makes it d…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: To be published in The International Conference on Learning Representations (ICLR) 2022

  33. arXiv:2203.08098  [pdf, other]

    cs.RO

    RB2: Robotic Manipulation Benchmarking with a Twist

    Authors: Sudeep Dasari, Jianren Wang, Joyce Hong, Shikhar Bahl, Yixin Lin, Austin Wang, Abitha Thankaraj, Karanbir Chahal, Berk Calli, Saurabh Gupta, David Held, Lerrel Pinto, Deepak Pathak, Vikash Kumar, Abhinav Gupta

    Abstract: Benchmarks offer a scientific way to compare algorithms using objective performance metrics. Good benchmarks have two features: (a) they should be widely useful for many research groups; (b) and they should produce reproducible findings. In robotic manipulation research, there is a trade-off between reproducibility and broad accessibility. If the benchmark is kept restrictive (fixed hardware, obje…

    Submitted 30 October, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: accepted at the NeurIPS 2021 Datasets and Benchmarks Track

  34. arXiv:2203.05549  [pdf, other]

    cs.RO cs.AI cs.LG

    Context is Everything: Implicit Identification for Dynamics Adaptation

    Authors: Ben Evans, Abitha Thankaraj, Lerrel Pinto

    Abstract: Understanding environment dynamics is necessary for robots to act safely and optimally in the world. In realistic scenarios, dynamics are non-stationary and the causal variables such as environment parameters cannot necessarily be precisely measured or inferred, even during training. We propose Implicit Identification for Dynamics Adaptation (IIDA), a simple method to allow predictive models to ad…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted at ICRA 2022

  35. arXiv:2202.07909  [pdf, other]

    cs.LG cs.CV eess.SY

    Can Deep Learning be Applied to Model-Based Multi-Object Tracking?

    Authors: Juliano Pinto, Georg Hess, William Ljungbergh, Yuxuan Xia, Henk Wymeersch, Lennart Svensson

    Abstract: Multi-object tracking (MOT) is the problem of tracking the state of an unknown and time-varying number of objects using noisy measurements, with important applications such as autonomous driving, tracking animal behavior, defense systems, and others. In recent years, deep learning (DL) has been increasingly used in MOT for improving tracking performance, but mostly in settings where the measuremen…

    Submitted 16 February, 2022; originally announced February 2022.

  36. arXiv:2201.13425  [pdf, other]

    cs.LG cs.AI

    Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

    Authors: Denis Yarats, David Brandfonbrener, Hao Liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto

    Abstract: Recent progress in deep learning has relied on access to large and diverse datasets. Such data-driven progress has been less evident in offline reinforcement learning (RL), because offline RL data is usually collected to optimize specific target tasks limiting the data's diversity. In this work, we propose Exploratory data for Offline RL (ExORL), a data-centric approach to offline RL. ExORL first…

    Submitted 5 April, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

  37. arXiv:2112.01511  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    The Surprising Effectiveness of Representation Learning for Visual Imitation

    Authors: Jyothish Pari, Nur Muhammad Shafiullah, Sridhar Pandian Arunachalam, Lerrel Pinto

    Abstract: While visual imitation learning offers one of the most effective ways of learning from visual demonstrations, generalizing from them requires either hundreds of diverse demonstrations, task specific priors, or large, hard-to-train parametric models. One reason such complexities arise is because standard visual imitation frameworks try to solve two coupled problems at once: learning a succinct but…

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: The first two authors contributed equally

  38. arXiv:2111.08084  [pdf, ps, other]

    cs.IT

    Finding the Minimum Norm and Center Density of Cyclic Lattices via Nonlinear Systems

    Authors: William Lima da Silva Pinto, Carina Alves

    Abstract: Lattices with a circulant generator matrix represent a subclass of cyclic lattices. This subclass can be described by a basis containing a vector and its circular shifts. In this paper, we present certain conditions under which the norm expression of an arbitrary vector of this type of lattice is substantially simplified, and then investigate some of the lattices obtained under these conditions. W…

    Submitted 5 July, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: preprint, 28 pages, 1 figure

    MSC Class: 11H31; 52C17; 15A15; 15A03; 90C30

  39. arXiv:2110.15191  [pdf, other]

    cs.LG cs.AI cs.RO

    URLB: Unsupervised Reinforcement Learning Benchmark

    Authors: Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel

    Abstract: Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to new tasks remains an outstanding challenge. Recent advances in unsupervised RL have shown that pre-training RL agents with self-supervised intrinsic rewards can result in efficient adaptation. However, these algorithms…

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Code for the Unsupervised Reinforcement Learning Benchmark is available at https://github.com/rll-research/url_benchmark

  40. An Uncertainty-Aware Performance Measure for Multi-Object Tracking

    Authors: Juliano Pinto, Yuxuan Xia, Lennart Svensson, Henk Wymeersch

    Abstract: Evaluating the performance of multi-object tracking (MOT) methods is not straightforward, and existing performance measures fail to consider all the available uncertainty information in the MOT context. This can lead practitioners to select models which produce uncertainty estimates of lower quality, negatively impacting any downstream systems that rely on them. Additionally, most MOT performance…

    Submitted 9 September, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted to IEEE Signal Processing Letters 2021

  41. arXiv:2107.09645  [pdf, other]

    cs.AI cs.LG

    Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

    Authors: Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto

    Abstract: We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements that yield state-of-the-art results on the DeepMind Control Suite. Notably, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from…

    Submitted 20 July, 2021; originally announced July 2021.

  42. arXiv:2107.09046  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Playful Interactions for Representation Learning

    Authors: Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto

    Abstract: One of the key challenges in visual imitation learning is collecting large amounts of expert demonstrations for a given task. While methods for collecting human demonstrations are becoming easier with teleoperation methods and the use of low-cost assistive tools, we often still require 100-1000 demonstrations for every task to learn a visual representation and policy. To address this, we turn to a…

    Submitted 19 July, 2021; originally announced July 2021.

  43. arXiv:2107.03856  [pdf, ps, other]

    physics.bio-ph

    Characterization of biophysical determinants of spatio-temporal calcium dynamics in astrocytes

    Authors: Thais Appelt Peres Bartiê, Leonel Teixeira Pinto

    Abstract: Most of the functions performed by astrocytes in brain information processing are related to calcium waves. Experimental studies involving calcium waves present discrepant results, leading to gaps in the full understanding of the functions of these cells. The use of mathematical models helps to understand the experimental results, identifying chemical mechanisms involved in calcium waves and the li…

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 26 pages, 26 figures, research paper with calcium wave in astrocytes: modeling and simulation

    MSC Class: 65N06

  44. The use of hyaluronic acid in individuals with cleft lip and palate: Literature review

    Authors: Kelly Fernanda Molena, Lidiane de Castro Pinto, Gisele da Silva Dalben

    Abstract: Since Resolution 198/2019 of the Brazilian Dental Council, which regulates orofacial harmonization as a dental specialty, and the advent of various uses of facial fillers, such as hyaluronic acid (HA), it is possible to perform both esthetic and functional corrections in individuals. Individuals with cleft lip and palate (CLP) present lip irregularities even after orofacial rehabilitation with an…

    Submitted 7 June, 2021; originally announced June 2021.

    Journal ref: J Cleft Lip Palate Craniofac Anomal 2021;8:143-8

  45. arXiv:2106.00639  [pdf, other]

    eess.AS cs.SD eess.SP

    Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms

    Authors: Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The research direction of identifying acoustic bio-markers of respiratory diseases has received renewed interest following the onset of the COVID-19 pandemic. In this paper, we design an approach to COVID-19 diagnostic using crowd-sourced multi-modal data. The data resource, consisting of acoustic signals like cough, breathing, and speech signals, along with the data of symptoms, are recorded using a…

    Submitted 5 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: The Manuscript is submitted to IEEE-EMBS Journal of Biomedical and Health Informatics on June 1, 2021

  46. arXiv:2104.02844  [pdf, other]

    eess.SY cs.AI

    GEM: Group Enhanced Model for Learning Dynamical Control Systems

    Authors: Philippe Hansen-Estruch, Wenling Shang, Lerrel Pinto, Pieter Abbeel, Stas Tiomkin

    Abstract: Learning the dynamics of a physical system wherein an autonomous agent operates is an important task. Often these systems present apparent geometric structures. For instance, the trajectories of a robotic manipulator can be broken down into a collection of its transitional and rotational motions, fully characterized by the corresponding Lie groups and Lie algebras. In this work, we take advantage…

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 14 pages, 8 figures

  47. arXiv:2104.00734  [pdf, other]

    cs.LG cs.RO

    Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning

    Authors: Juliano Pinto, Georg Hess, William Ljungbergh, Yuxuan Xia, Lennart Svensson, Henk Wymeersch

    Abstract: Multitarget Tracking (MTT) is the problem of tracking the states of an unknown number of objects using noisy measurements, with important applications to autonomous driving, surveillance, robotics, and others. In the model-based Bayesian setting, there are conjugate priors that enable us to express the multi-object posterior in closed form, which could theoretically provide Bayes-optimal estimates…

    Submitted 4 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 8 pages, 4 figures

  48. arXiv:2103.16732  [pdf, other]

    cs.RO cs.AI

    Simultaneous Navigation and Construction Benchmarking Environments

    Authors: Wenyu Han, Chen Feng, Haoran Wu, Alexander Gao, Armand Jordana, Dong Liu, Lerrel Pinto, Ludovic Righetti

    Abstract: We need intelligent robots for mobile construction, the process of navigating in an environment and modifying its structure according to a geometric design. In this task, a major robot vision and learning challenge is how to exactly achieve the design without GPS, due to the difficulty caused by the bi-directional coupling of accurate robot localization and navigation together with strategic envir…

    Submitted 30 March, 2021; originally announced March 2021.

  49. arXiv:2103.09148  [pdf, other]

    eess.AS cs.SD

    DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics

    Authors: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda

    Abstract: The DiCOVA challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This challenge is an open call for researchers to analyze a dataset of sound recordings collected from COVID-19 infected and non-COVID-19 individuals for a two-class classification. These…

    Submitted 17 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of Interspeech, 2021

  50. arXiv:2102.13192  [pdf, other]

    cs.NI

    PlaceRAN: Optimal Placement of Virtualized Network Functions in the Next-generation Radio Access Networks

    Authors: Fernando Zanferrari Morais, Gabriel Matheus de Almeida, Leizer Pinto, Kleber Vieira Cardoso, Luis M. Contreras, Rodrigo da Rosa Righi, Cristiano Bonato Both

    Abstract: The fifth-generation mobile evolution enables several transformations on Next Generation Radio Access Networks (NG-RAN). The RAN protocol stack is splitting into eight possible disaggregated options combined into three network units, i.e., Central, Distributed, and Radio. Besides that, further advances allow the RAN software to be virtualized on top of general-purpose vendor-neutral hardware, deal…

    Submitted 28 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.