Skip to main content

Showing 1–21 of 21 results for author: Bonatti, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.02437  [pdf, other

    cs.CV

    EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields

    Authors: Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta

    Abstract: We present EvDNeRF, a pipeline for generating event data and training an event-based dynamic NeRF, for the purpose of faithfully reconstructing eventstreams on scenes with rigid and non-rigid deformations that may be too fast to capture with a standard camera. Event cameras register asynchronous per-pixel brightness changes at MHz rates with high dynamic range, making them ideal for observing fast… ▽ More

    Submitted 6 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 16 pages, 20 figures, 2 tables

  2. arXiv:2307.07909  [pdf, other

    cs.AI

    Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

    Authors: Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, Shuang Ma

    Abstract: We introduce DualMind, a generalist agent designed to tackle various decision-making tasks that addresses challenges posed by current methods, such as overfitting behaviors and dependence on task-specific fine-tuning. DualMind uses a novel "Dual-phase" training strategy that emulates how humans learn to act in the world. The model first learns fundamental common knowledge through a self-supervised… ▽ More

    Submitted 9 October, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  3. arXiv:2306.17582  [pdf, other

    cs.AI cs.CL cs.HC cs.LG cs.RO

    ChatGPT for Robotics: Design Principles and Model Abilities

    Authors: Sai Vemprala, Rogerio Bonatti, Arthur Bucker, Ashish Kapoor

    Abstract: This paper presents an experimental study regarding the use of OpenAI's ChatGPT for robotics applications. We outline a strategy that combines design principles for prompt engineering and the creation of a high-level function library which allows ChatGPT to adapt to different robotics tasks, simulators, and form factors. We focus our evaluations on the effectiveness of different prompt engineering… ▽ More

    Submitted 19 July, 2023; v1 submitted 20 February, 2023; originally announced June 2023.

  4. arXiv:2303.04212  [pdf, other

    cs.RO cs.LG

    ConBaT: Control Barrier Transformer for Safe Policy Learning

    Authors: Yue Meng, Sai Vemprala, Rogerio Bonatti, Chuchu Fan, Ashish Kapoor

    Abstract: Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  5. arXiv:2301.09816  [pdf, other

    cs.LG cs.AI

    SMART: Self-supervised Multi-task pretrAining with contRol Transformers

    Authors: Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor

    Abstract: Self-supervised pretraining has been extensively studied in language and vision domains, where a unified model can be easily adapted to various downstream tasks by pretraining representations without explicit labels. When it comes to sequential decision-making tasks, however, it is difficult to properly design such a pretraining approach that can cope with both high-dimensional perceptual informat… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  6. arXiv:2209.11133  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training

    Authors: Rogerio Bonatti, Sai Vemprala, Shuang Ma, Felipe Frujeri, Shuhang Chen, Ashish Kapoor

    Abstract: Robotics has long been a field riddled with complex systems architectures whose modules and connections, whether traditional or learning-based, require significant human expertise and prior knowledge. Inspired by large pre-trained language models, this work introduces a paradigm for pre-training a general purpose representation that can serve as a starting point for multiple tasks on a given robot… ▽ More

    Submitted 23 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  7. arXiv:2208.02918  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    LATTE: LAnguage Trajectory TransformEr

    Authors: Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Sai Vemprala, Rogerio Bonatti

    Abstract: Natural language is one of the most intuitive ways to express human intent. However, translating instructions and commands towards robotic motion generation and deployment in the real world is far from being an easy task. The challenge of combining a robot's inherent low-level geometric and kinodynamic constraints with a human's high-level semantic instructions traditionally is solved using task-s… ▽ More

    Submitted 16 September, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

  8. arXiv:2203.13411  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Resha** Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers

    Authors: Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Rogerio Bonatti

    Abstract: Natural language is the most intuitive medium for us to interact with other people when expressing commands and instructions. However, using language is seldom an easy task when humans need to express their intent towards robots, since most of the current language interfaces require rigid templates with a static set of action targets and commands. In this work, we provide a flexible language-based… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  9. arXiv:2110.03119  [pdf, other

    cs.RO

    Adaptive Safety Margin Estimation for Safe Real-Time Replanning under Time-Varying Disturbance

    Authors: Cherie Ho, Jay Patrikar, Rogerio Bonatti, Sebastian Scherer

    Abstract: Safe navigation in real-time is challenging because engineers need to work with uncertain vehicle dynamics, variable external disturbances, and imperfect controllers. A common safety strategy is to inflate obstacles by hand-defined margins. However, arbitrary static margins often fail in more dynamic scenarios, and using worst-case assumptions is overly conservative for most settings where disturb… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

  10. arXiv:2108.03936  [pdf, other

    cs.RO cs.CV

    3D Human Reconstruction in the Wild with Collaborative Aerial Cameras

    Authors: Cherie Ho, Andrew Jong, Harry Freeman, Rohan Rao, Rogerio Bonatti, Sebastian Scherer

    Abstract: Aerial vehicles are revolutionizing applications that require capturing the 3D structure of dynamic targets in the wild, such as sports, medicine, and entertainment. The core challenges in develo** a motion-capture system that operates in outdoors environments are: (1) 3D inference requires multiple simultaneous viewpoints of the target, (2) occlusion caused by obstacles is frequent when trackin… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 7 pages, 11 figures, IROS 2021

  11. arXiv:2104.01272  [pdf, other

    cs.RO eess.SY

    Visual Servoing Approach for Autonomous UAV Landing on a Moving Vehicle

    Authors: Azarakhsh Keipour, Guilherme A. S. Pereira, Rogerio Bonatti, Rohit Garg, Puru Rastogi, Geetesh Dubey, Sebastian Scherer

    Abstract: Many aerial robotic applications require the ability to land on moving platforms, such as delivery trucks and marine research boats. We present a method to autonomously land an Unmanned Aerial Vehicle on a moving vehicle. A visual servoing controller approaches the ground vehicle using velocity commands calculated directly in image space. The control laws generate velocity commands in all three di… ▽ More

    Submitted 26 December, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: 18 pages. Published in Sensors Journal

    Journal ref: Sensors 2022, 22(17), 6549

  12. arXiv:2011.10118  [pdf, other

    cs.CV cs.AI cs.GR cs.HC cs.RO

    Batteries, camera, action! Learning a semantic control space for expressive robot cinematography

    Authors: Rogerio Bonatti, Arthur Bucker, Sebastian Scherer, Mustafa Mukadam, Jessica Hodgins

    Abstract: Aerial vehicles are revolutionizing the way film-makers can capture shots of actors by composing novel aerial and dynamic viewpoints. However, despite great advancements in autonomous flight technology, generating expressive camera behaviors is still a challenge and requires non-technical users to edit a large number of unintuitive control parameters. In this work, we develop a data-driven framewo… ▽ More

    Submitted 31 March, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

  13. arXiv:2011.05437  [pdf, other

    cs.RO cs.CV cs.MA

    Do You See What I See? Coordinating Multiple Aerial Cameras for Robot Cinematography

    Authors: Arthur Bucker, Rogerio Bonatti, Sebastian Scherer

    Abstract: Aerial cinematography is significantly expanding the capabilities of film-makers. Recent progress in autonomous unmanned aerial vehicles (UAVs) has further increased the potential impact of aerial cameras, with systems that can safely track actors in unstructured cluttered environments. Professional productions, however, require the use of multiple cameras simultaneously to record different viewpo… ▽ More

    Submitted 31 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  14. arXiv:1910.06988  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Autonomous Aerial Cinematography In Unstructured Environments With Learned Artistic Decision-Making

    Authors: Rogerio Bonatti, Wenshan Wang, Cherie Ho, Aayush Ahuja, Mirko Gschwindt, Efe Camci, Erdal Kayacan, Sanjiban Choudhury, Sebastian Scherer

    Abstract: Aerial cinematography is revolutionizing industries that require live and dynamic camera viewpoints such as entertainment, sports, and security. However, safely piloting a drone while filming a moving target in the presence of obstacles is immensely taxing, often requiring multiple expert human operators. Hence, there is demand for an autonomous cinematographer that can reason about both geometry… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  15. arXiv:1909.06993  [pdf, other

    cs.CV cs.RO

    Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations

    Authors: Rogerio Bonatti, Ratnesh Madaan, Vibhav Vineet, Sebastian Scherer, Ashish Kapoor

    Abstract: Machines are a long way from robustly solving open-world perception-control tasks, such as first-person view (FPV) aerial navigation. While recent advances in end-to-end Machine Learning, especially Imitation and Reinforcement Learning appear promising, they are constrained by the need of large amounts of difficult-to-collect labeled real-world data. Simulated data, on the other hand, is easy to g… ▽ More

    Submitted 8 March, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  16. arXiv:1907.04905  [pdf

    cs.IR cs.CL cs.LG

    Development of email classifier in Brazilian Portuguese using feature selection for automatic response

    Authors: Rogerio Bonatti, Arthur Gola de Paula

    Abstract: Automatic email categorization is an important application of text classification. We study the automatic reply of email business messages in Brazilian Portuguese. We present a novel corpus containing messages from a real application, and baseline categorization experiments using Naive Bayes and support Vector Machines. We then discuss the effect of lemmatization and the role of part-of-speech tag… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

  17. arXiv:1904.02579  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Can a Robot Become a Movie Director? Learning Artistic Principles for Aerial Cinematography

    Authors: Mirko Gschwindt, Efe Camci, Rogerio Bonatti, Wenshan Wang, Erdal Kayacan, Sebastian Scherer

    Abstract: Aerial filming is constantly gaining importance due to the recent advances in drone technology. It invites many intriguing, unsolved problems at the intersection of aesthetical and scientific challenges. In this work, we propose a deep reinforcement learning agent which supervises motion planning of a filming drone by making desirable shot mode selections based on aesthetical values of video shots… ▽ More

    Submitted 15 October, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

  18. arXiv:1904.02319  [pdf, other

    cs.RO cs.CV cs.LG

    Towards a Robust Aerial Cinematography Platform: Localizing and Tracking Moving Targets in Unstructured Environments

    Authors: Rogerio Bonatti, Cherie Ho, Wenshan Wang, Sanjiban Choudhury, Sebastian Scherer

    Abstract: The use of drones for aerial cinematography has revolutionized several applications and industries that require live and dynamic camera viewpoints such as entertainment, sports, and security. However, safely controlling a drone while filming a moving target usually requires multiple expert human operators; hence the need for an autonomous cinematographer. Current approaches have severe real-life l… ▽ More

    Submitted 28 July, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

  19. arXiv:1903.11174  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Improved Generalization of Heading Direction Estimation for Aerial Filming Using Semi-supervised Regression

    Authors: Wenshan Wang, Aayush Ahuja, Yanfu Zhang, Rogerio Bonatti, Sebastian Scherer

    Abstract: In the task of Autonomous aerial filming of a moving actor (e.g. a person or a vehicle), it is crucial to have a good heading direction estimation for the actor from the visual input. However, the models obtained in other similar tasks, such as pedestrian collision risk analysis and human-robot interaction, are very difficult to generalize to the aerial filming task, because of the difference in d… ▽ More

    Submitted 26 March, 2019; originally announced March 2019.

  20. arXiv:1810.07225  [pdf, other

    cs.RO cs.AI cs.LG

    Integrating kinematics and environment context into deep inverse reinforcement learning for predicting off-road vehicle trajectories

    Authors: Yanfu Zhang, Wenshan Wang, Rogerio Bonatti, Daniel Maturana, Sebastian Scherer

    Abstract: Predicting the motion of a mobile agent from a third-person perspective is an important component for many robotics applications, such as autonomous navigation and tracking. With accurate motion prediction of other agents, robots can plan for more intelligent behaviors to achieve specified objectives, instead of acting in a purely reactive way. Previous work addresses motion prediction by either o… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: CoRL 2018

  21. arXiv:1808.09563  [pdf, other

    cs.RO cs.AI

    Autonomous drone cinematographer: Using artistic principles to create smooth, safe, occlusion-free trajectories for aerial filming

    Authors: Rogerio Bonatti, Yanfu Zhang, Sanjiban Choudhury, Wenshan Wang, Sebastian Scherer

    Abstract: Autonomous aerial cinematography has the potential to enable automatic capture of aesthetically pleasing videos without requiring human intervention, empowering individuals with the capability of high-end film studios. Current approaches either only handle off-line trajectory generation, or offer strategies that reason over short time horizons and simplistic representations for obstacles, which re… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.