Showing 1–6 of 6 results for author: Ramirez, O

Searching in archive cs.
  1. arXiv:2405.21043  [pdf, other]

    cs.LG cs.AI

    Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

    Authors: Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans

    Abstract: We prove that the combination of a target network and over-parameterized linear function approximation establishes a weaker convergence condition for bootstrapped value estimation in certain cases, even with off-policy data. Our condition is naturally satisfied for expected updates over the entire state-action space or learning with a batch of complete trajectories from episodic Markov decision pr…

    Submitted 31 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024

  2. arXiv:2203.13551  [pdf, other]

    cs.LG

    Feature extraction using Spectral Clustering for Gene Function Prediction using Hierarchical Multi-label Classification

    Authors: Miguel Romero, Oscar Ramírez, Jorge Finke, Camilo Rocha

    Abstract: Gene annotation addresses the problem of predicting unknown associations between genes and functions (e.g., biological processes) of a specific organism. Despite recent advances, the cost and time demanded by annotation procedures that rely largely on in vivo biological experiments remain prohibitively high. This paper presents a novel in silico approach to the annotation problem that combines…

    Submitted 28 April, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  3. arXiv:2109.00137  [pdf, other]

    cs.RO cs.CV cs.LG

    Implicit Behavioral Cloning

    Authors: Pete Florence, Corey Lynch, Andy Zeng, Oscar Ramirez, Ayzaan Wahid, Laura Downs, Adrian Wong, Johnny Lee, Igor Mordatch, Jonathan Tompson

    Abstract: We find that across a wide range of robot policy learning scenarios, treating supervised policy learning with an implicit model generally performs better, on average, than commonly used explicit models. We present extensive experiments on this finding, and we provide both intuitive insight and theoretical arguments distinguishing the properties of implicit models compared to their explicit counter…

    Submitted 31 August, 2021; originally announced September 2021.

  4. Park4U Mate: Context-Aware Digital Assistant for Personalized Autonomous Parking

    Authors: Antonyo Musabini, Evin Bozbayir, Hervé Marcasuzaa, Omar Adair Islas Ramírez

    Abstract: People park their vehicle depending on interior and exterior contexts. They do it naturally, even unconsciously. For instance, with a baby seat on the rear, the driver might leave more space on one side to be able to get the baby out easily; or when grocery shopping, s/he may position the vehicle to keep the trunk accessible. Autonomous vehicles are becoming technically effective at driving from…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Accepted at 2021 IEEE Intelligent Vehicles Symposium - IV (matching camera-ready version)

  5. arXiv:2101.02722  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels

    Authors: Austin Stone, Oscar Ramirez, Kurt Konolige, Rico Jonschkowski

    Abstract: Robots have to face challenging perceptual settings, including changes in viewpoint, lighting, and background. Current simulated reinforcement learning (RL) benchmarks such as DM Control provide visual input without such complexity, which limits the transfer of well-performing methods to the real world. In this paper, we extend DM Control with three kinds of visual distractions (variations in back…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: Code available at https://github.com/google-research/google-research/tree/master/distracting_control

  6. arXiv:1710.03937  [pdf, ps, other]

    cs.AI cs.LG cs.RO

    PRM-RL: Long-range Robotic Navigation Tasks by Combining Reinforcement Learning and Sampling-based Planning

    Authors: Aleksandra Faust, Oscar Ramirez, Marek Fiser, Kenneth Oslund, Anthony Francis, James Davidson, Lydia Tapia

    Abstract: We present PRM-RL, a hierarchical method for long-range navigation task completion that combines sampling-based path planning with reinforcement learning (RL). The RL agents learn short-range, point-to-point navigation policies that capture robot dynamics and task constraints without knowledge of the large-scale topology. Next, the sampling-based planners provide roadmaps which connect robot confi…

    Submitted 16 May, 2018; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: 9 pages, 7 figures

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018