Showing 1–2 of 2 results for author: Ramirez, O A

  1. arXiv:2405.21043 [pdf, other]

    cs.LG cs.AI

    Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

    Authors: Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans

    Abstract: We prove that the combination of a target network and over-parameterized linear function approximation establishes a weaker convergence condition for bootstrapped value estimation in certain cases, even with off-policy data. Our condition is naturally satisfied for expected updates over the entire state-action space or learning with a batch of complete trajectories from episodic Markov decision pr…

    Submitted 31 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024

  2. Park4U Mate: Context-Aware Digital Assistant for Personalized Autonomous Parking

    Authors: Antonyo Musabini, Evin Bozbayir, Hervé Marcasuzaa, Omar Adair Islas Ramírez

    Abstract: People park their vehicle depending on interior and exterior contexts. They do it naturally, even unconsciously. For instance, with a baby seat on the rear, the driver might leave more space on one side to be able to get the baby out easily; or when grocery shopping, s/he may position the vehicle to remain the trunk accessible. Autonomous vehicles are becoming technically effective at driving from…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Accepted at 2021 IEEE Intelligent Vehicles Symposium - IV (matching camera-ready version)