Skip to main content

Showing 1–6 of 6 results for author: Mozifian, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.18820  [pdf, other

    cs.LG cs.AI cs.IR

    Robust Reinforcement Learning Objectives for Sequential Recommender Systems

    Authors: Melissa Mozifian, Tristan Sylvain, Dave Evans, Lili Meng

    Abstract: Attention-based sequential recommendation methods have shown promise in accurately capturing users' evolving interests from their past interactions. Recent research has also explored the integration of reinforcement learning (RL) into these models, in addition to generating superior user representations. By framing sequential recommendation as an RL problem with reward signals, we can develop reco… ▽ More

    Submitted 17 April, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  2. arXiv:2012.03806  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

    Authors: Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White

    Abstract: This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and System" conference. Twelve leaders of the field took competing debate positions on the definition, viability, and importance of transferring skills from simulation to the real world in the context of robotics problems. The debaters also joined… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Summary of the "2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics" held in conjunction with "Robotics: Science and System 2020". Website: https://sim2real.github.io/

  3. arXiv:2012.02055  [pdf, other

    cs.RO cs.LG

    Intervention Design for Effective Sim2Real Transfer

    Authors: Melissa Mozifian, Amy Zhang, Joelle Pineau, David Meger

    Abstract: The goal of this work is to address the recent success of domain randomization and data augmentation for the sim2real setting. We explain this success through the lens of causal inference, positioning domain randomization and data augmentation as interventions on the environment which encourage invariance to irrelevant features. Such interventions include visual perturbations that have no effect o… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  4. arXiv:2011.01298  [pdf, other

    cs.RO cs.LG

    Sha** Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models

    Authors: Yuchen Wu, Melissa Mozifian, Florian Shkurti

    Abstract: The potential benefits of model-free reinforcement learning to real robotics systems are limited by its uninformed exploration that leads to slow convergence, lack of data-efficiency, and unnecessary interactions with the environment. To address these drawbacks we propose a method that combines reinforcement and imitation learning by sha** the reward function with a state-and-action-dependent po… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: submitted to ICRA 2021

  5. arXiv:1906.00410  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Domain Randomization Distributions for Training Robust Locomotion Policies

    Authors: Melissa Mozifian, Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek

    Abstract: Domain randomization (DR) is a successful technique for learning robust policies for robot systems, when the dynamics of the target robot system are unknown. The success of policies trained with domain randomization however, is highly dependent on the correct selection of the randomization distribution. The majority of success stories typically use real world data in order to carefully select the… ▽ More

    Submitted 16 September, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

  6. arXiv:1712.02294  [pdf, other

    cs.CV

    Joint 3D Proposal Generation and Object Detection from View Aggregation

    Authors: Jason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, Steven Waslander

    Abstract: We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high r… ▽ More

    Submitted 12 July, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Comments: For any inquiries contact aharakeh(at)uwaterloo(dot)ca