Skip to main content

Showing 1–3 of 3 results for author: Mohammedalamen, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.06819  [pdf, other

    cs.LG

    Monitored Markov Decision Processes

    Authors: Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling

    Abstract: In reinforcement learning (RL), an agent learns to perform a task by interacting with an environment and receiving feedback (a numerical reward) for its actions. However, the assumption that rewards are always observable is often not applicable in real-world problems. For example, the agent may need to ask a human to supervise its actions or activate a monitoring system to receive feedback. There… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: AAMAS 2024, Main Track

  2. arXiv:2110.15907  [pdf, other

    cs.AI cs.LG

    Learning to Be Cautious

    Authors: Montaser Mohammedalamen, Dustin Morrill, Alexander Sieusahai, Yash Satsangi, Michael Bowling

    Abstract: A key challenge in the field of reinforcement learning is to develop agents that behave cautiously in novel situations. It is generally impossible to anticipate all situations that an autonomous system may face or what behavior would best avoid bad outcomes. An agent that could learn to be cautious would overcome this challenge by discovering for itself when and how to behave cautiously. In contra… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

  3. arXiv:1901.04772  [pdf, other

    cs.AI

    Transfer Learning for Prosthetics Using Imitation Learning

    Authors: Montaser Mohammedalamen, Waleed D. Khamies, Benjamin Rosman

    Abstract: In this paper, We Apply Reinforcement learning (RL) techniques to train a realistic biomechanical model to work with different people and on different walking environments. We benchmarking 3 RL algorithms: Deep Deterministic Policy Gradient (DDPG), Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) in OpenSim environment, Also we apply imitation learning to a prosthetic… ▽ More

    Submitted 15 January, 2019; originally announced January 2019.

    Comments: Workshop paper, Black in AI, NeurIPS 2018

    Journal ref: Black in AI Workshop, NeurIPS 2018