Skip to main content

Showing 1–10 of 10 results for author: Khamassi, M

.
  1. arXiv:2403.20177  [pdf

    cs.AI cs.RO q-bio.NC

    Artificial consciousness. Some logical and conceptual preliminaries

    Authors: K. Evers, M. Farisco, R. Chatila, B. D. Earp, I. T. Freire, F. Hamker, E. Nemeth, P. F. M. J. Verschure, M. Khamassi

    Abstract: Is artificial consciousness theoretically possible? Is it plausible? If so, is it technically feasible? To make progress on these questions, it is necessary to lay some groundwork clarifying the logical and empirical conditions for artificial consciousness to arise and the meaning of relevant terms involved. Consciousness is a polysemic word: researchers from different fields, including neuroscien… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  2. arXiv:2403.02514  [pdf, other

    cs.RO cs.AI cs.LG

    Purpose for Open-Ended Learning Robots: A Computational Taxonomy, Definition, and Operationalisation

    Authors: Gianluca Baldassarre, Richard J. Duro, Emilio Cartoni, Mehdi Khamassi, Alejandro Romero, Vieri Giuliano Santucci

    Abstract: Autonomous open-ended learning (OEL) robots are able to cumulatively acquire new skills and knowledge through direct interaction with the environment, for example relying on the guidance of intrinsic motivations and self-generated goals. OEL robots have a high relevance for applications as they can use the autonomously acquired knowledge to accomplish tasks relevant for their human users. OEL robo… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 6 figures

  3. arXiv:2005.06223  [pdf, other

    cs.AI cs.LG cs.NE cs.RO

    DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics

    Authors: Stephane Doncieux, Nicolas Bredeche, Léni Le Goff, Benoît Girard, Alexandre Coninx, Olivier Sigaud, Mehdi Khamassi, Natalia Díaz-Rodríguez, David Filliat, Timothy Hospedales, A. Eiben, Richard Duro

    Abstract: Robots are still limited to controlled conditions, that the robot designer knows with enough details to endow the robot with the appropriate models or behaviors. Learning algorithms add some flexibility with the ability to discover the appropriate behavior given either some demonstrations or a reward to guide its exploration with a reinforcement learning algorithm. Reinforcement learning algorithm… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  4. arXiv:2005.03987  [pdf, other

    cs.RO

    Co** with the variability in humans reward during simulated human-robot interactions through the coordination of multiple learning strategies

    Authors: Rémi Dromnelle, Benoît Girard, Erwan Renaudo, Raja Chatila, Mehdi Khamassi

    Abstract: An important current challenge in Human-Robot Interaction (HRI) is to enable robots to learn on-the-fly from human feedback. However, humans show a great variability in the way they reward robots. We propose to address this issue by enabling the robot to combine different learning strategies, namely model-based (MB) and model-free (MF) reinforcement learning. We simulate two HRI scenarios: a simpl… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 6 pages, 5 figures, written for the RO-MAN 2020 conference. arXiv admin note: text overlap with arXiv:2004.14698

  5. arXiv:2004.14698  [pdf, other

    cs.RO cs.LG

    How to reduce computation time while sparing performance during robot navigation? A neuro-inspired architecture for autonomous shifting between model-based and model-free learning

    Authors: Rémi Dromnelle, Erwan Renaudo, Guillaume Pourcel, Raja Chatila, Benoît Girard, Mehdi Khamassi

    Abstract: Taking inspiration from how the brain coordinates multiple learning systems is an appealing strategy to endow robots with more flexibility. One of the expected advantages would be for robots to autonomously switch to the least costly system when its performance is satisfying. However, to our knowledge no study on a real robot has yet shown that the measured computational cost is reduced while perf… ▽ More

    Submitted 16 July, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: 12 pages, 4 figures ; Living Machines 2020

  6. arXiv:1812.00253  [pdf, other

    cs.RO cs.CV cs.LG

    A Deep Learning Approach for Multi-View Engagement Estimation of Children in a Child-Robot Joint Attention task

    Authors: Jack Hadfield, Georgia Chalvatzaki, Petros Koutras, Mehdi Khamassi, Costas S. Tzafestas, Petros Maragos

    Abstract: In this work we tackle the problem of child engagement estimation while children freely interact with a robot in their room. We propose a deep-based multi-view solution that takes advantage of recent developments in human pose detection. We extract the child's pose from different RGB-D cameras placed elegantly in the room, fuse the results and feed them to a deep neural network trained for classif… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: 7 pages, 6 figures

  7. Prioritized Swee** Neural DynaQ with Multiple Predecessors, and Hippocampal Replays

    Authors: Lise Aubin, Mehdi Khamassi, Benoît Girard

    Abstract: During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under limited replay budget, a pri… ▽ More

    Submitted 13 August, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Living Machines 2018 (Paris, France)

  8. Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task

    Authors: Guillaume Viejo, Benoît Girard, Emmanuel Procyk, Mehdi Khamassi

    Abstract: Accumulating evidence suggest that human behavior in trial-and-error learning tasks based on decisions between discrete actions may involve a combination of reinforcement learning (RL) and working-memory (WM). While the understanding of brain activity at stake in this type of tasks often involve the comparison with non-human primate neurophysiological results, it is not clear whether monkeys use s… ▽ More

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: Behavioural Brain Research, Elsevier, 2017

  9. Sustainable computational science: the ReScience initiative

    Authors: Nicolas P. Rougier, Konrad Hinsen, Frédéric Alexandre, Thomas Arildsen, Lorena Barba, Fabien C. Y. Benureau, C. Titus Brown, Pierre de Buyl, Ozan Caglayan, Andrew P. Davison, Marc André Delsuc, Georgios Detorakis, Alexandra K. Diem, Damien Drix, Pierre Enel, Benoît Girard, Olivia Guest, Matt G. Hall, Rafael Neto Henriques, Xavier Hinaut, Kamil S Jaron, Mehdi Khamassi, Almar Klein, Tiina Manninen, Pietro Marchesi , et al. (20 additional authors not shown)

    Abstract: Computer science offers a large set of tools for prototy**, writing, running, testing, validating, sharing and reproducing results, however computational science lags behind. In the best case, authors may provide their source code as a compressed archive and they may feel confident their research is reproducible. But this is not exactly true. James Buckheit and David Donoho proposed more than tw… ▽ More

    Submitted 11 November, 2017; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: 8 pages, 1 figure

    Journal ref: PeerJ Computer Science 3:e142 (2017)

  10. arXiv:1610.01986  [pdf, other

    cs.LG

    Active exploration in parameterized reinforcement learning

    Authors: Mehdi Khamassi, Costas Tzafestas

    Abstract: Online model-free reinforcement learning (RL) methods with continuous actions are playing a prominent role when dealing with real-world applications such as Robotics. However, when confronted to non-stationary environments, these methods crucially rely on an exploration-exploitation trade-off which is rarely dynamically and automatically adjusted to changes in the environment. Here we propose an a… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: Submitted to EWRL2016