Showing 1–15 of 15 results for author: Cuayahuitl, H

  1. arXiv:2308.12792  [pdf, other]

    cs.SD eess.AS

    Sparks of Large Audio Models: A Survey and Outlook

    Authors: Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller

    Abstract: This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the field of audio signal processing. Audio processing, with its diverse signal representations and a wide range of sources--from human voices to musical instruments and environmental sounds--poses challenges distinct from those found in traditional Natural Language Pr…

    Submitted 21 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Under review, Repo URL: https://github.com/EmulationAI/awesome-large-audio-models

  2. arXiv:2303.11607  [pdf, other]

    cs.CL cs.SD eess.AS

    Transformers in Speech Processing: A Survey

    Authors: Siddique Latif, Aun Zaidi, Heriberto Cuayahuitl, Fahad Shamshad, Moazzam Shoukat, Junaid Qadir

    Abstract: The remarkable success of transformers in the field of natural language processing has sparked the interest of the speech-processing community, leading to an exploration of their potential for modeling long-range dependencies within speech sequences. Recently, transformers have gained prominence across various speech-related domains, including automatic speech recognition, speech synthesis, speech…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: under-review

  3. arXiv:2208.00478  [pdf, other]

    cs.LG cs.RO

    Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination

    Authors: Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

    Abstract: Learning robotic tasks in the real world is still highly challenging and effective practical solutions remain to be found. Traditional methods used in this area are imitation learning and reinforcement learning, but they both have limitations when applied to real robots. Combining reinforcement learning with pre-collected demonstrations is a promising approach that can help in learning control pol…

    Submitted 31 July, 2022; originally announced August 2022.

  4. arXiv:2112.05621  [pdf, other]

    cs.RO

    Reward-Based Environment States for Robot Manipulation Policy Learning

    Authors: Cédérick Mouliets, Isabelle Ferrané, Heriberto Cuayáhuitl

    Abstract: Training robot manipulation policies is a challenging and open problem in robotics and artificial intelligence. In this paper we propose a novel and compact state representation based on the rewards predicted from an image-based task success classifier. Our experiments, using the Pepper robot in simulation with two deep reinforcement learning algorithms on a grab-and-lift task, reveal that our pro…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: NeurIPS Workshop on Deployable Decision Making in Embodied Systems, 2021

  5. arXiv:2108.03222  [pdf, other]

    cs.RO cs.LG

    A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

    Authors: Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

    Abstract: Deep Reinforcement Learning (DRL) is a promising approach for teaching robots new behaviour. However, one of its main limitations is the need for carefully hand-coded reward signals by an expert. We argue that it is crucial to automate the reward learning process so that new skills can be taught to robots by their users. To address such automation, we consider task success classifiers using visual…

    Submitted 6 August, 2021; originally announced August 2021.

  6. arXiv:2107.00722  [pdf, other]

    cs.RO cs.AI cs.LG

    Neural Task Success Classifiers for Robotic Manipulation from Few Real Demonstrations

    Authors: Abdalkarim Mohtasib, Amir Ghalamzan E., Nicola Bellotto, Heriberto Cuayáhuitl

    Abstract: Robots learning a new manipulation task from a small amount of demonstrations are increasingly demanded in different workspaces. A classifier model assessing the quality of actions can predict the successful completion of a task, which can be used by intelligent agents for action-selection. This paper presents a novel classifier that learns to classify task completion only from a few demonstration…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 8 pages

  7. arXiv:2101.00240  [pdf, other]

    cs.SD cs.LG eess.AS

    A Survey on Deep Reinforcement Learning for Audio-Based Applications

    Authors: Siddique Latif, Heriberto Cuayáhuitl, Farrukh Pervez, Fahad Shamshad, Hafiz Shehbaz Ali, Erik Cambria

    Abstract: Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence (AI) by endowing autonomous systems with high levels of understanding of the real world. Currently, deep learning (DL) is enabling DRL to effectively solve various intractable problems in various fields. Most importantly, DRL algorithms are also being employed in audio signal processing to learn direc…

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: Under Review

  8. Ensemble-Based Deep Reinforcement Learning for Chatbots

    Authors: Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Yongjin Cho, Sungja Choi, Satish Indurthi, Seunghak Yu, Hyungtak Choi, Inchul Hwang, Jihie Kim

    Abstract: Trainable chatbots that exhibit fluent and human-like conversations remain a big challenge in artificial intelligence. Deep Reinforcement Learning (DRL) is promising for addressing this challenge, but its successful application remains an open question. This article describes a novel ensemble-based approach applied to value-based DRL chatbots, which use finite action sets as a form of meaning repr…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: text overlap with arXiv:1908.10331

  9. A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots

    Authors: Heriberto Cuayáhuitl

    Abstract: The deep supervised and reinforcement learning paradigms (among others) have the potential to endow interactive multimodal social robots with the ability of acquiring skills autonomously. But it is still not very clear yet how they can be best deployed in real world applications. As a step in this direction, we propose a deep learning-based approach for efficiently training a humanoid robot to pla…

    Submitted 27 August, 2019; originally announced August 2019.

  10. arXiv:1908.10331  [pdf, other]

    cs.AI cs.CL cs.LG cs.NE

    Deep Reinforcement Learning for Chatbots Using Clustered Actions and Human-Likeness Rewards

    Authors: Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Sungja Choi, Inchul Hwang, Jihie Kim

    Abstract: Training chatbots using the reinforcement learning paradigm is challenging due to high-dimensional states, infinite action spaces and the difficulty in specifying the reward function. We address such problems using clustered actions instead of infinite actions, and a simple but promising reward function based on human-likeness scores derived from human-human dialogue data. We train Deep Reinforcem…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: In International Joint Conference of Neural Networks (IJCNN), 2019

  11. arXiv:1812.00350  [pdf, ps, other]

    cs.CL cs.AI

    A Study on Dialogue Reward Prediction for Open-Ended Conversational Agents

    Authors: Heriberto Cuayáhuitl, Seonghan Ryu, Donghyeon Lee, Jihie Kim

    Abstract: The amount of dialogue history to include in a conversational agent is often underestimated and/or set in an empirical and thus possibly naive way. This suggests that principled investigations into optimal context windows are urgently needed given that the amount of dialogue history and corresponding representations can play an important role in the overall performance of a conversational system.…

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: In NeurIPS Workshop on Conversational AI: "Today's Practice and Tomorrow's Potential", December 2018

  12. arXiv:1611.08675  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Deep Reinforcement Learning for Multi-Domain Dialogue Systems

    Authors: Heriberto Cuayáhuitl, Seunghak Yu, Ashley Williamson, Jacob Carse

    Abstract: Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems. We propose a method for multi-domain dialogue policy learning---termed NDQN, and apply it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations r…

    Submitted 26 November, 2016; originally announced November 2016.

    Comments: NIPS Workshop on Deep Reinforcement Learning, 2016

  13. arXiv:1611.08666  [pdf, ps, other]

    cs.LG cs.AI cs.RO

    Training an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning

    Authors: Heriberto Cuayáhuitl, Guillaume Couly, Clément Olalainty

    Abstract: Training robots to perceive, act and communicate using multiple modalities still represents a challenging problem, particularly if robots are expected to learn efficiently from small sets of example interactions. We describe a learning approach as a step in this direction, where we teach a humanoid robot how to play the game of noughts and crosses. Given that multiple multimodal skills can be trai…

    Submitted 26 November, 2016; originally announced November 2016.

    Comments: NIPS Workshop on Future of Interactive Learning Machines, 2016

  14. arXiv:1601.04574  [pdf, other]

    cs.AI cs.LG

    SimpleDS: A Simple Deep Reinforcement Learning Dialogue System

    Authors: Heriberto Cuayáhuitl

    Abstract: This paper presents 'SimpleDS', a simple and publicly available dialogue system trained with deep reinforcement learning. In contrast to previous reinforcement learning dialogue systems, this system avoids manual feature engineering by performing action selection directly from raw text of the last system and (noisy) user responses. Our initial results, in the restaurant domain, show that it is ind…

    Submitted 18 January, 2016; originally announced January 2016.

    Comments: International Workshop on Spoken Dialogue Systems (IWSDS), 2016

  15. arXiv:1511.08099  [pdf, other]

    cs.AI cs.LG

    Strategic Dialogue Management via Deep Reinforcement Learning

    Authors: Heriberto Cuayáhuitl, Simon Keizer, Oliver Lemon

    Abstract: Artificially intelligent agents equipped with strategic skills that can negotiate during their interactions with other natural or artificial agents are still underdeveloped. This paper describes a successful application of Deep Reinforcement Learning (DRL) for training intelligent agents with strategic conversational skills, in a situated dialogue setting. Previous studies have modelled the behavi…

    Submitted 25 November, 2015; originally announced November 2015.

    Comments: NIPS'15 Workshop on Deep Reinforcement Learning