Skip to main content

Showing 1–9 of 9 results for author: Jabaian, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01315  [pdf, other

    cs.CL

    Language Portability Strategies for Open-domain Dialogue with Pre-trained Language Models from High to Low Resource Languages

    Authors: Ahmed Njifenjou, Virgile Sucal, Bassam Jabaian, Fabrice Lefèvre

    Abstract: In this paper we propose a study of linguistic portability strategies of large pre-trained language models (PLMs) used for open-domain dialogue systems in a high-resource language for this task. In particular the target low-resource language (L_T) will be simulated with French, as it lacks of task-specific resources and allows our human evaluation, when the source language (L_S) is English. For ob… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: The 13th International Workshop on Spoken Dialogue Systems Technology (IWSDS '23)

  2. arXiv:2406.18460  [pdf, other

    cs.CL cs.AI cs.HC

    Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation

    Authors: Ahmed Njifenjou, Virgile Sucal, Bassam Jabaian, Fabrice Lefèvre

    Abstract: Recently, various methods have been proposed to create open-domain conversational agents with Large Language Models (LLMs). These models are able to answer user queries, but in a one-way Q&A format rather than a true conversation. Fine-tuning on particular datasets is the usual way to modify their style to increase conversational ability, but this is expensive and usually only available in a few l… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Updated version of a paper originally submitted at SIGDIAL 2023

  3. arXiv:2406.12141  [pdf, other

    cs.CL cs.SD eess.AS

    A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding

    Authors: Gaëlle Laperrière, Sahar Ghannay, Bassam Jabaian, Yannick Estève

    Abstract: Self-Supervised Learning is vastly used to efficiently represent speech for Spoken Language Understanding, gradually replacing conventional approaches. Meanwhile, textual SSL models are proposed to encode language-agnostic semantics. SAMU-XLSR framework employed this semantic information to enrich multilingual speech representations. A recent study investigated SAMU-XLSR in-domain semantic enrichm… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: In Proceedings of Interspeech 2024

  4. Semantic enrichment towards efficient speech representations

    Authors: Gaëlle Laperrière, Ha Nguyen, Sahar Ghannay, Bassam Jabaian, Yannick Estève

    Abstract: Over the past few years, self-supervised learned speech representations have emerged as fruitful replacements for conventional surface representations when solving Spoken Language Understanding (SLU) tasks. Simultaneously, multilingual models trained on massive textual data were introduced to encode language agnostic semantics. Recently, the SAMU-XLSR approach introduced a way to make profit from… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: INTERSPEECH 2023

    Journal ref: Proc. Interspeech 2023, 705-709

  5. arXiv:2110.13213  [pdf, other

    cs.CL cs.HC

    Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users

    Authors: Matthieu Riou, Bassam Jabaian, Stéphane Huet, Fabrice Lefèvre

    Abstract: Design of dialogue systems has witnessed many advances lately, yet acquiring huge set of data remains an hindrance to their fast development for a new task or language. Besides, training interactive systems with batch data is not satisfactory. On-line learning is pursued in this paper as a convenient way to alleviate these difficulties. After the system modules are initiated, a single process hand… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:1810.00924

  6. arXiv:2106.13045  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Where are we in semantic concept extraction for Spoken Language Understanding?

    Authors: Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève

    Abstract: Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with the emergence of end-to-end neural approaches. Spoken language understanding refers to natural language processing tasks related to semantic extraction from speech signal, like named entity recognition from speech or slot filling task in a context of human-machine dialogue. Classically, SLU tasks were… ▽ More

    Submitted 11 October, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted in the SPECOM 2021 conference

  7. arXiv:2002.05955  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    A Data Efficient End-To-End Spoken Language Understanding Architecture

    Authors: Marco Dinarelli, Nikita Kapoor, Bassam Jabaian, Laurent Besacier

    Abstract: End-to-end architectures have been recently proposed for spoken language understanding (SLU) and semantic parsing. Based on a large amount of data, those models learn jointly acoustic and linguistic-sequential features. Such architectures give very good results in the context of domain, intent and slot detection, their application in a more complex semantic chunking and tagging task is less easy.… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Accepted to ICASSP 2020

  8. arXiv:1810.00924  [pdf, other

    cs.CL cs.LG

    Joint On-line Learning of a Zero-shot Spoken Semantic Parser and a Reinforcement Learning Dialogue Manager

    Authors: Matthieu Riou, Bassam Jabaian, Stéphane Huet, Fabrice Lefèvre

    Abstract: Despite many recent advances for the design of dialogue systems, a true bottleneck remains the acquisition of data required to train its components. Unlike many other language processing applications, dialogue systems require interactions with users, therefore it is complex to develop them with pre-recorded data. Building on previous works, on-line learning is pursued here as a most convenient way… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

  9. arXiv:1702.06510  [pdf, ps, other

    cs.IR cs.CL

    Algorithmes de classification et d'optimisation: participation du LIA/ADOC á DEFT'14

    Authors: Luis Adrián Cabrera-Diego, Stéphane Huet, Bassam Jabaian, Alejandro Molina, Juan-Manuel Torres-Moreno, Marc El-Bèze, Barthélémy Durette

    Abstract: This year, the DEFT campaign (Défi Fouilles de Textes) incorporates a task which aims at identifying the session in which articles of previous TALN conferences were presented. We describe the three statistical systems developed at LIA/ADOC for this task. A fusion of these systems enables us to obtain interesting results (micro-precision score of 0.76 measured on the test corpus)

    Submitted 21 February, 2017; originally announced February 2017.

    Comments: 8 pages, 3 tables, Conference paper (in French)