Skip to main content

Showing 1–7 of 7 results for author: Gan, W S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.05971  [pdf, other

    cs.CL

    Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents

    Authors: **g Yang Lee, Kong Aik Lee, Woon Seng Gan

    Abstract: In recent years, latent variable models, such as the Conditional Variational Auto Encoder (CVAE), have been applied to both personalized and empathetic dialogue generation. Prior work have largely focused on generating diverse dialogue responses that exhibit persona consistency and empathy. However, when it comes to the contextual coherence of the generated responses, there is still room for impro… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: Accepted at ICASSP 2022

  2. arXiv:2111.11363  [pdf, other

    cs.CL cs.AI

    DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation

    Authors: **g Yang Lee, Kong Aik Lee, Woon Seng Gan

    Abstract: The generation of personalized dialogue is vital to natural and human-like conversation. Typically, personalized dialogue generation models involve conditioning the generated response on the dialogue history and a representation of the persona/personality of the interlocutor. As it is impractical to obtain the persona/personality representations for every interlocutor, recent works have explored t… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted at ICAART 2022 as Full Paper

  3. arXiv:2108.03377  [pdf, other

    cs.CL cs.AI

    Generating Personalized Dialogue via Multi-Task Meta-Learning

    Authors: **g Yang Lee, Kong Aik Lee, Woon Seng Gan

    Abstract: Conventional approaches to personalized dialogue generation typically require a large corpus, as well as predefined persona information. However, in a real-world setting, neither a large corpus of training data nor persona information are readily available. To address these practical limitations, we propose a novel multi-task meta-learning approach which involves training a model to adapt to new p… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted at SemDial 2021 (PotsDial 2021)

    Journal ref: Proceedings of the 25th Workshop on the Semantics and Pragmatics of Dialogue - Full Papers, pp 88-97 (2021)

  4. arXiv:2107.10471  [pdf, ps, other

    eess.AS cs.AI cs.LG cs.SD eess.SP

    Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning

    Authors: Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Zhen Jian Lee, Douglas L. Jones, Woon Seng Gan

    Abstract: The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection. Conventional training of polyphonic sound event detection systems with binary cross-e… ▽ More

    Submitted 2 October, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021

  5. arXiv:2107.10469  [pdf, other

    eess.AS cs.AI cs.LG cs.SD eess.SP

    What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis

    Authors: Thi Ngoc Tho Nguyen, Karn N. Watcharasupat, Zhen Jian Lee, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

    Abstract: Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an additional challenge of assigning correct corresp… ▽ More

    Submitted 2 October, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted for the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021

    Journal ref: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop, pp. 120-124

  6. arXiv:2106.15190  [pdf, other

    eess.AS cs.AI cs.LG cs.SD eess.SP

    DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection

    Authors: Thi Ngoc Tho Nguyen, Karn Watcharasupat, Ngoc Khanh Nguyen, Douglas L. Jones, Woon Seng Gan

    Abstract: Sound event localization and detection consists of two subtasks which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses magnitude or phase differences between microphones to estimate source directions. Therefore, it is often difficult to joi… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 5 pages, Technical Report for DCASE 2021 Challenge Task 3. arXiv admin note text overlap with arXiv:2110.00275

  7. arXiv:1911.11373  [pdf, other

    eess.AS cs.SD

    A two-step system for sound event localization and detection

    Authors: T. N. T. Nguyen, D. L. Jones, R. Ranjan, S. Jayabalan, W. S. Gan

    Abstract: Sound event detection and sound event localization requires different features from audio input signals. While sound event detection mainly relies on time-frequency patterns to distinguish different event classes, sound event localization uses magnitude or phase differences between microphones to estimate source directions. Therefore, we propose a two-step system to do sound event localization and… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 5 pages