Skip to main content

Showing 1–12 of 12 results for author: Dauwels, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2305.19493  [pdf

    eess.AS

    MERLIon CCS Challenge Evaluation Plan

    Authors: Leibny Paola Garcia Perera, Y. H. Victoria Chua, Hexin Liu, Fei Ting Woon, Andy W. H. Khong, Justin Dauwels, Sanjeev Khudanpur, Suzy J. Styles

    Abstract: This paper introduces the inaugural Multilingual Everyday Recordings- Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge, focused on develo** robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom. Aligning closely with Interspeech 2023 th… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Evaluation plan for Interspeech 2023 special session "MERLIon"

  2. arXiv:2305.18925  [pdf, other

    eess.AS cs.CL cs.SD

    Investigating model performance in language identification: beyond simple error statistics

    Authors: Suzy J. Styles, Victoria Y. H. Chua, Fei Ting Woon, Hexin Liu, Leibny Paola Garcia Perera, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels

    Abstract: Language development experts need tools that can automatically identify languages from fluent, conversational speech, and provide reliable estimates of usage rates at the level of an individual recording. However, language identification systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023, 5 pages, 5 figures

  3. arXiv:2305.18881  [pdf, other

    eess.AS

    MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization

    Authors: Victoria Y. H. Chua, Hexin Liu, Leibny Paola Garcia Perera, Fei Ting Woon, **yi Wong, Xiangyu Zhang, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles

    Abstract: To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns. We present the MERLIon CCS challenge, featuring a first-of-its-kind Zoom video call dataset of parent-child sh… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech 2023, 5 pages, 2 figures, 3 tables

  4. arXiv:2208.02405  [pdf, other

    eess.SP

    Transformer Convolutional Neural Networks for Automated Artifact Detection in Scalp EEG

    Authors: Wei Yan Peh, Yuanyuan Yao, Justin Dauwels

    Abstract: It is well known that electroencephalograms (EEGs) often contain artifacts due to muscle activity, eye blinks, and various other causes. Detecting such artifacts is an essential first step toward a correct interpretation of EEGs. Although much effort has been devoted to semi-automated and automated artifact detection in EEG, the problem of artifact detection remains challenging. In this paper, we… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: This is an extension to a paper presented at the 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) Scottish Event Campus, Glasgow, UK, July 11-15, 2022

  5. arXiv:2208.00025  [pdf, other

    eess.SP

    Six-center Assessment of CNN-Transformer with Belief Matching Loss for Patient-independent Seizure Detection in EEG

    Authors: Wei Yan Peh, Prasanth Thangavel, Yuanyuan Yao, John Thomas, Yee Leng Tan, Justin Dauwels

    Abstract: Neurologists typically identify epileptic seizures from electroencephalograms (EEGs) by visual inspection. This process is often time-consuming, especially for EEG recordings that last hours or days. To expedite the process, a reliable, automated, and patient-independent seizure detector is essential. However, develo** a patient-independent seizure detector is challenging as seizures exhibit div… ▽ More

    Submitted 22 November, 2022; v1 submitted 29 July, 2022; originally announced August 2022.

    Comments: Submitting to IJNS

  6. arXiv:2203.03218  [pdf, other

    eess.AS cs.CL cs.SD

    Enhance Language Identification using Dual-mode Model with Knowledge Distillation

    Authors: Hexin Liu, Leibny Paola Garcia Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur

    Abstract: In this paper, we propose to employ a dual-mode framework on the x-vector self-attention (XSA-LID) model with knowledge distillation (KD) to enhance its language identification (LID) performance for both long and short utterances. The dual-mode XSA-LID model is trained by jointly optimizing both the full and short modes with their respective inputs being the full-length speech and its short clip e… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Submitted to Odyssey 2022

  7. arXiv:2107.05318  [pdf, other

    eess.IV cs.CV

    R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery

    Authors: Rongkai Zhang, Jiang Zhu, Zhiyuan Zha, Justin Dauwels, Bihan Wen

    Abstract: State-of-the-art image denoisers exploit various types of deep neural networks via deterministic training. Alternatively, very recent works utilize deep reinforcement learning for restoring images with diverse or unknown corruptions. Though deep reinforcement learning can generate effective policy networks for operator selection or architecture search in image restoration, how it is connected to t… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted by ICIP 2021

  8. Multi-center validation study of automated classification of pathological slowing in adult scalp electroencephalograms via frequency features

    Authors: Wei Yan Peh, John Thomas, Elham Bagheri, Rima Chaudhari, Sagar Karia, Rahul Rathakrishnan, Vinay Saini, Nilesh Shah, Rohit Srivastava, Yee-Leng Tan, Justin Dauwels

    Abstract: Pathological slowing in the electroencephalogram (EEG) is widely investigated for the diagnosis of neurological disorders. Currently, the gold standard for slowing detection is the visual inspection of the EEG by experts, which is time-consuming and subjective. To address those issues, we propose three automated approaches to detect slowing in EEG: Threshold-based Detecting System (TDS), Shallow L… ▽ More

    Submitted 26 January, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 24 pages. For submission to International Journal of Neural Systems (IJNS)

  9. arXiv:2008.13443  [pdf, other

    stat.ML cs.LG eess.SP

    On the Quality Requirements of Demand Prediction for Dynamic Public Transport

    Authors: Inon Peled, Kelvin Lee, Yu Jiang, Justin Dauwels, Francisco C. Pereira

    Abstract: As Public Transport (PT) becomes more dynamic and demand-responsive, it increasingly depends on predictions of transport demand. But how accurate need such predictions be for effective PT operation? We address this question through an experimental case study of PT trips in Metropolitan Copenhagen, Denmark, which we conduct independently of any specific prediction models. First, we simulate errors… ▽ More

    Submitted 6 November, 2021; v1 submitted 31 August, 2020; originally announced August 2020.

    Comments: 26 pages, 9 tables, 6 figures

  10. arXiv:1911.03667  [pdf, other

    cs.LG eess.SP stat.ML

    Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence Modeling

    Authors: Satyajit Neogi, Justin Dauwels

    Abstract: Conditional Random Fields (CRF) are frequently applied for labeling and segmenting sequence data. Morency et al. (2007) introduced hidden state variables in a labeled CRF structure in order to model the latent dynamics within class labels, thus improving the labeling performance. Such a model is known as Latent-Dynamic CRF (LDCRF). We present Factored LDCRF (FLDCRF), a structure that allows multip… ▽ More

    Submitted 12 November, 2019; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: To be submitted to Journal of Machine Learning Research (JMLR)

  11. arXiv:1907.11881  [pdf, other

    cs.CV cs.RO eess.IV stat.ML

    Context Model for Pedestrian Intention Prediction using Factored Latent-Dynamic Conditional Random Fields

    Authors: Satyajit Neogi, Michael Hoy, Kang Dang, Hang Yu, Justin Dauwels

    Abstract: Smooth handling of pedestrian interactions is a key requirement for Autonomous Vehicles (AV) and Advanced Driver Assistance Systems (ADAS). Such systems call for early and accurate prediction of a pedestrian's crossing/not-crossing behaviour in front of the vehicle. Existing approaches to pedestrian behaviour prediction make use of pedestrian motion, his/her location in a scene and static context… ▽ More

    Submitted 15 September, 2020; v1 submitted 27 July, 2019; originally announced July 2019.

    Comments: Accepted by IEEE Transactions on Intelligent Transportation Systems

  12. arXiv:1907.05274  [pdf, other

    cs.CV cs.LG eess.IV

    Affine Disentangled GAN for Interpretable and Robust AV Perception

    Authors: Letao Liu, Martin Saerbeck, Justin Dauwels

    Abstract: Autonomous vehicles (AV) have progressed rapidly with the advancements in computer vision algorithms. The deep convolutional neural network as the main contributor to this advancement has boosted the classification accuracy dramatically. However, the discovery of adversarial examples reveals the generalization gap between dataset and the real world. Furthermore, affine transformations may also con… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.