Skip to main content

Showing 1–10 of 10 results for author: Ahmed, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.12609  [pdf, other

    eess.AS cs.SD

    Mamba in Speech: Towards an Alternative to Self-Attention

    Authors: Xiangyu Zhang, Qiquan Zhang, Hexin Liu, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps

    Abstract: Transformer and its derivatives have achieved success in diverse tasks across computer vision, natural language processing, and speech processing. To reduce the complexity of computations within the multi-head self-attention mechanism in Transformer, Selective State Space Models (i.e., Mamba) were proposed as an alternative. Mamba exhibited its effectiveness in natural language processing and comp… ▽ More

    Submitted 30 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2402.13276  [pdf, other

    eess.AS cs.AI cs.SD

    When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

    Authors: Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

    Abstract: Depression is a critical concern in global mental health, prompting extensive research into AI-based detection methods. Among various AI technologies, Large Language Models (LLMs) stand out for their versatility in mental healthcare applications. However, their primary limitation arises from their exclusive dependence on textual input, which constrains their overall capabilities. Furthermore, the… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  3. arXiv:2311.07037  [pdf, other

    eess.AS cs.AI cs.CL

    Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method

    Authors: Mostafa Shahin, Julien Epps, Beena Ahmed

    Abstract: The automatic identification and analysis of pronunciation errors, known as Mispronunciation Detection and Diagnosis (MDD) plays a crucial role in Computer Aided Pronunciation Learning (CAPL) tools such as Second-Language (L2) learning or speech therapy applications. Existing MDD methods relying on analysing phonemes can only detect categorical errors of phonemes that have an adequate amount of tr… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  4. arXiv:2310.10922  [pdf, other

    cs.CL cs.SD eess.AS

    Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio

    Authors: Antoni Dimitriadis, Siqi Pan, Vidhyasaharan Sethu, Beena Ahmed

    Abstract: Self-supervised learning has been used to leverage unlabelled data, improving accuracy and generalisation of speech systems through the training of representation models. While many recent works have sought to produce effective representations across a variety of acoustic domains, languages, modalities and even simultaneous speakers, these studies have all been limited to single-channel audio reco… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  5. arXiv:2211.07769  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations

    Authors: Renee Lu, Mostafa Shahin, Beena Ahmed

    Abstract: Children's speech recognition is a vital, yet largely overlooked domain when building inclusive speech technologies. The major challenge impeding progress in this domain is the lack of adequate child speech corpora; however, recent advances in self-supervised learning have created a new opportunity for overcoming this problem of data scarcity. In this paper, we leverage self-supervised adult speec… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Under-review @ Speech Communication Journal

  6. arXiv:2210.10231  [pdf, other

    cs.SD cs.CL eess.AS

    Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning

    Authors: Mostafa Shahin, Beena Ahmed, Julien Epps

    Abstract: One of the major challenges in acoustic modelling of child speech is the rapid changes that occur in the children's articulators as they grow up, their differing growth rates and the subsequent high variability in the same age group. These high acoustic variations along with the scarcity of child speech corpora have impeded the development of a reliable speech recognition system for children. In t… ▽ More

    Submitted 6 November, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP2023

  7. arXiv:2102.04300  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning Models May Spuriously Classify Covid-19 from X-ray Images Based on Confounders

    Authors: Kaoutar Ben Ahmed, Lawrence O. Hall, Dmitry B. Goldgof, Gregory M. Goldgof, Rahul Paul

    Abstract: Identifying who is infected with the Covid-19 virus is critical for controlling its spread. X-ray machines are widely available worldwide and can quickly provide images that can be used for diagnosis. A number of recent studies claim it may be possible to build highly accurate models, using deep learning, to detect Covid-19 from chest X-ray images. This paper explores the robustness and generaliza… ▽ More

    Submitted 8 January, 2021; originally announced February 2021.

  8. arXiv:2002.11188  [pdf, other

    cs.NI cs.IR eess.AS

    IoT Based Real Time Noise Map** System for Urban Sound Pollution Study

    Authors: Sakib Ahmed, Touseef Saleh Bin Ahmed, Sumaiya Jafreen, Jannatul Tajrin, Jia Uddin

    Abstract: This paper describes the development of a system that enables real time data visualization via a webapp regarding sound intensity using multiple node devices connected through internet. The prototypes were realized using ATmega328 (Arduino Nano) and ESP8266 hardware modules, NodeMCU Arduino wrapper library, Google maps and firebase API along with JavaScript webapp. System architecture is such that… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Appendix by Sakib Ahmed Accepted as Conference Paper at ICIEV and icIVPR, 2018, Student Conference on Informatics, Electronics & Vision (SCIEV): Paper ID 175

  9. arXiv:1803.02159  [pdf, other

    eess.SY math.OC

    Exogenous Approach to Grid Cost Allocation in Peer-to-Peer Electricity Markets

    Authors: T. Baroche, P. Pinson, R. Le Goff Latimier., H. Ben Ahmed

    Abstract: The deployment of distributed energy resources, combined with a more proactive demand side, is inducing a new paradigm in power system operation and electricity markets. Within a consumer-centric market framework, peer-to-peer approaches have gained substantial interest. Peer-to-peer markets rely on multi-bilateral direct negotiation among all players to match supply and demand, and with product d… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

  10. arXiv:1404.6389  [pdf, other

    eess.SY

    Computing an Optimal Control Policy for an Energy Storage

    Authors: Pierre Haessig, Thibaut Kovaltchouk, Bernard Multon, Hamid Ben Ahmed, Stéphane Lascaud

    Abstract: We introduce StoDynProg, a small library created to solve Optimal Control problems arising in the management of Renewable Power Sources, in particular when coupled with an Energy Storage System. The library implements generic Stochastic Dynamic Programming (SDP) numerical methods which can solve a large class of Dynamic Optimization problems. We demonstrate the library capabilities with a prototyp… ▽ More

    Submitted 25 April, 2014; originally announced April 2014.

    Comments: Part of the Proceedings of the 6th European Conference on Python in Science (EuroSciPy 2013), Pierre de Buyl and Nelle Varoquaux editors, (2014)

    Report number: euroscipy-proceedings2013-08