Showing 1–13 of 13 results for author: Haidar, M A

  1. arXiv:2309.14239  [pdf]

    q-bio.TO

    Simulation-Based Design of Bicuspidization of the Aortic Valve

    Authors: Alexander D. Kaiser, Moussa A. Haidar, Perry S. Choi, Amit Sharir, Alison L. Marsden, Michael R. Ma

    Abstract: Objective: Severe congenital aortic valve pathology in the growing patient remains a challenging clinical scenario. Bicuspidization of the diseased aortic valve has proven to be a promising repair technique with acceptable durability. However, most understanding of the procedure is empirical and retrospective. This work seeks to design the optimal gross morphology associated with surgical bicuspid…

    Submitted 25 September, 2023; originally announced September 2023.

    MSC Class: 92C35 (Primary); 92C50; 92C32; 76Z05 (Secondary)

    ACM Class: J.3.1

  2. arXiv:2206.11157  [pdf, other]

    eess.AS

    Conformer with dual-mode chunked attention for joint online and offline ASR

    Authors: Felix Weninger, Marco Gaudesi, Md Akmal Haidar, Nicola Ferri, Jesús Andrés-Ferrer, Puming Zhan

    Abstract: In this paper, we present an in-depth study on online attention mechanisms and distillation techniques for dual-mode (i.e., joint online and offline) ASR using the Conformer Transducer. In the dual-mode Conformer Transducer model, layers can function in online or offline mode while sharing parameters, and in-place knowledge distillation from offline to online mode is applied in training to improve…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: To appear in INTERSPEECH 2022

  3. arXiv:2204.07674  [pdf, other]

    cs.CL

    CILDA: Contrastive Data Augmentation using Intermediate Layer Knowledge Distillation

    Authors: Md Akmal Haidar, Mehdi Rezagholizadeh, Abbas Ghaddar, Khalil Bibi, Philippe Langlais, Pascal Poupart

    Abstract: Knowledge distillation (KD) is an efficient framework for compressing large-scale pre-trained language models. Recent years have seen a surge of research aiming to improve KD by leveraging Contrastive Learning, Intermediate Layer Distillation, Data Augmentation, and Adversarial Training. In this work, we propose a learning based data augmentation technique tailored for knowledge distillation, call…

    Submitted 15 April, 2022; originally announced April 2022.

  4. arXiv:2109.10164  [pdf, other]

    cs.CL

    RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation

    Authors: Md Akmal Haidar, Nithin Anchuri, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart

    Abstract: Intermediate layer knowledge distillation (KD) can improve the standard KD technique (which only targets the output of teacher and student models) especially over large pre-trained language models. However, intermediate layer distillation suffers from excessive computational burdens and engineering efforts required for setting up a proper layer mapping. To address these problems, we propose a RAnd…

    Submitted 1 October, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

  5. arXiv:2103.13329  [pdf, other]

    eess.AS cs.CL cs.SD

    Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial Networks

    Authors: Md Akmal Haidar, Mehdi Rezagholizadeh

    Abstract: Adversarial training of end-to-end (E2E) ASR systems using generative adversarial networks (GAN) has recently been explored for low-resource ASR corpora. GANs help to learn the true data representation through a two-player min-max game. However, training an E2E ASR model using a large ASR corpus with a GAN framework has never been explored, because it might take excessively long time due to high-v…

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: Accepted in ICASSP 2021 conference

  6. arXiv:2103.09903  [pdf, other]

    cs.AI cs.LG

    Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation

    Authors: Md Akmal Haidar, Chao Xing, Mehdi Rezagholizadeh

    Abstract: End-to-end automatic speech recognition (ASR), unlike conventional ASR, does not have modules to learn the semantic representation from speech encoder. Moreover, the higher frame-rate of speech representation prevents the model to learn the semantic representation properly. Therefore, the models that are constructed by the lower frame-rate of speech encoder lead to better performance. For Transfor…

    Submitted 17 March, 2021; originally announced March 2021.

  7. arXiv:2011.05449  [pdf, other]

    cs.CL

    From Unsupervised Machine Translation To Adversarial Text Generation

    Authors: Ahmad Rashid, Alan Do-Omri, Md. Akmal Haidar, Qun Liu, Mehdi Rezagholizadeh

    Abstract: We present a self-attention based bilingual adversarial text generator (B-GAN) which can learn to generate text from the encoder representation of an unsupervised neural machine translation system. B-GAN is able to generate a distributed latent space representation which can be paired with an attention based decoder to generate fluent sentences. When trained on an encoder shared between two langua…

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Accepted at ICASSP 2020

  8. arXiv:1911.03604  [pdf, other]

    cs.CL cs.SD eess.AS

    A Simplified Fully Quantized Transformer for End-to-end Speech Recognition

    Authors: Alex Bie, Bharat Venkitesh, Joao Monteiro, Md. Akmal Haidar, Mehdi Rezagholizadeh

    Abstract: While significant improvements have been made in recent years in terms of end-to-end automatic speech recognition (ASR) performance, such improvements were obtained through the use of very large neural networks, unfit for embedded use on edge devices. That being said, in this paper, we work on simplifying and compressing Transformer-based encoder-decoder architectures for the end-to-end ASR task.…

    Submitted 24 March, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Submitted to IEEE Signal Processing Letters. Minor changes in Section 3

  9. arXiv:1910.06720  [pdf, other]

    cs.CL cs.LG

    Improving Word Embedding Factorization for Compression Using Distilled Nonlinear Neural Decomposition

    Authors: Vasileios Lioutas, Ahmad Rashid, Krtin Kumar, Md Akmal Haidar, Mehdi Rezagholizadeh

    Abstract: Word-embeddings are vital components of Natural Language Processing (NLP) models and have been extensively explored. However, they consume a lot of memory which poses a challenge for edge deployment. Embedding matrices, typically, contain most of the parameters for language models and about a third for machine translation systems. In this paper, we propose Distilled Embedding, an (input/output) em…

    Submitted 10 November, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted at Findings of EMNLP 2020

  10. arXiv:1905.01976  [pdf, other]

    cs.CL

    TextKD-GAN: Text Generation using KnowledgeDistillation and Generative Adversarial Networks

    Authors: Md. Akmal Haidar, Mehdi Rezagholizadeh

    Abstract: Text generation is of particular interest in many NLP applications such as machine translation, language modeling, and text summarization. Generative adversarial networks (GANs) achieved a remarkable success in high quality image generation in computer vision, and recently, GANs have gained lots of interest from the NLP community as well. However, achieving similar success in NLP would be more chal…

    Submitted 23 April, 2019; originally announced May 2019.

    Comments: arXiv admin note: text overlap with arXiv:1904.07293

    Journal ref: 32nd Canadian Conference on Artificial Intelligence 2019

  11. arXiv:1904.07293  [pdf, other]

    cs.CL cs.LG

    Latent Code and Text-based Generative Adversarial Networks for Soft-text Generation

    Authors: Md. Akmal Haidar, Mehdi Rezagholizadeh, Alan Do-Omri, Ahmad Rashid

    Abstract: Text generation with generative adversarial networks (GANs) can be divided into the text-based and code-based categories according to the type of signals used for discrimination. In this work, we introduce a novel text-based approach called Soft-GAN to effectively exploit GAN setup for text generation. We demonstrate how autoencoders (AEs) can be used for providing a continuous representation of s…

    Submitted 23 April, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Journal ref: 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics

  12. arXiv:1904.04742  [pdf, other]

    cs.CL cs.LG

    Bilingual-GAN: A Step Towards Parallel Text Generation

    Authors: Ahmad Rashid, Alan Do-Omri, Md. Akmal Haidar, Qun Liu, Mehdi Rezagholizadeh

    Abstract: Latent space based GAN methods and attention based sequence to sequence models have achieved impressive results in text generation and unsupervised machine translation respectively. Leveraging the two domains, we propose an adversarial latent space based model capable of generating parallel sentences in two languages concurrently and translating bidirectionally. The bilingual generation goal is ac…

    Submitted 14 May, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

  13. SALSA-TEXT : self attentive latent space based adversarial text generation

    Authors: Jules Gagnon-Marchand, Hamed Sadeghi, Md. Akmal Haidar, Mehdi Rezagholizadeh

    Abstract: Inspired by the success of self attention mechanism and Transformer architecture in sequence transduction and image generation applications, we propose novel self attention-based architectures to improve the performance of adversarial latent code-based schemes in text generation. Adversarial latent code-based text generation has recently gained a lot of attention due to their promising results. I…

    Submitted 8 October, 2018; v1 submitted 28 September, 2018; originally announced September 2018.

    Comments: 10 pages, 3 figures, under review at ICLR 2019

    Journal ref: Canadian AI 2019