Showing 1–6 of 6 results for author: Shekar, M C

  1. arXiv:2404.18842  [pdf, other]

    cs.CV

    VISION: Toward a Standardized Process for Radiology Image Management at the National Level

    Authors: Kathryn Knight, Ioana Danciu, Olga Ovchinnikova, Jacob Hinkle, Mayanka Chandra Shekar, Debangshu Mukherjee, Eileen McAllister, Caitlin Rizy, Kelly Cho, Amy C. Justice, Joseph Erdos, Peter Kuzmak, Lauren Costa, Yuk-Lam Ho, Reddy Madipadga, Suzanne Tamang, Ian Goethert

    Abstract: The compilation and analysis of radiological images poses numerous challenges for researchers. The sheer volume of data as well as the computational needs of algorithms capable of operating on images are extensive. Additionally, the assembly of these images alone is difficult, as these exams may differ widely in terms of clinical context, structured annotation available for model training, modalit…

    Submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2311.02382  [pdf, other]

    cs.DC cs.AI

    Ultra-Long Sequence Distributed Transformer

    Authors: Xiao Wang, Isaac Lyngaas, Aristeidis Tsaris, Peng Chen, Sajal Dash, Mayanka Chandra Shekar, Tao Luo, Hong-Jun Yoon, Mohamed Wahib, John Gounley

    Abstract: Transformer models trained on long sequences often achieve higher accuracy than short sequences. Unfortunately, conventional transformers struggle with long sequence training due to the overwhelming computation and memory requirements. Existing methods for long sequence training offer limited speedup and memory reduction, and may compromise accuracy. This paper presents a novel and efficient distr…

    Submitted 8 November, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

  3. arXiv:2306.06524  [pdf, other]

    eess.AS cs.CL cs.SD

    What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model

    Authors: Mu Yang, Ram C. M. C. Shekar, Okim Kang, John H. L. Hansen

    Abstract: This study is focused on understanding and quantifying the change in phoneme and prosody information encoded in the Self-Supervised Learning (SSL) model, brought by an accent identification (AID) fine-tuning task. This problem is addressed based on model probing. Specifically, we conduct a systematic layer-wise analysis of the representations of the Transformer layers on a phoneme correlation task…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  4. arXiv:2211.10565  [pdf, other]

    eess.AS cs.HC cs.LG cs.SD

    Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

    Authors: Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen

    Abstract: In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance. In this study, we demonstrate that filterbank learning outperforms handcrafted speech features for KWS whenever the number of filterbank channels is severely decreased. Reducing the number of channels might yield certain KWS performance drop, but…

    Submitted 23 February, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  5. arXiv:2101.01337  [pdf, ps, other]

    cs.CL cs.LG

    Integration of Domain Knowledge using Medical Knowledge Graph Deep Learning for Cancer Phenotyping

    Authors: Mohammed Alawad, Shang Gao, Mayanka Chandra Shekar, S. M. Shamimul Hasan, J. Blair Christian, Xiao-Cheng Wu, Eric B. Durbin, Jennifer Doherty, Antoinette Stroup, Linda Coyle, Lynne Penberthy, Georgia Tourassi

    Abstract: A key component of deep learning (DL) for natural language processing (NLP) is word embeddings. Word embeddings that effectively capture the meaning and context of the word that they represent can significantly improve the performance of downstream DL models for various NLP tasks. Many existing word embeddings techniques capture the context of words based on word co-occurrence in documents and tex…

    Submitted 4 January, 2021; originally announced January 2021.

  6. arXiv:2008.06764  [pdf, other]

    eess.AS cs.SD

    FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data

    Authors: Aditya Joglekar, John H. L. Hansen, Meena Chandra Shekar, Abhijeet Sangwan

    Abstract: The Fearless Steps Initiative by UTDallas-CRSS led to the digitization, recovery, and diarization of 19,000 hours of original analog audio data, as well as the development of algorithms to extract meaningful information from this multi-channel naturalistic data resource. The 2020 FEARLESS STEPS (FS-2) Challenge is the second annual challenge held for the Speech and Language Technology community to…

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Paper accepted at the Interspeech 2020 Conference