Showing 1–7 of 7 results for author: Awasthi, A

Searching in archive eess.
  1. arXiv:2407.00129  [pdf]

    eess.IV cs.AI cs.HC

    Multimodal Learning and Cognitive Processes in Radiology: MedGaze for Chest X-ray Scanpath Prediction

    Authors: Akash Awasthi, Ngan Le, Zhigang Deng, Rishi Agrawal, Carol C. Wu, Hien Van Nguyen

    Abstract: Predicting human gaze behavior within computer vision is integral for developing interactive systems that can anticipate user attention, address fundamental questions in cognitive science, and hold implications for fields like human-computer interaction (HCI) and augmented/virtual reality (AR/VR) systems. Despite methodologies introduced for modeling human eye gaze behavior, applying these models…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Submitted to the Journal

  2. arXiv:2406.19686  [pdf]

    eess.IV cs.AI cs.CV cs.HC

    Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction

    Authors: Akash Awasthi, Ngan Le, Zhigang Deng, Carol C. Wu, Hien Van Nguyen

    Abstract: Human-AI collaboration to identify and correct perceptual errors in chest radiographs has not been previously explored. This study aimed to develop a collaborative AI system, CoRaX, which integrates eye gaze data and radiology reports to enhance diagnostic accuracy in chest radiology by pinpointing perceptual errors and refining the decision-making process. Using public datasets REFLACX and EGD-CX…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Under Review in Journal

  3. arXiv:2404.18981  [pdf, other]

    eess.IV cs.AI

    Decoding Radiologists' Intentions: A Novel System for Accurate Region Identification in Chest X-ray Image Analysis

    Authors: Akash Awasthi, Safwan Ahmad, Bryant Le, Hien Van Nguyen

    Abstract: In the realm of chest X-ray (CXR) image analysis, radiologists meticulously examine various regions, documenting their observations in reports. The prevalence of errors in CXR diagnoses, particularly among inexperienced radiologists and hospital residents, underscores the importance of understanding radiologists' intentions and the corresponding regions of interest. This understanding is crucial f…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted in ISBI 2024

  4. arXiv:2311.09623  [pdf]

    eess.IV cs.CV

    Apoptosis classification using attention based spatio temporal graph convolution neural network

    Authors: Akash Awasthi

    Abstract: Accurate classification of apoptosis plays an important role in cell biology research. There are many state-of-the-art approaches which use deep CNNs to perform the apoptosis classification but these approaches do not account for the cell interaction. Our paper proposes the Attention Graph spatio-temporal graph convolutional network to classify the cell death based on the target cells in the video…

    Submitted 16 November, 2023; originally announced November 2023.

  5. arXiv:2106.02443  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Teaching keyword spotters to spot new keywords with limited examples

    Authors: Abhijeet Awasthi, Kevin Kilgour, Hassan Rom

    Abstract: Learning to recognize new keywords with just a few examples is essential for personalizing keyword spotting (KWS) models to a user's choice of keywords. However, modern KWS models are typically trained on large datasets and restricted to a small vocabulary of keywords, limiting their transferability to a broad range of unseen keywords. Towards easily customizable KWS models, we present KeySEM (Key…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: In INTERSPEECH 2021

  6. arXiv:2103.03142  [pdf, other]

    cs.SD cs.CL eess.AS

    Error-driven Fixed-Budget ASR Personalization for Accented Speakers

    Authors: Abhijeet Awasthi, Aman Kansal, Sunita Sarawagi, Preethi Jyothi

    Abstract: We consider the task of personalizing ASR models while being constrained by a fixed budget on recording speaker-specific utterances. Given a speaker and an ASR model, we propose a method of identifying sentences for which the speaker's utterances are likely to be harder for the given ASR model to recognize. We assume a tiny amount of speaker-specific data to learn phoneme-level error models which…

    Submitted 2 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: In ICASSP 2021

  7. arXiv:2006.13519  [pdf, other]

    eess.AS cs.CL cs.SD

    Black-box Adaptation of ASR for Accented Speech

    Authors: Kartik Khandelwal, Preethi Jyothi, Abhijeet Awasthi, Sunita Sarawagi

    Abstract: We introduce the problem of adapting a black-box, cloud-based ASR system to speech from a target accent. While leading online ASR services obtain impressive performance on main-stream accents, they perform poorly on sub-populations - we observed that the word error rate (WER) achieved by Google's ASR API on Indian accents is almost twice the WER on US accents. Existing adaptation methods either re…

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: A slightly different version submitted to INTERSPEECH 2020 (currently under review)