Skip to main content

Showing 1–14 of 14 results for author: Balagopalan, A

.
  1. arXiv:2403.01628  [pdf, ps, other

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu , et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  2. arXiv:2312.10308  [pdf, other

    cs.LG

    Event-Based Contrastive Learning for Medical Time Series

    Authors: Hyewon Jeong, Nassim Oufattole, Matthew Mcdermott, Aparna Balagopalan, Bryan Jangeesingh, Marzyeh Ghassemi, Collin Stultz

    Abstract: In clinical practice, one often needs to identify whether a patient is at high risk of adverse outcomes after some key medical event. For example, quantifying the risk of adverse outcomes after an acute cardiovascular event helps healthcare providers identify those patients at the highest risk of poor outcomes; i.e., patients who benefit from invasive therapies that can lower their risk. Assessing… ▽ More

    Submitted 19 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at Unifying Representations in Neural Models Workshop in NeurIPS 2023

  3. arXiv:2305.05608  [pdf, other

    cs.IR cs.CY cs.LG

    The Role of Relevance in Fair Ranking

    Authors: Aparna Balagopalan, Abigail Z. Jacobs, Asia Biega

    Abstract: Online platforms mediate access to opportunity: relevance-based rankings create and constrain options by allocating exposure to job openings and job candidates in hiring platforms, or sellers in a marketplace. In order to do so responsibly, these socially consequential systems employ various fairness measures and interventions, many of which seek to allocate exposure based on worthiness. Because t… ▽ More

    Submitted 6 June, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Published in SIGIR 2023

  4. arXiv:2205.03295  [pdf, other

    cs.LG cs.AI cs.CY

    The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations

    Authors: Aparna Balagopalan, Haoran Zhang, Kimia Hamidieh, Thomas Hartvigsen, Frank Rudzicz, Marzyeh Ghassemi

    Abstract: Machine learning models in safety-critical settings like healthcare are often blackboxes: they contain a large number of parameters which are not transparent to users. Post-hoc explainability methods where a simple, human-interpretable model imitates the behavior of these blackbox models are often proposed to help users trust model predictions. In this work, we audit the quality of such explanatio… ▽ More

    Submitted 2 June, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Published in FAccT 2022

  5. arXiv:2110.08931  [pdf, other

    cs.CL

    Quantifying the Task-Specific Information in Text-Based Classifications

    Authors: Zining Zhu, Aparna Balagopalan, Marzyeh Ghassemi, Frank Rudzicz

    Abstract: Recently, neural natural language models have attained state-of-the-art performance on a wide variety of tasks, but the high performance can result from superficial, surface-level cues (Bender and Koller, 2020; Niven and Kao, 2020). These surface cues, as the ``shortcuts'' inherent in the datasets, do not contribute to the *task-specific information* (TSI) of the classification tasks. While it is… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

  6. arXiv:2106.01555  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Comparing Acoustic-based Approaches for Alzheimer's Disease Detection

    Authors: Aparna Balagopalan, Jekaterina Novikova

    Abstract: Robust strategies for Alzheimer's disease (AD) detection are important, given the high prevalence of AD. In this paper, we study the performance and generalizability of three approaches for AD detection from speech on the recent ADReSSo challenge dataset: 1) using conventional acoustic features 2) using novel pre-trained acoustic embeddings 3) combining acoustic features and embeddings. We find th… ▽ More

    Submitted 15 September, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to INTERSPEECH 2021; update includes corrections to last two rows of Table 2 and corresponding text edits

  7. arXiv:2011.06153  [pdf, ps, other

    cs.CL cs.LG

    Augmenting BERT Carefully with Underrepresented Linguistic Features

    Authors: Aparna Balagopalan, Jekaterina Novikova

    Abstract: Fine-tuned Bidirectional Encoder Representations from Transformers (BERT)-based sequence classification models have proven to be effective for detecting Alzheimer's Disease (AD) from transcripts of human speech. However, previous research shows it is possible to improve BERT's performance on various tasks by augmenting the model with additional information. In this work, we use probing tasks as in… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  8. arXiv:2010.06579  [pdf, other

    cs.LG cs.CL

    Fantastic Features and Where to Find Them: Detecting Cognitive Impairment with a Subsequence Classification Guided Approach

    Authors: Benjamin Eyre, Aparna Balagopalan, Jekaterina Novikova

    Abstract: Despite the widely reported success of embedding-based machine learning methods on natural language processing tasks, the use of more easily interpreted engineered features remains common in fields such as cognitive impairment (CI) detection. Manually engineering features from noisy text is time and resource consuming, and can potentially result in features that do not enhance model performance. T… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: EMNLP Workshop on Noisy User-generated Text (W-NUT 2020)

  9. arXiv:2008.01551  [pdf, other

    cs.CL cs.LG

    To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection

    Authors: Aparna Balagopalan, Benjamin Eyre, Frank Rudzicz, Jekaterina Novikova

    Abstract: Research related to automatically detecting Alzheimer's disease (AD) is important, given the high prevalence of AD and the high cost of traditional methods. Since AD significantly affects the content and acoustics of spontaneous speech, natural language processing and machine learning provide promising techniques for reliably detecting AD. We compare and contrast the performance of two such approa… ▽ More

    Submitted 26 July, 2020; originally announced August 2020.

    Comments: accepted to INTERSPEECH 2020

  10. arXiv:1912.04370  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Cross-Language Aphasia Detection using Optimal Transport Domain Adaptation

    Authors: Aparna Balagopalan, Jekaterina Novikova, Matthew B. A. McDermott, Bret Nestor, Tristan Naumann, Marzyeh Ghassemi

    Abstract: Multi-language speech datasets are scarce and often have small sample sizes in the medical domain. Robust transfer of linguistic features across languages could improve rates of early diagnosis and therapy for speakers of low-resource languages when detecting health conditions from speech. We utilize out-of-domain, unpaired, single-speaker, healthy speech data for training multiple Optimal Transpo… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: Accepted to ML4H at NeurIPS 2019

  11. arXiv:1910.00065  [pdf, other

    cs.CL

    Lexical Features Are More Vulnerable, Syntactic Features Have More Predictive Power

    Authors: Jekaterina Novikova, Aparna Balagopalan, Ksenia Shkaruta, Frank Rudzicz

    Abstract: Understanding the vulnerability of linguistic features extracted from noisy text is important for both develo** better health text classification models and for interpreting vulnerabilities of natural language models. In this paper, we investigate how generic language characteristics, such as syntax or the lexicon, are impacted by artificial text alterations. The vulnerability of features is ana… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

    Comments: EMNLP Workshop on Noisy User-generated Text (W-NUT 2019)

  12. arXiv:1904.01684  [pdf, other

    cs.CL

    Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others

    Authors: Aparna Balagopalan, Ksenia Shkaruta, Jekaterina Novikova

    Abstract: Automatic Speech Recognition (ASR) is a critical component of any fully-automated speech-based dementia detection model. However, despite years of speech recognition research, little is known about the impact of ASR accuracy on dementia detection. In this paper, we experiment with controlled amounts of artificially generated ASR errors and investigate their influence on dementia detection. We find… ▽ More

    Submitted 13 October, 2020; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: EMNLP Workshop on Noisy User-generated Text (W-NUT 2020)

  13. arXiv:1811.12254  [pdf, other

    cs.LG cs.CL cs.SD eess.AS stat.ML

    The Effect of Heterogeneous Data for Alzheimer's Disease Detection from Speech

    Authors: Aparna Balagopalan, Jekaterina Novikova, Frank Rudzicz, Marzyeh Ghassemi

    Abstract: Speech datasets for identifying Alzheimer's disease (AD) are generally restricted to participants performing a single task, e.g. describing an image shown to them. As a result, models trained on linguistic features derived from such datasets may not be generalizable across tasks. Building on prior work demonstrating that same-task data of healthy participants helps improve AD detection on a single… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/147

  14. arXiv:1805.02788  [pdf, other

    stat.ML cs.LG

    ReGAN: RE[LAX|BAR|INFORCE] based Sequence Generation using GANs

    Authors: Aparna Balagopalan, Satya Gorti, Mathieu Ravaut, Raeid Saqur

    Abstract: Generative Adversarial Networks (GANs) have seen steep ascension to the peak of ML research zeitgeist in recent years. Mostly catalyzed by its success in the domain of image generation, the technique has seen wide range of adoption in a variety of other problem domains. Although GANs have had a lot of success in producing more realistic images than other approaches, they have only seen limited use… ▽ More

    Submitted 7 May, 2018; originally announced May 2018.