Skip to main content

Showing 1–12 of 12 results for author: Shanmugam, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.14804  [pdf, other

    cs.CY

    Use large language models to promote equity

    Authors: Emma Pierson, Divya Shanmugam, Rajiv Movva, Jon Kleinberg, Monica Agrawal, Mark Dredze, Kadija Ferryman, Judy Wawira Gichoya, Dan Jurafsky, Pang Wei Koh, Karen Levy, Sendhil Mullainathan, Ziad Obermeyer, Harini Suresh, Keyon Vafa

    Abstract: Advances in large language models (LLMs) have driven an explosion of interest about their societal impacts. Much of the discourse around how they will impact social equity has been cautionary or negative, focusing on questions like "how might LLMs be biased and how would we mitigate those biases?" This is a vital discussion: the ways in which AI generally, and LLMs specifically, can entrench biase… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  2. arXiv:2312.00655   

    cs.LG

    Machine Learning for Health symposium 2023 -- Findings track

    Authors: Stefan Hegselmann, Antonio Parziale, Divya Shanmugam, Shengpu Tang, Mercy Nyamewaa Asiedu, Serina Chang, Thomas Hartvigsen, Harvineet Singh

    Abstract: A collection of the accepted Findings papers that were presented at the 3rd Machine Learning for Health symposium (ML4H 2023), which was held on December 10, 2023, in New Orleans, Louisiana, USA. ML4H 2023 invited high-quality submissions on relevant problems in a variety of health-related disciplines including healthcare, biomedicine, and public health. Two submission tracks were offered: the arc… ▽ More

    Submitted 15 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    MSC Class: 68Txx ACM Class: I.2; J.3; I.6; I.4

  3. arXiv:2304.09270  [pdf, other

    cs.CY cs.LG stat.AP

    Coarse race data conceals disparities in clinical risk score performance

    Authors: Rajiv Movva, Divya Shanmugam, Kaihua Hou, Priya Pathak, John Guttag, Nikhil Garg, Emma Pierson

    Abstract: Healthcare data in the United States often records only a patient's coarse race group: for example, both Indian and Chinese patients are typically coded as "Asian." It is unknown, however, whether this coarse coding conceals meaningful disparities in the performance of clinical risk scores across granular race groups. Here we show that it does. Using data from 418K emergency department visits, we… ▽ More

    Submitted 24 August, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Published at MLHC 2023. v2 includes minor changes from the camera-ready, such as a link to code. Code is available at https://github.com/rmovva/granular-race-disparities_MLHC23

    ACM Class: J.3; K.4.2

  4. arXiv:2207.04312  [pdf, other

    cs.CY

    At the Intersection of Deep Learning and Conceptual Art: The End of Signature

    Authors: Divya Shanmugam, Katie Lewis, Jose Javier Gonzalez-Ortiz, Agnieszka Kurant, John Guttag

    Abstract: MIT wanted to commission a large scale artwork that would serve to 'illuminate a new campus gateway, inaugurate a space of exchange between MIT and Cambridge, and inspire our students, faculty, visitors, and the surrounding community to engage with art in new ways and to have art be part of their daily lives.' Among other things, the art was to reflect the fact that scientific discovery is often t… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  5. arXiv:2206.13607  [pdf, other

    cs.LG cs.CL

    Improved Text Classification via Test-Time Augmentation

    Authors: Helen Lu, Divya Shanmugam, Harini Suresh, John Guttag

    Abstract: Test-time augmentation -- the aggregation of predictions across transformed examples of test inputs -- is an established technique to improve the performance of image classification models. Importantly, TTA can be used to improve model performance post-hoc, without additional training. Although test-time augmentation (TTA) can be applied to any data modality, it has seen limited adoption in NLP du… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  6. arXiv:2204.04360  [pdf, other

    cs.LG

    Data Augmentation for Electrocardiograms

    Authors: Aniruddh Raghu, Divya Shanmugam, Eugene Pomerantsev, John Guttag, Collin M. Stultz

    Abstract: Neural network models have demonstrated impressive performance in predicting pathologies and outcomes from the 12-lead electrocardiogram (ECG). However, these models often need to be trained with large, labelled datasets, which are not available for many predictive tasks of interest. In this work, we perform an empirical study examining whether training time data augmentation methods can be used t… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Conference on Health, Inference, and Learning (CHIL) 2022

  7. arXiv:2110.04133  [pdf, other

    cs.CY cs.LG

    Quantifying disparities in intimate partner violence: a machine learning method to correct for underreporting

    Authors: Divya Shanmugam, Kaihua Hou, Emma Pierson

    Abstract: Estimating the prevalence of a medical condition, or the proportion of the population in which it occurs, is a fundamental problem in healthcare and public health. Accurate estimates of the relative prevalence across groups -- capturing, for example, that a condition affects women more frequently than men -- facilitate effective and equitable health policy which prioritizes groups who are dispropo… ▽ More

    Submitted 8 December, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

  8. arXiv:2107.08096  [pdf, other

    cs.LG cs.CY cs.IR

    Learning to Limit Data Collection via Scaling Laws: A Computational Interpretation for the Legal Principle of Data Minimization

    Authors: Divya Shanmugam, Samira Shabanian, Fernando Diaz, Michèle Finck, Asia Biega

    Abstract: Modern machine learning systems are increasingly characterized by extensive personal data collection, despite the diminishing returns and increasing societal costs of such practices. Yet, data minimisation is one of the core data protection principles enshrined in the European Union's General Data Protection Regulation ('GDPR') and requires that only personal data that is adequate, relevant and li… ▽ More

    Submitted 12 June, 2022; v1 submitted 16 July, 2021; originally announced July 2021.

    Comments: To appear at ACM Conference on Fairness, Accountability, and Transparency, 2022

  9. arXiv:2011.11156  [pdf, other

    cs.CV

    Better Aggregation in Test-Time Augmentation

    Authors: Divya Shanmugam, Davis Blalock, Guha Balakrishnan, John Guttag

    Abstract: Test-time augmentation -- the aggregation of predictions across transformed versions of a test input -- is a common practice in image classification. Traditionally, predictions are combined using a simple average. In this paper, we present 1) experimental analyses that shed light on cases in which the simple average is suboptimal and 2) a method to address these shortcomings. A key finding is that… ▽ More

    Submitted 11 October, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Journal ref: ICCV 2021

  10. arXiv:2007.10233  [pdf, other

    cs.CV cs.LG

    Unsupervised Domain Adaptation in the Absence of Source Data

    Authors: Roshni Sahoo, Divya Shanmugam, John Guttag

    Abstract: Current unsupervised domain adaptation methods can address many types of distribution shift, but they assume data from the source domain is freely available. As the use of pre-trained models becomes more prevalent, it is reasonable to assume that source data is unavailable. We propose an unsupervised method for adapting a source classifier to a target domain that varies from the source domain alon… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  11. arXiv:1912.00262  [pdf, other

    q-bio.QM cs.CV cs.LG eess.IV q-bio.TO

    Image segmentation of liver stage malaria infection with spatial uncertainty sampling

    Authors: Ava P. Soleimany, Harini Suresh, Jose Javier Gonzalez Ortiz, Divya Shanmugam, Nil Gural, John Guttag, Sangeeta N. Bhatia

    Abstract: Global eradication of malaria depends on the development of drugs effective against the silent, yet obligate liver stage of the disease. The gold standard in drug development remains microscopic imaging of liver stage parasites in in vitro cell culture models. Image analysis presents a major bottleneck in this pipeline since the parasite has significant variability in size, shape, and density in t… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

  12. arXiv:1812.00475  [pdf, other

    cs.LG stat.ML

    Multiple Instance Learning for ECG Risk Stratification

    Authors: Divya Shanmugam, Davis Blalock, John Guttag

    Abstract: Patients who suffer an acute coronary syndrome are at elevated risk for adverse cardiovascular events such as myocardial infarction and cardiovascular death. Accurate assessment of this risk is crucial to their course of care. We focus on estimating a patient's risk of cardiovascular death after an acute coronary syndrome based on a patient's raw electrocardiogram (ECG) signal. Learning from this… ▽ More

    Submitted 25 March, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Healthcare Conference (MLHC 2019)