Skip to main content

Showing 1–28 of 28 results for author: Saria, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.06627  [pdf, other

    cs.LG cs.AI stat.ML

    Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

    Authors: Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria

    Abstract: As artificial intelligence (AI) / machine learning (ML) gain widespread adoption, practitioners are increasingly seeking means to quantify and control the risk these systems incur. This challenge is especially salient when such systems have autonomy to collect their own data, such as in black-box optimization and active learning, where their actions induce sequential feedback-loop shifts in the da… ▽ More

    Submitted 5 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code available at https://github.com/drewprinster/conformal-mfcs

  2. arXiv:2402.03226  [pdf, other

    cs.LG

    FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion

    Authors: Xing Han, Huy Nguyen, Carl Harris, Nhat Ho, Suchi Saria

    Abstract: As machine learning models in critical fields increasingly grapple with multimodal data, they face the dual challenges of handling a wide array of modalities, often incomplete due to missing elements, and the temporal irregularity and sparsity of collected samples. Successfully leveraging this complex data, while overcoming the scarcity of high-quality training samples, is key to improving these m… ▽ More

    Submitted 22 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 47 pages, 9 tables, 12 figures

  3. arXiv:2310.12803  [pdf, other

    cs.LG cs.CL

    Data Augmentations for Improved (Large) Language Model Generalization

    Authors: Amir Feder, Yoav Wald, Claudia Shi, Suchi Saria, David Blei

    Abstract: The reliance of text classifiers on spurious correlations can lead to poor generalization at deployment, raising concerns about their use in safety-critical domains such as healthcare. In this work, we propose to use counterfactual data augmentation, guided by knowledge of the causal structure of the data, to simulate interventions on spurious features and to learn more robust text classifiers. We… ▽ More

    Submitted 9 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Published at NeurIPS 2023

  4. arXiv:2207.10716  [pdf, other

    cs.LG stat.ML

    JAWS: Auditing Predictive Uncertainty Under Covariate Shift

    Authors: Drew Prinster, Anqi Liu, Suchi Saria

    Abstract: We propose \textbf{JAWS}, a series of wrapper methods for distribution-free uncertainty quantification tasks under covariate shift, centered on the core method \textbf{JAW}, the \textbf{JA}ckknife+ \textbf{W}eighted with data-dependent likelihood-ratio weights. JAWS also includes computationally efficient \textbf{A}pproximations of JAW using higher-order influence functions: \textbf{JAWA}. Theoret… ▽ More

    Submitted 23 November, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems

  5. arXiv:2112.12582  [pdf

    q-bio.OT cs.LG

    Beyond Low Earth Orbit: Biological Research, Artificial Intelligence, and Self-Driving Labs

    Authors: Lauren M. Sanders, Jason H. Yang, Ryan T. Scott, Amina Ann Qutub, Hector Garcia Martin, Daniel C. Berrios, Jaden J. A. Hastings, Jon Rask, Graham Mackintosh, Adrienne L. Hoarfrost, Stuart Chalk, John Kalantari, Kia Khezeli, Erik L. Antonsen, Joel Babdor, Richard Barker, Sergio E. Baranzini, Afshin Beheshti, Guillermo M. Delgado-Aparicio, Benjamin S. Glicksberg, Casey S. Greene, Melissa Haendel, Arif A. Hamid, Philip Heller, Daniel Jamieson , et al. (31 additional authors not shown)

    Abstract: Space biology research aims to understand fundamental effects of spaceflight on organisms, develop foundational knowledge to support deep space exploration, and ultimately bioengineer spacecraft and habitats to stabilize the ecosystem of plants, crops, microbes, animals, and humans for sustained multi-planetary life. To advance these aims, the field leverages experiments, platforms, data, and mode… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 28 pages, 4 figures

  6. arXiv:2112.12554  [pdf

    q-bio.OT cs.LG

    Beyond Low Earth Orbit: Biomonitoring, Artificial Intelligence, and Precision Space Health

    Authors: Ryan T. Scott, Erik L. Antonsen, Lauren M. Sanders, Jaden J. A. Hastings, Seung-min Park, Graham Mackintosh, Robert J. Reynolds, Adrienne L. Hoarfrost, Aenor Sawyer, Casey S. Greene, Benjamin S. Glicksberg, Corey A. Theriot, Daniel C. Berrios, Jack Miller, Joel Babdor, Richard Barker, Sergio E. Baranzini, Afshin Beheshti, Stuart Chalk, Guillermo M. Delgado-Aparicio, Melissa Haendel, Arif A. Hamid, Philip Heller, Daniel Jamieson, Katelyn J. Jarvis , et al. (31 additional authors not shown)

    Abstract: Human space exploration beyond low Earth orbit will involve missions of significant distance and duration. To effectively mitigate myriad space health hazards, paradigm shifts in data and space health systems are necessary to enable Earth-independence, rather than Earth-reliance. Promising developments in the fields of artificial intelligence and machine learning for biology and health can address… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 31 pages, 4 figures

  7. arXiv:2012.12449  [pdf, other

    stat.ML cs.LG stat.ME

    Partial Identifiability in Discrete Data With Measurement Error

    Authors: Noam Finkelstein, Roy Adams, Suchi Saria, Ilya Shpitser

    Abstract: When data contains measurement errors, it is necessary to make assumptions relating the observed, erroneous data to the unobserved true phenomena of interest. These assumptions should be justifiable on substantive grounds, but are often motivated by mathematical convenience, for the sake of exactly identifying the target of inference. We adopt the view that it is preferable to present bounds under… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  8. arXiv:2010.15100  [pdf, other

    cs.LG stat.ML

    Evaluating Model Robustness and Stability to Dataset Shift

    Authors: Adarsh Subbaswamy, Roy Adams, Suchi Saria

    Abstract: As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which typically requires applying the model to multiple, independent datasets. Since the cost of collecting such datasets is often prohibitive, in this paper, we propose a fr… ▽ More

    Submitted 15 March, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

  9. arXiv:2002.08948  [pdf, other

    stat.ML cs.AI cs.LG

    I-SPEC: An End-to-End Framework for Learning Transportable, Shift-Stable Models

    Authors: Adarsh Subbaswamy, Suchi Saria

    Abstract: Shifts in environment between development and deployment cause classical supervised learning to produce models that fail to generalize well to new target distributions. Recently, many solutions which find invariant predictive distributions have been developed. Among these, graph-based approaches do not require data from the target environment and can capture more stable information than alternativ… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  10. arXiv:1905.11374  [pdf, other

    stat.ML cs.AI cs.LG

    A Unifying Causal Framework for Analyzing Dataset Shift-stable Learning Algorithms

    Authors: Adarsh Subbaswamy, Bryant Chen, Suchi Saria

    Abstract: Recent interest in the external validity of prediction models (i.e., the problem of different train and test distributions, known as dataset shift) has produced many methods for finding predictive distributions that are invariant to dataset shifts and can be used for prediction in new, unseen environments. However, these methods consider different types of shifts and have been developed under disp… ▽ More

    Submitted 18 July, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Published in the Journal of Causal Inference

    Journal ref: Journal of Causal Inference, 10(1), 64-89

  11. arXiv:1904.07204  [pdf, other

    cs.LG cs.AI

    Tutorial: Safe and Reliable Machine Learning

    Authors: Suchi Saria, Adarsh Subbaswamy

    Abstract: This document serves as a brief overview of the "Safe and Reliable Machine Learning" tutorial given at the 2019 ACM Conference on Fairness, Accountability, and Transparency (FAT* 2019). The talk slides can be found here: https://bit.ly/2Gfsukp, while a video of the talk is available here: https://youtu.be/FGLOCkC4KmE, and a complete list of references for the tutorial here: https://bit.ly/2GdLPme.

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Overview of the "Safe and Reliable Machine Learning" tutorial given at the 2019 ACM Conference on Fairness, Accountability, and Transparency (FAT* 2019)

  12. arXiv:1904.05268  [pdf, other

    stat.ML cs.LG

    Active Learning for Decision-Making from Imbalanced Observational Data

    Authors: Iiris Sundin, Peter Schulam, Eero Siivola, Aki Vehtari, Suchi Saria, Samuel Kaski

    Abstract: Machine learning can help personalized decision support by learning models to predict individual treatment effects (ITE). This work studies the reliability of prediction-based decision-making in a task of deciding which action $a$ to take for a target unit after observing its covariates $\tilde{x}$ and predicted outcomes $\hat{p}(\tilde{y} \mid \tilde{x}, a)$. An example case is personalized medic… ▽ More

    Submitted 6 June, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Published in Proceedings of the 36th International Conference on Machine Learning (ICML) 2019. 15 pages (10 paper + 5 supplementary), 7 figures

  13. arXiv:1901.09060  [pdf, other

    stat.ML cs.LG

    Learning Models from Data with Measurement Error: Tackling Underreporting

    Authors: Roy Adams, Yuelong Ji, Xiaobin Wang, Suchi Saria

    Abstract: Measurement error in observational datasets can lead to systematic bias in inferences based on these datasets. As studies based on observational data are increasingly used to inform decisions with real-world impact, it is critical that we develop a robust set of techniques for analyzing and adjusting for these biases. In this paper we present a method for estimating the distribution of an outcome… ▽ More

    Submitted 25 January, 2019; originally announced January 2019.

  14. arXiv:1901.05406  [pdf

    cs.CY

    Artificial Intelligence for Social Good

    Authors: Gregory D. Hager, Ann Drobnis, Fei Fang, Rayid Ghani, Amy Greenwald, Terah Lyons, David C. Parkes, Jason Schultz, Suchi Saria, Stephen F. Smith, Milind Tambe

    Abstract: The Computing Community Consortium (CCC), along with the White House Office of Science and Technology Policy (OSTP), and the Association for the Advancement of Artificial Intelligence (AAAI), co-sponsored a public workshop on Artificial Intelligence for Social Good on June 7th, 2016 in Washington, DC. This was one of five workshops that OSTP co-sponsored and held around the country to spur public… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: A Computing Community Consortium (CCC) workshop report, 22 pages

    Report number: ccc2016report_1

  15. arXiv:1901.00403  [pdf, other

    stat.ML cs.LG stat.ME

    Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

    Authors: Peter Schulam, Suchi Saria

    Abstract: To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability often require new learning algorithms (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a model after it is trained. In this paper, we describe resampling uncert… ▽ More

    Submitted 28 February, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of Artificial Intelligence and Statistics (AISTATS) 2019

  16. arXiv:1812.04597  [pdf, other

    stat.ML cs.AI cs.LG

    Preventing Failures Due to Dataset Shift: Learning Predictive Models That Transport

    Authors: Adarsh Subbaswamy, Peter Schulam, Suchi Saria

    Abstract: Classical supervised learning produces unreliable models when training and target distributions differ, with most existing solutions requiring samples from the target domain. We propose a proactive approach which learns a relationship in the training domain that will generalize to the target domain by incorporating prior knowledge of aspects of the data generating process that are expected to diff… ▽ More

    Submitted 28 February, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 2019. Previously presented at the NeurIPS 2018 Causal Learning Workshop

  17. arXiv:1810.03025  [pdf, other

    stat.ML cs.AI cs.LG eess.SY

    Discretizing Logged Interaction Data Biases Learning for Decision-Making

    Authors: Peter Schulam, Suchi Saria

    Abstract: Time series data that are not measured at regular intervals are commonly discretized as a preprocessing step. For example, data about customer arrival times might be simplified by summing the number of arrivals within hourly intervals, which produces a discrete-time time series that is easier to model. In this abstract, we show that discretization introduces a bias that affects models trained for… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: This is a standalone short paper describing a new type of bias that can arise when learning from time series data for sequential decision-making problems

  18. arXiv:1808.03253  [pdf, other

    stat.ML cs.LG

    Counterfactual Normalization: Proactively Addressing Dataset Shift and Improving Reliability Using Causal Mechanisms

    Authors: Adarsh Subbaswamy, Suchi Saria

    Abstract: Predictive models can fail to generalize from training to deployment environments because of dataset shift, posing a threat to model reliability and the safety of downstream decisions made in practice. Instead of using samples from the target distribution to reactively correct dataset shift, we use graphical knowledge of the causal mechanisms relating variables in a prediction problem to proactive… ▽ More

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence (UAI), 2018. Revised from print version

  19. arXiv:1708.04757  [pdf, other

    stat.ML cs.AI cs.LG

    Scalable Joint Models for Reliable Uncertainty-Aware Event Prediction

    Authors: Hossein Soleimani, James Hensman, Suchi Saria

    Abstract: Missing data and noisy observations pose significant challenges for reliably predicting events from irregularly sampled multivariate time series (longitudinal) data. Imputation methods, which are typically used for completing the data prior to event prediction, lack a principled mechanism to account for the uncertainty due to missingness. Alternatively, state-of-the-art joint modeling techniques c… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: To appear in IEEE Transaction on Pattern Analysis and Machine Intelligence

  20. arXiv:1704.02038  [pdf, other

    stat.ML cs.AI cs.LG

    Treatment-Response Models for Counterfactual Reasoning with Continuous-time, Continuous-valued Interventions

    Authors: Hossein Soleimani, Adarsh Subbaswamy, Suchi Saria

    Abstract: Treatment effects can be estimated from observational data as the difference in potential outcomes. In this paper, we address the challenge of estimating the potential outcome when treatment-dose levels can vary continuously over time. Further, the outcome variable may not be measured at a regular frequency. Our proposed solution represents the treatment response curves using linear time-invariant… ▽ More

    Submitted 4 November, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: In Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence (UAI-2017), Sydney, Australia, August 2017. The first two authors contributed equally to this work

  21. arXiv:1703.10651  [pdf, other

    stat.ML cs.AI cs.LG

    Reliable Decision Support using Counterfactual Models

    Authors: Peter Schulam, Suchi Saria

    Abstract: Decision-makers are faced with the challenge of estimating what is likely to happen when they take an action. For instance, if I choose not to treat this patient, are they likely to die? Practitioners commonly use supervised learning algorithms to fit predictive models that help decision-makers reason about likely future outcomes, but we show that this approach is unreliable, and sometimes even da… ▽ More

    Submitted 1 February, 2018; v1 submitted 30 March, 2017; originally announced March 2017.

    Comments: Published in the proceedings of Neural Information Processing Systems (NIPS) 2017

  22. arXiv:1608.05182  [pdf, other

    cs.LG stat.ML

    A Bayesian Nonparametric Approach for Estimating Individualized Treatment-Response Curves

    Authors: Yanbo Xu, Yanxun Xu, Suchi Saria

    Abstract: We study the problem of estimating the continuous response over time to interventions using observational time series---a retrospective dataset where the policy by which the data are generated is unknown to the learner. We are motivated by applications where response varies by individuals and therefore, estimating responses at the individual-level is valuable for personalizing decision-making. We… ▽ More

    Submitted 10 December, 2016; v1 submitted 18 August, 2016; originally announced August 2016.

  23. arXiv:1604.05819  [pdf, other

    stat.ML cs.LG

    Trading-Off Cost of Deployment Versus Accuracy in Learning Predictive Models

    Authors: Daniel P. Robinson, Suchi Saria

    Abstract: Predictive models are finding an increasing number of applications in many industries. As a result, a practical means for trading-off the cost of deploying a model versus its effectiveness is needed. Our work is motivated by risk prediction problems in healthcare. Cost-structures in domains such as healthcare are quite complex, posing a significant challenge to existing approaches. We propose a no… ▽ More

    Submitted 20 April, 2016; originally announced April 2016.

    Comments: Authors contributed equally to this work. To appear in IJCAI 2016, Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

  24. arXiv:1601.00960  [pdf, other

    cs.CY

    High Frequency Remote Monitoring of Parkinson's Disease via Smartphone: Platform Overview and Medication Response Detection

    Authors: Andong Zhan, Max A. Little, Denzil A. Harris, Solomon O. Abiola, E. Ray Dorsey, Suchi Saria, Andreas Terzis

    Abstract: Objective: The aim of this study is to develop a smartphone-based high-frequency remote monitoring platform, assess its feasibility for remote monitoring of symptoms in Parkinson's disease, and demonstrate the value of data collected using the platform by detecting dopaminergic medication response. Methods: We have developed HopkinsPD, a novel smartphone-based monitoring platform, which measures s… ▽ More

    Submitted 5 January, 2016; originally announced January 2016.

  25. arXiv:1512.05990  [pdf, other

    cs.CV

    Deformable Distributed Multiple Detector Fusion for Multi-Person Tracking

    Authors: Andy J Ma, Pong C Yuen, Suchi Saria

    Abstract: This paper addresses fully automated multi-person tracking in complex environments with challenging occlusion and extensive pose variations. Our solution combines multiple detectors for a set of different regions of interest (e.g., full-body and head) for multi-person tracking. The use of multiple detectors leads to fewer miss detections as it is able to exploit the complementary strengths of the… ▽ More

    Submitted 18 December, 2015; originally announced December 2015.

  26. arXiv:1507.07295  [pdf, other

    cs.AI stat.AP

    Learning (Predictive) Risk Scores in the Presence of Censoring due to Interventions

    Authors: Kirill Dyagilev, Suchi Saria

    Abstract: A large and diverse set of measurements are regularly collected during a patient's hospital stay to monitor their health status. Tools for integrating these measurements into severity scores, that accurately track changes in illness severity, can improve clinicians ability to provide timely interventions. Existing approaches for creating such scores either 1) rely on experts to fully specify the s… ▽ More

    Submitted 26 July, 2015; originally announced July 2015.

    Journal ref: Machine Learning Journal, Special Issue on on Machine Learning for Health and Medicine, pp. 1-26, 2015

  27. arXiv:1206.5260  [pdf

    cs.AI

    Reasoning at the Right Time Granularity

    Authors: Suchi Saria, Uri Nodelman, Daphne Koller

    Abstract: Most real-world dynamic systems are composed of different components that often evolve at very different rates. In traditional temporal graphical models, such as dynamic Bayesian networks, time is modeled at a fixed granularity, generally selected based on the rate at which the fastest component evolves. Inference must then be performed at this fastest granularity, potentially at significant compu… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-326-334

  28. arXiv:1008.2028  [pdf, ps, other

    stat.ML cs.AI stat.ME

    Discovering shared and individual latent structure in multiple time series

    Authors: Suchi Saria, Daphne Koller, Anna Penn

    Abstract: This paper proposes a nonparametric Bayesian method for exploratory data analysis and feature construction in continuous time series. Our method focuses on understanding shared features in a set of time series that exhibit significant individual variability. Our method builds on the framework of latent Diricihlet allocation (LDA) and its extension to hierarchical Dirichlet processes, which allows… ▽ More

    Submitted 11 August, 2010; originally announced August 2010.

    Comments: Additional supplementary section in tex file