Showing 1–30 of 30 results for author: Wiens, J

  1. arXiv:2406.18865  [pdf, other]

    cs.LG stat.ML

    From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions

    Authors: Trenton Chang, Jenna Wiens

    Abstract: Selective labels occur when label observations are subject to a decision-making process; e.g., diagnoses that depend on the administration of laboratory tests. We study a clinically-inspired selective label problem called disparate censorship, where labeling biases vary across subgroups and unlabeled individuals are imputed as "negative" (i.e., no diagnostic test = no illness). Machine learning mo…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 39 pages, 33 figures. ICML 2024 conference paper

  2. arXiv:2310.17146  [pdf, other]

    cs.LG cs.AI

    Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation

    Authors: Shengpu Tang, Jenna Wiens

    Abstract: In applying reinforcement learning (RL) to high-stakes domains, quantitative and qualitative evaluation using observational data can help practitioners understand the generalization performance of new policies. However, this type of off-policy evaluation (OPE) is inherently limited since offline data may not reflect the distribution shifts resulting from the application of new policies. On the oth…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 36 pages, 12 figures, 5 tables. NeurIPS 2023. Code available at https://github.com/MLD3/CounterfactualAnnot-SemiOPE

  3. arXiv:2308.05619  [pdf, other]

    stat.ML cs.AI cs.LG

    Updating Clinical Risk Stratification Models Using Rank-Based Compatibility: Approaches for Evaluating and Optimizing Clinician-Model Team Performance

    Authors: Erkin Ötleş, Brian T. Denton, Jenna Wiens

    Abstract: As data shift or new data become available, updating clinical machine learning models may be necessary to maintain or improve performance over time. However, updating a model can introduce compatibility issues when the behavior of the updated model does not align with user expectations, resulting in poor user-model team performance. Existing compatibility measures depend on model decision threshol…

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Conference paper accepted at the 2023 Machine Learning for Healthcare Conference. Includes supplemental: 32 pages, 17 figures

  4. arXiv:2307.07014  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Factored Action Spaces for Off-Policy Evaluation

    Authors: Aaman Rebello, Shengpu Tang, Jenna Wiens, Sonali Parbhoo

    Abstract: Off-policy evaluation (OPE) aims to estimate the benefit of following a counterfactual sequence of actions, given data collected from executed sequences. However, existing OPE estimators often exhibit high bias and high variance in problems involving large, combinatorial action spaces. We investigate how to mitigate this issue using factored action spaces, i.e., expressing each action as a combinati…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Main paper: 8 pages, 7 figures. Appendix: 30 pages, 17 figures. Accepted at ICML 2023 Workshop on Counterfactuals in Minds and Machines, Honolulu, Hawaii, USA. Camera ready version

    MSC Class: 62D20 (Primary) 62M05; 60J10; 62D05; 62P10 (Secondary) ACM Class: I.2.6; I.2.8; G.3; J.3

  5. arXiv:2307.04868  [pdf, other]

    cs.LG

    Leveraging an Alignment Set in Tackling Instance-Dependent Label Noise

    Authors: Donna Tjandra, Jenna Wiens

    Abstract: Noisy training labels can hurt model performance. Most approaches that aim to address label noise assume label noise is independent from the input features. In practice, however, label noise is often feature or instance-dependent, and therefore biased (i.e., some instances are more likely to be mislabeled than others). E.g., in clinical care, female patients are more likely to be under-di…

    Submitted 10 July, 2023; originally announced July 2023.

    Journal ref: In Conference on Health, Inference, and Learning (pp. 477-497). PMLR (2023)

  6. arXiv:2305.01738  [pdf, other]

    cs.LG cs.AI

    Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare

    Authors: Shengpu Tang, Maggie Makar, Michael W. Sjoding, Finale Doshi-Velez, Jenna Wiens

    Abstract: Many reinforcement learning (RL) applications have combinatorial action spaces, where each action is a composition of sub-actions. A standard RL approach ignores this inherent factorization structure, resulting in a potential failure to make meaningful inferences about rarely observed sub-action combinations; this is particularly problematic for offline settings, where data may be limited. In this…

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 30 pages, 18 figures, 2 tables. NeurIPS 2022. Code available at https://github.com/MLD3/OfflineRL_FactoredActions

  7. arXiv:2304.08593  [pdf, other]

    cs.LG

    Forecasting with Sparse but Informative Variables: A Case Study in Predicting Blood Glucose

    Authors: Harry Rubin-Falcone, Joyce Lee, Jenna Wiens

    Abstract: In time-series forecasting, future target values may be affected by both intrinsic and extrinsic effects. When forecasting blood glucose, for example, intrinsic effects can be inferred from the history of the target signal alone (i.e., blood glucose), but accurately modeling the impact of extrinsic effects requires auxiliary signals, like the amount of carbohydrates ingested. Standard fore…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 10 pages, 9 figures, 5 tables, accepted to AAAI23

    Journal ref: AAAI 2023

  8. arXiv:2208.01127  [pdf, other]

    cs.LG cs.CY

    Disparate Censorship & Undertesting: A Source of Label Bias in Clinical Machine Learning

    Authors: Trenton Chang, Michael W. Sjoding, Jenna Wiens

    Abstract: As machine learning (ML) models gain traction in clinical applications, understanding the impact of clinician and societal biases on ML models is increasingly important. While biases can arise in the labels used for model training, the many sources from which these biases arise are not yet well-studied. In this paper, we highlight disparate censorship (i.e., differences in testing rates across pat…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 48 pages, 18 figures. Machine Learning for Healthcare Conference (MLHC 2022)

  9. arXiv:2108.12530  [pdf]

    cs.LG cs.AI cs.CV

    Combining chest X-rays and electronic health record (EHR) data using machine learning to diagnose acute respiratory failure

    Authors: Sarah Jabbour, David Fouhey, Ella Kazerooni, Jenna Wiens, Michael W Sjoding

    Abstract: Objective: When patients develop acute respiratory failure, accurately identifying the underlying etiology is essential for determining the best treatment. However, differentiating between common medical diagnoses can be challenging in clinical practice. Machine learning models could improve medical diagnosis by aiding in the diagnostic evaluation of these patients. Materials and Methods: Machine…

    Submitted 20 April, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

  10. arXiv:2107.13964  [pdf, other]

    cs.CY cs.LG

    Mind the Performance Gap: Examining Dataset Shift During Prospective Validation

    Authors: Erkin Ötleş, Jeeheh Oh, Benjamin Li, Michelle Bochinski, Hyeon Joo, Justin Ortwine, Erica Shenoy, Laraine Washer, Vincent B. Young, Krishna Rao, Jenna Wiens

    Abstract: Once integrated into clinical care, patient risk stratification models may perform worse compared to their retrospective performance. To date, it is widely accepted that performance will degrade over time due to changes in care processes and patient populations. However, the extent to which this occurs is poorly understood, in part because few researchers report prospective validation performance.…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: Accepted by the 2021 Machine Learning for Healthcare Conference

  11. arXiv:2107.11003  [pdf, other]

    cs.LG

    Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings

    Authors: Shengpu Tang, Jenna Wiens

    Abstract: Reinforcement learning (RL) can be used to learn treatment policies and aid decision making in healthcare. However, given the need for generalization over complex state/action spaces, the incorporation of function approximators (e.g., deep neural networks) requires model selection to reduce overfitting and improve policy performance at deployment. Yet a standard validation pipeline for model selec…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 33 pages, 9 figures. Machine Learning for Healthcare Conference (MLHC 2021)

  12. arXiv:2010.14592  [pdf, other]

    cs.LG stat.ML

    Shapley Flow: A Graph-based Approach to Interpreting Model Predictions

    Authors: Jiaxuan Wang, Jenna Wiens, Scott Lundberg

    Abstract: Many existing approaches for estimating feature importance are problematic because they ignore or hide dependencies among features. A causal graph, which encodes the relationships among input variables, can aid in assigning feature importance. However, current approaches that assign credit to nodes in the causal graph fail to explain the entire graph. In light of these limitations, we propose Shap…

    Submitted 26 February, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: camera ready version for AISTATS 2021

  13. arXiv:2009.10132  [pdf, other]

    cs.CV cs.AI cs.LG

    Deep Learning Applied to Chest X-Rays: Exploiting and Preventing Shortcuts

    Authors: Sarah Jabbour, David Fouhey, Ella Kazerooni, Michael W. Sjoding, Jenna Wiens

    Abstract: While deep learning has shown promise in improving the automated diagnosis of disease based on chest X-rays, deep networks may exhibit undesirable behavior related to shortcuts. This paper studies the case of spurious class skew in which patients with a particular attribute are spuriously more likely to have the outcome of interest. For instance, clinical protocols might lead to a dataset in which…

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: 32 pages, 9 figures, 12 tables, MLHC 2020

  14. arXiv:2009.09051  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Reinforcement Learning for Closed-Loop Blood Glucose Control

    Authors: Ian Fox, Joyce Lee, Rodica Pop-Busui, Jenna Wiens

    Abstract: People with type 1 diabetes (T1D) lack the ability to produce the insulin their bodies need. As a result, they must continually make decisions about how much insulin to self-administer to adequately control their blood glucose levels. Longitudinal data streams captured from wearables, like continuous glucose monitors, can help these individuals manage their health, but currently the majority of th…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted to MLHC 2020

  15. arXiv:2007.12678  [pdf, other]

    cs.LG cs.AI stat.ML

    Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies

    Authors: Shengpu Tang, Aditya Modi, Michael W. Sjoding, Jenna Wiens

    Abstract: Standard reinforcement learning (RL) aims to find an optimal policy that identifies the best action for each state. However, in healthcare settings, many actions may be near-equivalent with respect to the reward (e.g., survival). We consider an alternative objective -- learning set-valued policies to capture near-equivalent actions that lead to similar cumulative rewards. We propose a model-free a…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: ICML 2020. Code available at https://github.com/shengpu1126/RL-Set-Valued-Policy

  16. arXiv:2006.16541  [pdf, other]

    cs.LG stat.ML

    AdaSGD: Bridging the gap between SGD and Adam

    Authors: Jiaxuan Wang, Jenna Wiens

    Abstract: In the context of stochastic gradient descent (SGD) and adaptive moment estimation (Adam), researchers have recently proposed optimization techniques that transition from Adam to SGD with the goal of improving both convergence and generalization performance. However, precisely how each approach trades off early progress and generalization is not well understood; thus, it is unclear when or even if,…

    Submitted 30 June, 2020; originally announced June 2020.

  17. arXiv:1908.02723  [pdf, other]

    cs.LG cs.CV stat.ML

    Advocacy Learning: Learning through Competition and Class-Conditional Representations

    Authors: Ian Fox, Jenna Wiens

    Abstract: We introduce advocacy learning, a novel supervised training scheme for attention-based classification problems. Advocacy learning relies on a framework consisting of two connected networks: 1) N Advocates (one for each class), each of which outputs an argument in the form of an attention map over the input, and 2) a Judge, which predicts the class label based on these arguments. Each Advocate pr…

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: Accepted IJCAI 2019

  18. arXiv:1906.05657  [pdf]

    cs.CY cs.LG stat.ML

    Automatically Evaluating Balance: A Machine Learning Approach

    Authors: Tian Bao, Brooke N. Klatt, Susan L. Whitney, Kathleen H. Sienko, Jenna Wiens

    Abstract: Compared to in-clinic balance training, in-home training is not as effective. This is, in part, due to the lack of feedback from physical therapists (PTs). Here, we analyze the feasibility of using trunk sway data and machine learning (ML) techniques to automatically evaluate balance, providing accurate assessments outside of the clinic. We recruited sixteen participants to perform standing balanc…

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: 8 pages, 4 figures, 5 tables

    Journal ref: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2019

  19. arXiv:1906.02898  [pdf, other]

    cs.LG stat.ML

    Relaxed Parameter Sharing: Effectively Modeling Time-Varying Relationships in Clinical Time-Series

    Authors: Jeeheh Oh, Jiaxuan Wang, Shengpu Tang, Michael Sjoding, Jenna Wiens

    Abstract: Recurrent neural networks (RNNs) are commonly applied to clinical time-series data with the goal of learning patient risk stratification models. Their effectiveness is due, in part, to their use of parameter sharing over time (i.e., cells are repeated hence the name recurrent). We hypothesize, however, that this trait also contributes to the increased difficulty such models have with learning rela…

    Submitted 2 January, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: Machine Learning for Healthcare 2019

  20. arXiv:1811.12520  [pdf, other]

    cs.LG stat.AP stat.ML

    Leveraging Clinical Time-Series Data for Prediction: A Cautionary Tale

    Authors: Eli Sherman, Hitinder Gurm, Ulysses Balis, Scott Owens, Jenna Wiens

    Abstract: In healthcare, patient risk stratification models are often learned using time-series data extracted from electronic health records. When extracting data for a clinical prediction task, several formulations exist, depending on how one chooses the time of prediction and the prediction horizon. In this paper, we show how the formulation can greatly impact both model performance and clinical utility.…

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: In Proceedings of American Medical Informatics Annual Symposium 2017 PMID: 29854227

    Journal ref: AMIA Annu Symp Proc. 2018 Apr 16;2017:1571-1580. eCollection 2017

  21. arXiv:1808.06725  [pdf, other]

    cs.LG stat.ML

    Learning to Exploit Invariances in Clinical Time-Series Data using Sequence Transformer Networks

    Authors: Jeeheh Oh, Jiaxuan Wang, Jenna Wiens

    Abstract: Recently, researchers have started applying convolutional neural networks (CNNs) with one-dimensional convolutions to clinical tasks involving time-series data. This is due, in part, to their computational efficiency relative to recurrent neural networks and their ability to efficiently exploit certain temporal invariances (e.g., phase invariance). However, it is well-established that clinical d…

    Submitted 20 August, 2018; originally announced August 2018.

    Journal ref: PMLR - Machine Learning for Healthcare 2018

  22. arXiv:1808.04362  [pdf, other]

    cs.CV cs.LG stat.ML

    A Domain Guided CNN Architecture for Predicting Age from Structural Brain Images

    Authors: Pascal Sturmfels, Saige Rutherford, Mike Angstadt, Mark Peterson, Chandra Sripada, Jenna Wiens

    Abstract: Given the wide success of convolutional neural networks (CNNs) applied to natural images, researchers have begun to apply them to neuroimaging data. To date, however, exploration of novel CNN architectures tailored to neuroimaging data has been limited. Several recent works fail to leverage the 3D structure of the brain, instead treating the brain as a set of independent 2D slices. Approaches that…

    Submitted 11 August, 2018; originally announced August 2018.

    Comments: Machine Learning for Healthcare 2018

  23. Deep Multi-Output Forecasting: Learning to Accurately Predict Blood Glucose Trajectories

    Authors: Ian Fox, Lynn Ang, Mamta Jaiswal, Rodica Pop-Busui, Jenna Wiens

    Abstract: In many forecasting applications, it is valuable to predict not only the value of a signal at a certain time point in the future, but also the values leading up to that point. This is especially true in clinical applications, where the future state of the patient can be less important than the patient's overall trajectory. This requires multi-step forecasting, a forecasting variant where one aims…

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: KDD 2018

    Journal ref: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

  24. arXiv:1803.02940  [pdf]

    cs.LG

    The Advantage of Doubling: A Deep Reinforcement Learning Approach to Studying the Double Team in the NBA

    Authors: Jiaxuan Wang, Ian Fox, Jonathan Skaza, Nick Linck, Satinder Singh, Jenna Wiens

    Abstract: During the 2017 NBA playoffs, Celtics coach Brad Stevens was faced with a difficult decision when defending against the Cavaliers: "Do you double and risk giving up easy shots, or stay at home and do the best you can?" It's a tough call, but finding a good defensive strategy that effectively incorporates doubling can make all the difference in the NBA. In this paper, we analyze double teaming in t…

    Submitted 7 March, 2018; originally announced March 2018.

    Comments: Accepted to MIT Sloan Sports Analytics 2018. First two authors contributed equally

  25. arXiv:1803.00744  [pdf, other]

    cs.LG stat.ML

    Clinically Meaningful Comparisons Over Time: An Approach to Measuring Patient Similarity based on Subsequence Alignment

    Authors: Dev Goyal, Zeeshan Syed, Jenna Wiens

    Abstract: Longitudinal patient data has the potential to improve clinical risk stratification models for disease. However, chronic diseases that progress slowly over time are often heterogeneous in their clinical presentation. Patients may progress through disease stages at varying rates. This leads to pathophysiological misalignment over time, making it difficult to consistently compare patients in a clini…

    Submitted 2 March, 2018; originally announced March 2018.

  26. arXiv:1712.00643  [pdf, other]

    cs.SI physics.soc-ph

    Learning the Probability of Activation in the Presence of Latent Spreaders

    Authors: Maggie Makar, John Guttag, Jenna Wiens

    Abstract: When an infection spreads in a community, an individual's probability of becoming infected depends on both her susceptibility and exposure to the contagion through contact with others. While one often has knowledge regarding an individual's susceptibility, in many cases, whether or not an individual's contacts are contagious is unknown. We study the problem of predicting if an individual will adop…

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: To appear in AAAI-18

  27. Learning Credible Models

    Authors: Jiaxuan Wang, Jeeheh Oh, Haozhu Wang, Jenna Wiens

    Abstract: In many settings, it is important that a model be capable of providing reasons for its predictions (i.e., the model must be interpretable). However, the model's reasoning may not conform with well-established knowledge. In such cases, while interpretable, the model lacks credibility. In this work, we formally define credibility in the linear setting and focus on techniques for learning mo…

    Submitted 7 June, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

    Journal ref: KDD '18 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 2018

  28. Contextual Motifs: Increasing the Utility of Motifs using Contextual Data

    Authors: Ian Fox, Lynn Ang, Mamta Jaiswal, Rodica Pop-Busui, Jenna Wiens

    Abstract: Motifs are a powerful tool for analyzing physiological waveform data. Standard motif methods, however, ignore important contextual information (e.g., what the patient was doing at the time the data were collected). We hypothesize that these additional contextual data could increase the utility of motifs. Thus, we propose an extension to motifs, contextual motifs, that incorporates context. Recogni…

    Submitted 31 July, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: 10 pages, 7 figures, accepted for oral presentation at KDD '17

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017

  29. arXiv:1407.7514  [pdf, other]

    physics.flu-dyn cs.CE

    Simulating flexible fiber suspensions using a scalable immersed boundary algorithm

    Authors: Jeffrey K. Wiens, John M. Stockie

    Abstract: We present an approach for numerically simulating the dynamics of flexible fibers in a three-dimensional shear flow using a scalable immersed boundary (IB) algorithm based on Guermond and Minev's pseudo-compressible fluid solver. The fibers are treated as one-dimensional Kirchhoff rods that resist stretching, bending, and twisting, within the generalized IB framework. We perform a careful numerica…

    Submitted 28 July, 2014; originally announced July 2014.

    MSC Class: 74F10; 76D05; 76M12; 65Y05

    Journal ref: Computer Methods in Applied Mechanics and Engineering, 290:1-18, 2015

  30. arXiv:1305.3976  [pdf, other]

    cs.DC math.NA physics.flu-dyn

    An efficient parallel immersed boundary algorithm using a pseudo-compressible fluid solver

    Authors: Jeffrey K. Wiens, John M. Stockie

    Abstract: We propose an efficient algorithm for the immersed boundary method on distributed-memory architectures, with the computational complexity of a completely explicit method and excellent parallel scaling. The algorithm utilizes the pseudo-compressibility method recently proposed by Guermond and Minev [Comptes Rendus Mathematique, 348:581-585, 2010] that uses a directional splitting strategy to discre…

    Submitted 19 May, 2014; v1 submitted 17 May, 2013; originally announced May 2013.

    MSC Class: 74F10; 76M12; 76D27; 65Y05

    Journal ref: Journal of Computational Physics, 281:917-941, 2015