Skip to main content

Showing 1–28 of 28 results for author: Shalit, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00806  [pdf, other

    cs.LG

    Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators

    Authors: Ori Linial, Guy Tennenholtz, Uri Shalit

    Abstract: In many reinforcement learning (RL) applications one cannot easily let the agent act in the world; this is true for autonomous vehicles, healthcare applications, and even some recommender systems, to name a few examples. Offline RL provides a way to train agents without real-world exploration, but is often faced with biases due to data distribution shifts, limited coverage, and incomplete represen… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2403.18668  [pdf

    cs.LG cs.AI cs.HC stat.ML

    Aiming for Relevance

    Authors: Bar Eini Porat, Danny Eytan, Uri Shalit

    Abstract: Vital signs are crucial in intensive care units (ICUs). They are used to track the patient's state and to identify clinically significant changes. Predicting vital sign trajectories is valuable for early detection of adverse events. However, conventional machine learning metrics like RMSE often fail to capture the true clinical relevance of such predictions. We introduce novel vital sign predictio… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 10 pages, 9 figures, AMIA Informatics 2024

  3. arXiv:2304.10577  [pdf, other

    cs.LG stat.ML

    B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding

    Authors: Miruna Oprescu, Jacob Dorn, Marah Ghoummaid, Andrew Jesson, Nathan Kallus, Uri Shalit

    Abstract: Estimating heterogeneous treatment effects from observational data is a crucial task across many fields, hel** policy and decision-makers take better actions. There has been recent progress on robust and efficient methods for estimating the conditional average treatment effect (CATE) function, but these methods often do not take into account the risk of hidden confounding, which could arbitraril… ▽ More

    Submitted 13 June, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 20 pages, 4 figures, ICML 2023

    Journal ref: PMLR 202 (2023) 26599-26618

  4. arXiv:2211.15724  [pdf, other

    cs.LG stat.ML

    Malign Overfitting: Interpolation Can Provably Preclude Invariance

    Authors: Yoav Wald, Gal Yona, Uri Shalit, Yair Carmon

    Abstract: Learned classifiers should often possess certain invariance properties meant to encourage fairness, robustness, or out-of-distribution generalization. However, multiple recent works empirically demonstrate that common invariance-inducing regularizers are ineffective in the over-parameterized regime, in which classifiers perfectly fit (i.e. interpolate) the training data. This suggests that the phe… ▽ More

    Submitted 3 July, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  5. arXiv:2205.15376  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning with a Terminator

    Authors: Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal

    Abstract: We present the problem of reinforcement learning with exogenous termination. We define the Termination Markov Decision Process (TerMDP), an extension of the MDP framework, in which episodes may be interrupted by an external non-Markovian observer. This formulation accounts for numerous real-world situations, such as a human interrupting an autonomous driving agent for reasons of discomfort. We lea… ▽ More

    Submitted 5 October, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022

  6. arXiv:2204.10022  [pdf, other

    cs.LG stat.ML

    Scalable Sensitivity and Uncertainty Analysis for Causal-Effect Estimates of Continuous-Valued Interventions

    Authors: Andrew Jesson, Alyson Douglas, Peter Manshausen, Maëlys Solal, Nicolai Meinshausen, Philip Stier, Yarin Gal, Uri Shalit

    Abstract: Estimating the effects of continuous-valued interventions from observational data is a critically important task for climate science, healthcare, and economics. Recent work focuses on designing neural network architectures and regularization functions to allow for scalable estimation of average and individual-level dose-response curves from high-dimensional, large-sample data. Such methodologies a… ▽ More

    Submitted 12 October, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: 33 pages

  7. arXiv:2111.02275  [pdf, other

    cs.LG stat.ML

    Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

    Authors: Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort, Andreas Kirsch, Uri Shalit, Yarin Gal

    Abstract: Estimating personalized treatment effects from high-dimensional observational data is essential in situations where experimental designs are infeasible, unethical, or expensive. Existing approaches rely on fitting deep models on outcomes observed for treated and control populations. However, when measuring individual outcomes is costly, as is the case of a tumor biopsy, a sample-efficient strategy… ▽ More

    Submitted 1 February, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 24 pages, 8 Figures, 5 tables, NeurIPS 2021

  8. arXiv:2110.06539  [pdf, other

    cs.LG cs.AI cs.RO

    On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

    Authors: Guy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit

    Abstract: We consider the problem of using expert data with unobserved confounders for imitation and reinforcement learning. We begin by defining the problem of learning from confounded expert data in a contextual MDP setup. We analyze the limitations of learning from such data with and without external reward, and propose an adjustment of standard imitation learning algorithms to fit this setup. We then di… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

  9. arXiv:2103.04850  [pdf, other

    cs.LG stat.ML

    Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding

    Authors: Andrew Jesson, Sören Mindermann, Yarin Gal, Uri Shalit

    Abstract: We study the problem of learning conditional average treatment effects (CATE) from high-dimensional, observational data with unobserved confounders. Unobserved confounders introduce ignorance -- a level of unidentifiability -- about an individual's response to treatment by inducing bias in CATE estimates. We present a new parametric interval estimator suited for high-dimensional data, that estimat… ▽ More

    Submitted 1 February, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 19 pages, 5 figures, ICML 2021

    Journal ref: PMLR 139 (2021) 4829-4838

  10. arXiv:2102.10395  [pdf, other

    cs.LG

    On Calibration and Out-of-domain Generalization

    Authors: Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

    Abstract: Out-of-domain (OOD) generalization is a significant challenge for machine learning models. Many techniques have been proposed to overcome this challenge, often focused on learning models with certain invariance properties. In this work, we draw a link between OOD performance and model calibration, arguing that calibration across multiple domains can be viewed as a special case of an invariant repr… ▽ More

    Submitted 11 January, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: 24 pages, 6 figures. Published at NeurIPS 2021. Change log for each version: v2 - major revision, main additions are a trainable calibration loss (CLOvE) and experiments with fine-tuning. v3 - minor revision, main changes are added background material and technical details to the supplementary, and a fix to lemma 1. v4 - corrected caption of Table 3 and standard deviations in Tables 2 and 3

  11. arXiv:2102.08208  [pdf, other

    stat.ML cs.LG

    Conditional Distributional Treatment Effect with Kernel Conditional Mean Embeddings and U-Statistic Regression

    Authors: Junhyung Park, Uri Shalit, Bernhard Schölkopf, Krikamol Muandet

    Abstract: We propose to analyse the conditional distributional treatment effect (CoDiTE), which, in contrast to the more common conditional average treatment effect (CATE), is designed to encode a treatment's distributional aspects beyond the mean. We first introduce a formal definition of the CoDiTE associated with a distance function between probability measures. Then we discuss the CoDiTE associated with… ▽ More

    Submitted 10 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  12. arXiv:2008.10936  [pdf, other

    stat.ML cs.LG

    Using Deep Networks for Scientific Discovery in Physiological Signals

    Authors: Tom Beer, Bar Eini-Porat, Sebastian Goodfellow, Danny Eytan, Uri Shalit

    Abstract: Deep neural networks (DNN) have shown remarkable success in the classification of physiological signals. In this study we propose a method for examining to what extent does a DNN's performance rely on rediscovering existing features of the signals, as opposed to discovering genuinely new features. Moreover, we offer a novel method of "removing" a hand-engineered feature from the network's hypothes… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: 2020 Machine Learning for Healthcare Conference

  13. arXiv:2007.00163  [pdf, other

    cs.LG stat.ML

    Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models

    Authors: Andrew Jesson, Sören Mindermann, Uri Shalit, Yarin Gal

    Abstract: Recommending the best course of action for an individual is a major application of individual-level causal effect estimation. This application is often needed in safety-critical domains such as healthcare, where estimating and communicating uncertainty to decision-makers is crucial. We introduce a practical approach for integrating uncertainty estimation into a class of state-of-the-art neural net… ▽ More

    Submitted 22 October, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

  14. arXiv:2006.14610  [pdf, other

    cs.CV

    A causal view of compositional zero-shot recognition

    Authors: Yuval Atzmon, Felix Kreuk, Uri Shalit, Gal Chechik

    Abstract: People easily recognize new visual categories that are new combinations of known components. This compositional generalization capacity is critical for learning in real-world domains like vision and language because the long tail of new combinations dominates the distribution. Unfortunately, learning systems struggle with compositional generalization because they often build on features that are c… ▽ More

    Submitted 1 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: (1) Accepted to NeurIPS 2020 (Spotlight) (2) Project page is at https://github.com/nv-research-israel/causal_comp (3) A video of our spotlight talk is at https://www.youtube.com/watch?v=IUAmwBylvyc

  15. arXiv:2006.06731  [pdf, other

    cs.LG cs.AI stat.ML

    Bandits with Partially Observable Confounded Data

    Authors: Guy Tennenholtz, Uri Shalit, Shie Mannor, Yonathan Efroni

    Abstract: We study linear contextual bandits with access to a large, confounded, offline dataset that was sampled from some fixed policy. We show that this problem is closely related to a variant of the bandit problem with side information. We construct a linear bandit algorithm that takes advantage of the projected information, and prove regret bounds. Our results demonstrate the ability to take advantage… ▽ More

    Submitted 10 August, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Published as a conference paper at UAI 2021

  16. arXiv:2005.13407  [pdf, other

    cs.CL cs.AI cs.LG

    CausaLM: Causal Model Explanation Through Counterfactual Language Models

    Authors: Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

    Abstract: Understanding predictions made by deep neural networks is notoriously difficult, but also crucial to their dissemination. As all machine learning based methods, they are as good as their training data, and can also capture unwanted biases. While there are tools that can help understand whether such biases exist, they do not distinguish between correlation and causation, and might be ill-suited for… ▽ More

    Submitted 12 November, 2022; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Our code and data are available at: https://amirfeder.github.io/CausaLM/ Accepted for publication in Computational Linguistics journal

  17. Generative ODE Modeling with Known Unknowns

    Authors: Ori Linial, Neta Ravid, Danny Eytan, Uri Shalit

    Abstract: In several crucial applications, domain knowledge is encoded by a system of ordinary differential equations (ODE), often stemming from underlying physical and biological processes. A motivating example is intensive care unit patients: the dynamics of vital physiological functions, such as the cardiovascular system with its associated variables (heart rate, cardiac contractility and output and vasc… ▽ More

    Submitted 30 March, 2021; v1 submitted 24 March, 2020; originally announced March 2020.

  18. arXiv:2001.07426  [pdf, other

    cs.LG stat.ML

    Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects

    Authors: Fredrik D. Johansson, Uri Shalit, Nathan Kallus, David Sontag

    Abstract: Practitioners in diverse fields such as healthcare, economics and education are eager to apply machine learning to improve decision making. The cost and impracticality of performing experiments and a recent monumental increase in electronic record kee** has brought attention to the problem of evaluating decisions based on non-experimental observational data. This is the setting of this work. In… ▽ More

    Submitted 31 July, 2023; v1 submitted 21 January, 2020; originally announced January 2020.

  19. arXiv:1910.00270  [pdf, other

    cs.LG stat.ML

    Robust Learning with the Hilbert-Schmidt Independence Criterion

    Authors: Daniel Greenfeld, Uri Shalit

    Abstract: We investigate the use of a non-parametric independence measure, the Hilbert-Schmidt Independence Criterion (HSIC), as a loss-function for learning robust regression and classification models. This loss-function encourages learning models where the distribution of the residuals between the label and the model prediction is statistically independent of the distribution of the instances themselves.… ▽ More

    Submitted 11 July, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Proceedings of the 37th International Conference on Machine Learning (ICML 2020)

  20. arXiv:1909.03739  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Off-Policy Evaluation in Partially Observable Environments

    Authors: Guy Tennenholtz, Shie Mannor, Uri Shalit

    Abstract: This work studies the problem of batch off-policy evaluation for Reinforcement Learning in partially observable environments. Off-policy evaluation under partial observability is inherently prone to bias, with risk of arbitrarily large errors. We define the problem of off-policy evaluation for Partially Observable Markov Decision Processes (POMDPs) and establish what we believe is the first off-po… ▽ More

    Submitted 24 November, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted to AAAI-2020

  21. arXiv:1907.07165  [pdf, other

    cs.LG cs.CV stat.ML

    Explaining Classifiers with Causal Concept Effect (CaCE)

    Authors: Yash Goyal, Amir Feder, Uri Shalit, Been Kim

    Abstract: How can we understand classification decisions made by deep neural networks? Many existing explainability methods rely solely on correlations and fail to account for confounding, which may result in potentially misleading explanations. To overcome this problem, we define the Causal Concept Effect (CaCE) as the causal effect of (the presence or absence of) a human-interpretable concept on a deep ne… ▽ More

    Submitted 28 February, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  22. arXiv:1810.11646  [pdf, other

    stat.ML cs.LG

    Removing Hidden Confounding by Experimental Grounding

    Authors: Nathan Kallus, Aahlad Manas Puli, Uri Shalit

    Abstract: Observational data is increasingly used as a means for making individual-level causal predictions and intervention recommendations. The foremost challenge of causal inference from observational data is hidden confounding, whose presence cannot be tested in data and can invalidate any causal conclusion. Experimental data does not suffer from confounding but is usually limited in both scope and scal… ▽ More

    Submitted 27 October, 2018; originally announced October 2018.

  23. arXiv:1705.08821  [pdf, other

    stat.ML cs.LG

    Causal Effect Inference with Deep Latent-Variable Models

    Authors: Christos Louizos, Uri Shalit, Joris Mooij, David Sontag, Richard Zemel, Max Welling

    Abstract: Learning individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient, is a problem of growing importance for policy makers. The most important aspect of inferring causal effects from observational data is the handling of confounders, factors that affect both an intervention and its outcome. A carefully designed observational study… ▽ More

    Submitted 6 November, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: Published as a conference paper at NIPS 2017

  24. arXiv:1609.09869  [pdf, other

    stat.ML cs.AI cs.LG

    Structured Inference Networks for Nonlinear State Space Models

    Authors: Rahul G. Krishnan, Uri Shalit, David Sontag

    Abstract: Gaussian state space models have been used for decades as generative models of sequential data. They admit an intuitive probabilistic interpretation, have a simple functional form, and enjoy widespread adoption. We introduce a unified algorithm to efficiently learn a broad class of linear and non-linear state space models, including variants where the emission and transition distributions are mode… ▽ More

    Submitted 5 December, 2016; v1 submitted 30 September, 2016; originally announced September 2016.

    Comments: To appear in the Thirty-First AAAI Conference on Artificial Intelligence, February 2017, 13 pages, 11 figures with supplement, changed to AAAI formatting style, added references

  25. arXiv:1606.03976  [pdf, other

    stat.ML cs.AI cs.LG

    Estimating individual treatment effect: generalization bounds and algorithms

    Authors: Uri Shalit, Fredrik D. Johansson, David Sontag

    Abstract: There is intense interest in applying machine learning to problems of causal inference in fields such as healthcare, economics and education. In particular, individual-level causal inference has important applications such as precision medicine. We give a new theoretical analysis and family of algorithms for predicting individual treatment effect (ITE) from observational data, under the assumption… ▽ More

    Submitted 16 May, 2017; v1 submitted 13 June, 2016; originally announced June 2016.

    Comments: Added name "TARNet" to refer to version with alpha = 0. Removed supp

  26. arXiv:1605.03661  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Representations for Counterfactual Inference

    Authors: Fredrik D. Johansson, Uri Shalit, David Sontag

    Abstract: Observational studies are rising in importance due to the widespread accumulation of data in fields such as healthcare, education, employment and ecology. We consider the task of answering counterfactual questions such as, "Would this patient have lower blood sugar had she received a different medication?". We propose a new algorithmic framework for counterfactual inference which brings together i… ▽ More

    Submitted 6 June, 2018; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: Appeared in ICML 2016

  27. arXiv:1511.05121  [pdf, other

    stat.ML cs.LG

    Deep Kalman Filters

    Authors: Rahul G. Krishnan, Uri Shalit, David Sontag

    Abstract: Kalman Filters are one of the most influential models of time-varying phenomena. They admit an intuitive probabilistic interpretation, have a simple functional form, and enjoy widespread adoption in a variety of disciplines. Motivated by recent variational methods for learning deep generative models, we introduce a unified algorithm to efficiently learn a broad spectrum of Kalman filters. Of parti… ▽ More

    Submitted 25 November, 2015; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: 17 pages, 14 figures: Fixed typo in Fig. 1(b) and added reference

  28. arXiv:1312.0624  [pdf, ps, other

    cs.LG stat.ML

    Efficient coordinate-descent for orthogonal matrices through Givens rotations

    Authors: Uri Shalit, Gal Chechik

    Abstract: Optimizing over the set of orthogonal matrices is a central component in problems like sparse-PCA or tensor decomposition. Unfortunately, such optimization is hard since simple operations on orthogonal matrices easily break orthogonality, and correcting orthogonality usually costs a large amount of computation. Here we propose a framework for optimizing orthogonal matrices, that is the parallel of… ▽ More

    Submitted 13 December, 2013; v1 submitted 2 December, 2013; originally announced December 2013.

    Comments: A shorter version of this paper will appear in the proceedings of the 31st International Conference for Machine Learning (ICML 2014)