Skip to main content

Showing 1–10 of 10 results for author: Gradu, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.17187  [pdf, other

    stat.ME cs.DS

    Clip-OGD: An Experimental Design for Adaptive Neyman Allocation in Sequential Experiments

    Authors: Jessica Dai, Paula Gradu, Christopher Harshaw

    Abstract: From clinical development of cancer therapies to investigations into partisan bias, adaptive sequential designs have become increasingly popular method for causal inference, as they offer the possibility of improved precision over their non-adaptive counterparts. However, even in simple settings (e.g. two treatments) the extent to which adaptive designs can improve precision is not sufficiently we… ▽ More

    Submitted 13 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  2. arXiv:2211.12638  [pdf, ps, other

    cs.LG math.OC stat.ML

    Projection-free Adaptive Regret with Membership Oracles

    Authors: Zhou Lu, Nataly Brukhim, Paula Gradu, Elad Hazan

    Abstract: In the framework of online convex optimization, most iterative algorithms require the computation of projections onto convex sets, which can be computationally expensive. To tackle this problem HK12 proposed the study of projection-free methods that replace projections with less expensive computations. The most common approach is based on the Frank-Wolfe method, that uses linear optimization compu… ▽ More

    Submitted 14 December, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

  3. arXiv:2208.05949  [pdf, other

    stat.ME cs.LG stat.ML

    Valid Inference after Causal Discovery

    Authors: Paula Gradu, Tijana Zrnic, Yixin Wang, Michael I. Jordan

    Abstract: Causal discovery and causal effect estimation are two fundamental tasks in causal inference. While many methods have been developed for each task individually, statistical challenges arise when applying these methods jointly: estimating causal effects after running causal discovery algorithms on the same data leads to "double dip**," invalidating the coverage guarantees of classical confidence i… ▽ More

    Submitted 20 March, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

  4. arXiv:2206.10524  [pdf, other

    cs.LG eess.SY

    Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control

    Authors: Katie Kang, Paula Gradu, Jason Choi, Michael Janner, Claire Tomlin, Sergey Levine

    Abstract: Learned models and policies can generalize effectively when evaluated within the distribution of the training data, but can produce unpredictable and erroneous outputs on out-of-distribution inputs. In order to avoid distribution shift when deploying learning-based control algorithms, we seek a mechanism to constrain the agent to states and actions that resemble those that it was trained on. In co… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  5. arXiv:2202.07890  [pdf, other

    cs.LG math.OC stat.ML

    Online Control of Unknown Time-Varying Dynamical Systems

    Authors: Edgar Minasyan, Paula Gradu, Max Simchowitz, Elad Hazan

    Abstract: We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is \emph{qualitatively harder} than that of either unknown time-invariant or known time-varying dynamics, and complement our negative results with algorithmic upper bounds in regimes where sublinear regret is possible. More specifically,… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

  6. arXiv:2111.10434  [pdf, other

    cs.LG

    Machine Learning for Mechanical Ventilation Control (Extended Abstract)

    Authors: Daniel Suo, Naman Agarwal, Wenhan Xia, Xinyi Chen, Udaya Ghai, Alexander Yu, Paula Gradu, Karan Singh, Cyril Zhang, Edgar Minasyan, Julienne LaChance, Tom Zajdel, Manuel Schottdorf, Daniel Cohen, Elad Hazan

    Abstract: Mechanical ventilation is one of the most widely used therapies in the ICU. However, despite broad application from anaesthesia to COVID-related life support, many injurious challenges remain. We frame these as a control problem: ventilators must let air in and out of the patient's lungs according to a prescribed trajectory of airway pressure. Industry-standard controllers, based on the PID method… ▽ More

    Submitted 23 December, 2021; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2021 - Extended Abstract. arXiv admin note: substantial text overlap with arXiv:2102.06779

  7. arXiv:2102.09968  [pdf, other

    cs.RO cs.LG

    Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking

    Authors: Paula Gradu, John Hallman, Daniel Suo, Alex Yu, Naman Agarwal, Udaya Ghai, Karan Singh, Cyril Zhang, Anirudha Majumdar, Elad Hazan

    Abstract: We present an open-source library of natively differentiable physics and robotics environments, accompanied by gradient-based control methods and a benchmark-ing suite. The introduced environments allow auto-differentiation through the simulation dynamics, and thereby permit fast training of controllers. The library features several popular environments, including classical control settings from O… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

  8. arXiv:2102.06779  [pdf, other

    cs.LG

    Machine Learning for Mechanical Ventilation Control

    Authors: Daniel Suo, Naman Agarwal, Wenhan Xia, Xinyi Chen, Udaya Ghai, Alexander Yu, Paula Gradu, Karan Singh, Cyril Zhang, Edgar Minasyan, Julienne LaChance, Tom Zajdel, Manuel Schottdorf, Daniel Cohen, Elad Hazan

    Abstract: We consider the problem of controlling an invasive mechanical ventilator for pressure-controlled ventilation: a controller must let air in and out of a sedated patient's lungs according to a trajectory of airway pressures specified by a clinician. Hand-tuned PID controllers and similar variants have comprised the industry standard for decades, yet can behave poorly by over- or under-shooting their… ▽ More

    Submitted 18 January, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

  9. arXiv:2008.05523  [pdf, other

    cs.LG math.OC stat.ML

    Non-Stochastic Control with Bandit Feedback

    Authors: Paula Gradu, John Hallman, Elad Hazan

    Abstract: We study the problem of controlling a linear dynamical system with adversarial perturbations where the only feedback available to the controller is the scalar loss, and the loss function itself is unknown. For this problem, with either a known or unknown system, we give an efficient sublinear regret algorithm. The main algorithmic difficulty is the dependence of the loss on past controls. To overc… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

  10. arXiv:2007.04393  [pdf, other

    cs.LG math.OC stat.ML

    Adaptive Regret for Control of Time-Varying Dynamics

    Authors: Paula Gradu, Elad Hazan, Edgar Minasyan

    Abstract: We consider the problem of online control of systems with time-varying linear dynamics. This is a general formulation that is motivated by the use of local linearization in control of nonlinear dynamical systems. To state meaningful guarantees over changing environments, we introduce the metric of {\it adaptive regret} to the field of control. This metric, originally studied in online learning, me… ▽ More

    Submitted 11 February, 2022; v1 submitted 8 July, 2020; originally announced July 2020.