Skip to main content

Showing 1–21 of 21 results for author: Bennett, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.00099  [pdf, other

    cs.AI stat.ML

    Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

    Authors: Andrew Bennett, Nathan Kallus, Miruna Oprescu, Wen Sun, Kaiwen Wang

    Abstract: We study evaluating a policy under best- and worst-case perturbations to a Markov decision process (MDP), given transition observations from the original MDP, whether under the same or different policy. This is an important problem when there is the possibility of a shift between historical and future environments, due to e.g. unmeasured confounding, distributional shift, or an adversarial environ… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 40 pages, 1 figure

  2. arXiv:2311.03564  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Low-Rank MDPs with Continuous Action Spaces

    Authors: Andrew Bennett, Nathan Kallus, Miruna Oprescu

    Abstract: Low-Rank Markov Decision Processes (MDPs) have recently emerged as a promising framework within the domain of reinforcement learning (RL), as they allow for provably approximately correct (PAC) learning guarantees while also incorporating ML algorithms for representation learning. However, current methods for low-rank MDPs are limited in that they only consider finite action spaces, and give vacuo… ▽ More

    Submitted 1 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 25 pages, AISTATS 2024

    Journal ref: PMLR, Volume 238, 2024

  3. arXiv:2307.13793  [pdf, ps, other

    stat.ME cs.LG econ.EM math.ST stat.ML

    Source Condition Double Robust Inference on Functionals of Inverse Problems

    Authors: Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara

    Abstract: We consider estimation of parameters defined as linear functionals of solutions to linear inverse problems. Any such parameter admits a doubly robust representation that depends on the solution to a dual linear inverse problem, where the dual solution can be thought as a generalization of the inverse propensity function. We provide the first source condition double robust inference method that ens… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  4. RED CoMETS: An ensemble classifier for symbolically represented multivariate time series

    Authors: Luca A. Bennett, Zahraa S. Abdallah

    Abstract: Multivariate time series classification is a rapidly growing research field with practical applications in finance, healthcare, engineering, and more. The complexity of classifying multivariate time series data arises from its high dimensionality, temporal dependencies, and varying lengths. This paper introduces a novel ensemble classifier called RED CoMETS (Random Enhanced Co-eye for Multivariate… ▽ More

    Submitted 16 September, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted by AALTD 2023; fixed typos and minor error in Table 2

    Journal ref: In proceedings of the 8th Workshop on Advanced Analytics and Learning on Temporal Data (AALTD 2023), pages 76-91, 2023

  5. arXiv:2302.05404  [pdf, ps, other

    stat.ML cs.LG econ.EM math.ST stat.ME

    Minimax Instrumental Variable Regression and $L_2$ Convergence Guarantees without Identification or Closedness

    Authors: Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara

    Abstract: In this paper, we study nonparametric estimation of instrumental variable (IV) regressions. Recently, many flexible machine learning methods have been developed for instrumental variable estimation. However, these methods have at least one of the following limitations: (1) restricting the IV regression to be uniquely identified; (2) only obtaining estimation error rates in terms of pseudometrics (… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Under review

  6. arXiv:2211.05698  [pdf, other

    stat.ML cs.LG

    Probabilistic thermal stability prediction through sparsity promoting transformer representation

    Authors: Yevgen Zainchkovskyy, Jesper Ferkinghoff-Borg, Anja Bennett, Thomas Egebjerg, Nikolai Lorenzen, Per Jr. Greisen, Søren Hauberg, Carsten Stahlhut

    Abstract: Pre-trained protein language models have demonstrated significant applicability in different protein engineering task. A general usage of these pre-trained transformer models latent representation is to use a mean pool across residue positions to reduce the feature dimensions to further downstream tasks such as predicting bio-physics properties or other functional behaviours. In this paper we prov… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  7. arXiv:2210.14492  [pdf, other

    cs.LG cs.AI stat.ML

    Provable Safe Reinforcement Learning with Binary Feedback

    Authors: Andrew Bennett, Dipendra Misra, Nathan Kallus

    Abstract: Safety is a crucial necessity in many applications of reinforcement learning (RL), whether robotic, automotive, or medical. Many existing approaches to safe RL rely on receiving numeric safety feedback, but in many cases this feedback can only take binary values; that is, whether an action in a given state is safe or unsafe. This is particularly true when feedback comes from human experts. We ther… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  8. arXiv:2208.08291  [pdf, ps, other

    stat.ME econ.EM math.ST stat.ML

    Inference on Strongly Identified Functionals of Weakly Identified Functions

    Authors: Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara

    Abstract: In a variety of applications, including nonparametric instrumental variable (NPIV) analysis, proximal causal inference under unmeasured confounding, and missing-not-at-random data with shadow variables, we are interested in inference on a continuous linear functional (e.g., average causal effects) of nuisance function (e.g., NPIV regression) defined by conditional moment restrictions. These nuisan… ▽ More

    Submitted 30 June, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: This supersedes the previous version titled "Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation"

  9. arXiv:2207.13081  [pdf, other

    cs.LG stat.ML

    Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

    Authors: Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun

    Abstract: We study off-policy evaluation (OPE) for partially observable MDPs (POMDPs) with general function approximation. Existing methods such as sequential importance sampling estimators and fitted-Q evaluation suffer from the curse of horizon in POMDPs. To circumvent this problem, we develop a novel model-free OPE method by introducing future-dependent value functions that take future proxies as inputs.… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: This paper was accepted in NeurIPS 2023

  10. arXiv:2202.11796  [pdf, ps, other

    stat.ME stat.CO

    An expectation-maximization algorithm for estimating the parameters of the correlated binomial distribution

    Authors: Andrea Bennett, Min Wang

    Abstract: The correlated binomial (CB) distribution was proposed by Luceño (Computational Statistics $\&$ Data Analysis, 20, 1995, 511-520) as an alternative to the binomial distribution for the analysis of the data in the presence of correlations among events. Due to the complexity of the mixture likelihood of the model, it may be impossible to derive analytical expressions of the maximum likelihood estima… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 8 pages; 1 figure; Undergraduate Research

    MSC Class: 62F10

  11. arXiv:2202.08062  [pdf

    stat.ME

    Where the Model Frequently Meets the Road: Combining Statistical, Formal, and Case Study Methods

    Authors: Andrew Bennett, Bear F. Braumoeller

    Abstract: This paper analyzes the working or default assumptions researchers in the formal, statistical, and case study traditions typically hold regarding the sources of unexplained variance, the meaning of outliers, parameter values, human motivation, functional forms, time, and external validity. We argue that these working assumptions are often not essential to each method, and that these assumptions ca… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

  12. arXiv:2110.15332  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes

    Authors: Andrew Bennett, Nathan Kallus

    Abstract: In applications of offline reinforcement learning to observational data, such as in healthcare or education, a general concern is that observed actions might be affected by unobserved factors, inducing confounding and biasing estimates derived under the assumption of a perfect Markov decision process (MDP) model. Here we tackle this by considering off-policy evaluation in a partially observed MDP… ▽ More

    Submitted 22 March, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

  13. arXiv:2106.14436  [pdf, other

    stat.AP

    Malaria Risk Map** Using Routine Health System Incidence Data in Zambia

    Authors: Benjamin M. Taylor, Ricardo Andrade-Pacheco, Hugh Sturrock, Busiku Hamainza, Kafula Silumbe, John Miller, Thomas P. Eisele, Francois Rerolle, Hannah Slater, Adam Bennett

    Abstract: Improvements to Zambia's malaria surveillance system allow better monitoring of incidence and targetting of responses at refined spatial scales. As transmission decreases, understanding heterogeneity in risk at fine spatial scales becomes increasingly important. However, there are challenges in using health system data for high-resolution risk map**: health facilities have undefined and overlapp… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  14. arXiv:2012.09422  [pdf, ps, other

    cs.LG econ.EM math.ST stat.ML

    The Variational Method of Moments

    Authors: Andrew Bennett, Nathan Kallus

    Abstract: The conditional moment problem is a powerful formulation for describing structural causal parameters in terms of observables, a prominent example being instrumental variable regression. A standard approach reduces the problem to a finite set of marginal moment conditions and applies the optimally weighted generalized method of moments (OWGMM), but this requires we know a finite set of identifying… ▽ More

    Submitted 22 March, 2023; v1 submitted 17 December, 2020; originally announced December 2020.

  15. arXiv:2007.13893  [pdf, other

    cs.LG cs.AI stat.ML

    Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders

    Authors: Andrew Bennett, Nathan Kallus, Lihong Li, Ali Mousavi

    Abstract: Off-policy evaluation (OPE) in reinforcement learning is an important problem in settings where experimentation is limited, such as education and healthcare. But, in these very same settings, observed actions are often confounded by unobserved variables making OPE even more difficult. We study an OPE problem in an infinite-horizon, ergodic Markov decision process with unobserved confounders, where… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

  16. arXiv:2002.05153  [pdf, other

    cs.LG econ.EM math.ST stat.ML

    Efficient Policy Learning from Surrogate-Loss Classification Reductions

    Authors: Andrew Bennett, Nathan Kallus

    Abstract: Recent work on policy learning from observational data has highlighted the importance of efficient policy evaluation and has proposed reductions to weighted (cost-sensitive) classification. But, efficient policy evaluation need not yield efficient estimation of policy parameters. We consider the estimation problem given by a weighted surrogate-loss classification reduction of policy learning with… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

  17. arXiv:1908.01920  [pdf, ps, other

    stat.ML cs.LG

    Policy Evaluation with Latent Confounders via Optimal Balance

    Authors: Andrew Bennett, Nathan Kallus

    Abstract: Evaluating novel contextual bandit policies using logged data is crucial in applications where exploration is costly, such as medicine. But it usually relies on the assumption of no unobserved confounders, which is bound to fail in practice. We study the question of policy evaluation when we instead have proxies for the latent confounders and develop an importance weighting method that avoids fitt… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

  18. arXiv:1906.05912  [pdf, ps, other

    cs.LG stat.ML

    A Variational Autoencoder for Probabilistic Non-Negative Matrix Factorisation

    Authors: Steven Squires, Adam Prügel Bennett, Mahesan Niranjan

    Abstract: We introduce and demonstrate the variational autoencoder (VAE) for probabilistic non-negative matrix factorisation (PAE-NMF). We design a network which can perform non-negative matrix factorisation (NMF) and add in aspects of a VAE to make the coefficients of the latent space probabilistic. By restricting the weights in the final layer of the network to be non-negative and using the non-negative W… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  19. arXiv:1905.12495  [pdf, other

    stat.ML cs.LG econ.EM

    Deep Generalized Method of Moments for Instrumental Variable Analysis

    Authors: Andrew Bennett, Nathan Kallus, Tobias Schnabel

    Abstract: Instrumental variable analysis is a powerful tool for estimating causal effects when randomization or full control of confounders is not possible. The application of standard methods such as 2SLS, GMM, and more recent variants are significantly impeded when the causal effects are complex, the instruments are high-dimensional, and/or the treatment is high-dimensional. In this paper, we propose the… ▽ More

    Submitted 18 April, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Journal ref: Advances in Neural Information Processing Systems 32 (2019) 3564--3574

  20. arXiv:1902.01632  [pdf, ps, other

    cs.LG stat.ML

    Minimum description length as an objective function for non-negative matrix factorization

    Authors: Steven Squires, Adam Prugel Bennett, Mahesan Niranjan

    Abstract: Non-negative matrix factorization (NMF) is a dimensionality reduction technique which tends to produce a sparse representation of data. Commonly, the error between the actual and recreated matrices is used as an objective function, but this method may not produce the type of representation we desire as it allows for the complexity of the model to grow, constrained only by the size of the subspace… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  21. arXiv:1805.04164  [pdf

    q-bio.GN stat.ME

    Bivariate Causal Discovery and its Applications to Gene Expression and Imaging Data Analysis

    Authors: Rong Jiao, Nan Lin, Zixin Hu, David A Bennett, Li **, Momiao Xiong

    Abstract: The mainstream of research in genetics, epigenetics and imaging data analysis focuses on statistical association or exploring statistical dependence between variables. Despite their significant progresses in genetic research, understanding the etiology and mechanism of complex phenotypes remains elusive. Using association analysis as a major analytical platform for the complex data analysis is a k… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.