Skip to main content

Showing 1–9 of 9 results for author: Cornish, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11814  [pdf, other

    stat.ML cs.LG math.CT

    Stochastic Neural Network Symmetrisation in Markov Categories

    Authors: Rob Cornish

    Abstract: We consider the problem of symmetrising a neural network along a group homomorphism: given a homomorphism $\varphi : H \to G$, we would like a procedure that converts $H$-equivariant neural networks into $G$-equivariant ones. We formulate this in terms of Markov categories, which allows us to consider neural networks whose outputs may be stochastic, but with measure-theoretic details abstracted aw… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2312.01457  [pdf, other

    stat.ML cs.LG stat.ME

    Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

    Authors: Muhammad Faaiz Taufiq, Arnaud Doucet, Rob Cornish, Jean-Francois Ton

    Abstract: Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using existing data without costly experimentation. However, current OPE methods, such as Inverse Probability Weighting (IPW) and Doubly Robust (DR) estimators, suffer from high variance, particularly in cases of low overlap between target and behavior policies or large action and context spaces. In this paper,… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Conference on Neural Information Processing Systems (NeurIPS 2023)

  3. arXiv:2301.07210  [pdf, other

    stat.ME cs.CE cs.LG stat.AP

    Causal Falsification of Digital Twins

    Authors: Rob Cornish, Muhammad Faaiz Taufiq, Arnaud Doucet, Chris Holmes

    Abstract: Digital twins are virtual systems designed to predict how a real-world process will evolve in response to interventions. This modelling paradigm holds substantial promise in many applications, but rigorous procedures for assessing their accuracy are essential for safety-critical settings. We consider how to assess the accuracy of a digital twin using real-world data. We formulate this as causal in… ▽ More

    Submitted 2 November, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

  4. arXiv:2206.04405  [pdf, other

    stat.ML cs.LG

    Conformal Off-Policy Prediction in Contextual Bandits

    Authors: Muhammad Faaiz Taufiq, Jean-Francois Ton, Rob Cornish, Yee Whye Teh, Arnaud Doucet

    Abstract: Most off-policy evaluation methods for contextual bandits have focused on the expected outcome of a policy, which is estimated via methods that at best provide only asymptotic guarantees. However, in many applications, the expectation may not be the best measure of performance as it does not capture the variability of the outcome. In addition, particularly in safety-critical settings, stronger gua… ▽ More

    Submitted 26 October, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Proceedings of 36th Conference on Neural Information Processing System (NeurIPS 2022)

  5. arXiv:2103.03532  [pdf, other

    stat.ML cs.LG

    Deep Generative Pattern-Set Mixture Models for Nonignorable Missingness

    Authors: Sahra Ghalebikesabi, Rob Cornish, Luke J. Kelly, Chris Holmes

    Abstract: We propose a variational autoencoder architecture to model both ignorable and nonignorable missing data using pattern-set mixtures as proposed by Little (1993). Our model explicitly learns to cluster the missing data into missingness pattern sets based on the observed data and missingness masks. Underpinning our approach is the assumption that the data distribution under missingness is probabilist… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: International Conference on Artificial Intelligence and Statistics (AISTATS)

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  6. arXiv:2007.05426  [pdf, other

    stat.ML cs.LG

    Variational Inference with Continuously-Indexed Normalizing Flows

    Authors: Anthony Caterini, Rob Cornish, Dino Sejdinovic, Arnaud Doucet

    Abstract: Continuously-indexed flows (CIFs) have recently achieved improvements over baseline normalizing flows on a variety of density estimation tasks. CIFs do not possess a closed-form marginal density, and so, unlike standard flows, cannot be plugged in directly to a variational inference (VI) scheme in order to produce a more expressive family of approximate posteriors. However, we show here how CIFs c… ▽ More

    Submitted 14 June, 2021; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at UAI 2021

  7. arXiv:1909.13833  [pdf, other

    stat.ML cs.LG

    Relaxing Bijectivity Constraints with Continuously Indexed Normalising Flows

    Authors: Rob Cornish, Anthony L. Caterini, George Deligiannidis, Arnaud Doucet

    Abstract: We show that normalising flows become pathological when used to model targets whose supports have complicated topologies. In this scenario, we prove that a flow must become arbitrarily numerically noninvertible in order to approximate the target closely. This result has implications for all flow-based models, and especially Residual Flows (ResFlows), which explicitly control the Lipschitz constant… ▽ More

    Submitted 23 April, 2021; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: Minor revision

  8. arXiv:1901.09881  [pdf, other

    stat.ML cs.LG

    Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets

    Authors: Robert Cornish, Paul Vanetti, Alexandre Bouchard-Côté, George Deligiannidis, Arnaud Doucet

    Abstract: Bayesian inference via standard Markov Chain Monte Carlo (MCMC) methods is too computationally intensive to handle large datasets, since the cost per step usually scales like $Θ(n)$ in the number of data points $n$. We propose the Scalable Metropolis-Hastings (SMH) kernel that exploits Gaussian concentration of the posterior to require processing on average only $O(1)$ or even $O(1/\sqrt{n})$ data… ▽ More

    Submitted 10 June, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  9. arXiv:1703.04782  [pdf, other

    cs.LG stat.ML

    Online Learning Rate Adaptation with Hypergradient Descent

    Authors: Atilim Gunes Baydin, Robert Cornish, David Martinez Rubio, Mark Schmidt, Frank Wood

    Abstract: We introduce a general method for improving the convergence rate of gradient-based optimizers that is easy to implement and works well in practice. We demonstrate the effectiveness of the method in a range of optimization problems by applying it to stochastic gradient descent, stochastic gradient descent with Nesterov momentum, and Adam, showing that it significantly reduces the need for the manua… ▽ More

    Submitted 25 February, 2018; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: 11 pages, 4 figures

    MSC Class: 68T05 ACM Class: G.1.6; I.2.6

    Journal ref: In Sixth International Conference on Learning Representations (ICLR), Vancouver, Canada, April 30 -- May 3, 2018. https://openreview.net/forum?id=BkrsAzWAb