Skip to main content

Showing 1–23 of 23 results for author: Terenin, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20062  [pdf, other

    cs.LG stat.ML

    Cost-aware Bayesian optimization via the Pandora's Box Gittins index

    Authors: Qian Xie, Raul Astudillo, Peter Frazier, Ziv Scully, Alexander Terenin

    Abstract: Bayesian optimization is a technique for efficiently optimizing unknown functions in a black-box manner. To handle practical settings where gathering data requires use of finite resources, it is desirable to explicitly incorporate function evaluation costs into Bayesian optimization policies. To understand how to do so, we develop a previously-unexplored connection between cost-aware Bayesian opti… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2310.20581  [pdf, other

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra… ▽ More

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  3. arXiv:2309.12269  [pdf, other

    cs.CL cs.CY stat.AP

    The Cambridge Law Corpus: A Dataset for Legal AI Research

    Authors: Andreas Östling, Holli Sargeant, Huiyuan Xie, Ludwig Bull, Alexander Terenin, Leif Jonsson, Måns Magnusson, Felix Steffek

    Abstract: We introduce the Cambridge Law Corpus (CLC), a dataset for legal AI research. It consists of over 250 000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. This paper presents the first release of the corpus, containing the raw text and meta-data. Together with the corpus, we provide annotations on case outcomes for 638 cases,… ▽ More

    Submitted 1 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Journal ref: Advances in Neural Information Processing Systems, Datasets and Benchmarks Track, 2023

  4. arXiv:2309.10918  [pdf, other

    stat.ML cs.LG math.ST

    Posterior Contraction Rates for Matérn Gaussian Processes on Riemannian Manifolds

    Authors: Paul Rosa, Viacheslav Borovitskiy, Alexander Terenin, Judith Rousseau

    Abstract: Gaussian processes are used in many machine learning applications that rely on uncertainty quantification. Recently, computational tools for working with these models in geometric settings, such as when inputs lie on a Riemannian manifold, have been developed. This raises the question: can these intrinsic models be shown theoretically to lead to better performance, compared to simply embedding all… ▽ More

    Submitted 29 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  5. arXiv:2309.00854  [pdf, other

    cs.RO cs.LG

    A Unifying Variational Framework for Gaussian Process Motion Planning

    Authors: Lucas Cosier, Rares Iordan, Sicelukwanda Zwane, Giovanni Franzese, James T. Wilson, Marc Peter Deisenroth, Alexander Terenin, Yasemin Bekiroglu

    Abstract: To control how a robot moves, motion planning algorithms must compute paths in high-dimensional state spaces while accounting for physical constraints related to motors and joints, generating smooth and stable motions, avoiding obstacles, and preventing collisions. A motion planning algorithm must therefore balance competing demands, and should ideally incorporate uncertainty to handle noise, mode… ▽ More

    Submitted 8 March, 2024; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: Code and supplementary video available at: https://github.com/luke-ck/vgpmp

    Journal ref: Artificial Intelligence and Statistics, 2024

  6. arXiv:2306.11589  [pdf, other

    cs.LG stat.ML

    Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

    Authors: Jihao Andreas Lin, Javier Antorán, Shreyas Padhy, David Janz, José Miguel Hernández-Lobato, Alexander Terenin

    Abstract: Gaussian processes are a powerful framework for quantifying uncertainty and for sequential decision-making but are limited by the requirement of solving linear systems. In general, this has a cubic cost in dataset size and is sensitive to conditioning. We explore stochastic gradient algorithms as a computationally efficient method of approximately solving these linear systems: we develop low-varia… ▽ More

    Submitted 15 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  7. arXiv:2301.13088  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Stationary Kernels and Gaussian Processes on Lie Groups and their Homogeneous Spaces II: non-compact symmetric spaces

    Authors: Iskander Azangulov, Andrei Smolensky, Alexander Terenin, Viacheslav Borovitskiy

    Abstract: Gaussian processes are arguably the most important class of spatiotemporal models within machine learning. They encode prior information about the modeled function and can be used for exact or approximate Bayesian learning. In many applications, particularly in physical sciences and engineering, but also in areas such as geostatistics and neuroscience, invariance to symmetries is one of the most f… ▽ More

    Submitted 1 July, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  8. arXiv:2210.07893  [pdf, other

    stat.ML cs.LG

    Numerically Stable Sparse Gaussian Processes via Minimum Separation using Cover Trees

    Authors: Alexander Terenin, David R. Burt, Artem Artemev, Seth Flaxman, Mark van der Wilk, Carl Edward Rasmussen, Hong Ge

    Abstract: Gaussian processes are frequently deployed as part of larger machine learning and decision-making systems, for instance in geospatial modeling, Bayesian optimization, or in latent Gaussian models. Within a system, the Gaussian process model needs to perform in a stable and reliable manner to ensure it interacts correctly with other parts of the system. In this work, we study the numerical stabilit… ▽ More

    Submitted 16 January, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

    Journal ref: Journal of Machine Learning Research, 2024

  9. arXiv:2208.14960  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Stationary Kernels and Gaussian Processes on Lie Groups and their Homogeneous Spaces I: the compact case

    Authors: Iskander Azangulov, Andrei Smolensky, Alexander Terenin, Viacheslav Borovitskiy

    Abstract: Gaussian processes are arguably the most important class of spatiotemporal models within machine learning. They encode prior information about the modeled function and can be used for exact or approximate Bayesian learning. In many applications, particularly in physical sciences and engineering, but also in areas such as geostatistics and neuroscience, invariance to symmetries is one of the most f… ▽ More

    Submitted 7 November, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

  10. arXiv:2202.10613  [pdf, other

    stat.ML cs.LG

    Gaussian Processes and Statistical Decision-making in Non-Euclidean Spaces

    Authors: Alexander Terenin

    Abstract: Bayesian learning using Gaussian processes provides a foundational framework for making decisions in a manner that balances what is known with what could be learned by gathering data. In this dissertation, we develop techniques for broadening the applicability of Gaussian processes. This is done in two ways. Firstly, we develop pathwise conditioning techniques for Gaussian processes, which allow o… ▽ More

    Submitted 28 April, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Journal ref: PhD Thesis, Imperial College London, 2022

  11. arXiv:2111.01460  [pdf, other

    cs.RO cs.LG

    Geometry-aware Bayesian Optimization in Robotics using Riemannian Matérn Kernels

    Authors: Noémie Jaquier, Viacheslav Borovitskiy, Andrei Smolensky, Alexander Terenin, Tamim Asfour, Leonel Rozo

    Abstract: Bayesian optimization is a data-efficient technique which can be used for control parameter tuning, parametric policy adaptation, and structure design in robotics. Many of these problems require optimization of functions defined on non-Euclidean domains like spheres, rotation groups, or spaces of positive-definite matrices. To do so, one must place a Gaussian process prior, or equivalently define… ▽ More

    Submitted 17 March, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Source code: https://github.com/NoemieJaquier/MaternGaBO, Video: https://youtu.be/6awfFRqP7wA

    Journal ref: Conference on Robot Learning, 2021

  12. arXiv:2110.14423  [pdf, other

    stat.ML cs.LG

    Vector-valued Gaussian Processes on Riemannian Manifolds via Gauge Independent Projected Kernels

    Authors: Michael Hutchinson, Alexander Terenin, Viacheslav Borovitskiy, So Takao, Yee Whye Teh, Marc Peter Deisenroth

    Abstract: Gaussian processes are machine learning models capable of learning unknown functions in a way that represents uncertainty, thereby facilitating construction of optimal decision-making systems. Motivated by a desire to deploy Gaussian processes in novel areas of science, a rapidly-growing line of research has focused on constructively extending these models to handle non-Euclidean domains, includin… ▽ More

    Submitted 25 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  13. arXiv:2102.11206  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Contact Dynamics using Physically Structured Neural Networks

    Authors: Andreas Hochlehnert, Alexander Terenin, Steindór Sæmundsson, Marc Peter Deisenroth

    Abstract: Learning physically structured representations of dynamical systems that include contact between different objects is an important problem for learning-based approaches in robotics. Black-box neural networks can learn to approximately represent discontinuous dynamics, but they typically require large quantities of data and often suffer from pathological behaviour when forecasting for longer time h… ▽ More

    Submitted 15 August, 2022; v1 submitted 22 February, 2021; originally announced February 2021.

    Journal ref: Artificial Intelligence and Statistics, 2021

  14. arXiv:2102.07115  [pdf, other

    stat.ML cs.LG

    Sliced Multi-Marginal Optimal Transport

    Authors: Samuel Cohen, Alexander Terenin, Yannik Pitcan, Brandon Amos, Marc Peter Deisenroth, K S Sesh Kumar

    Abstract: Multi-marginal optimal transport enables one to compare multiple probability measures, which increasingly finds application in multi-task learning problems. One practical limitation of multi-marginal transport is computational scalability in the number of measures, samples and dimensionality. In this work, we propose a multi-marginal optimal transport paradigm based on random one-dimensional proje… ▽ More

    Submitted 23 November, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  15. arXiv:2011.04026  [pdf, other

    stat.ML cs.LG math.ST

    Pathwise Conditioning of Gaussian Processes

    Authors: James T. Wilson, Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: As Gaussian processes are used to answer increasingly complex questions, analytic solutions become scarcer and scarcer. Monte Carlo methods act as a convenient bridge for connecting intractable mathematical expressions with actionable estimates via sampling. Conventional approaches for simulating Gaussian process posteriors view samples as draws from marginal distributions of process values at fin… ▽ More

    Submitted 30 July, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Journal ref: Journal of Machine Learning Research, 22(105):1-47, 2021

  16. arXiv:2010.15538  [pdf, other

    stat.ML cs.LG

    Matérn Gaussian Processes on Graphs

    Authors: Viacheslav Borovitskiy, Iskander Azangulov, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth, Nicolas Durrande

    Abstract: Gaussian processes are a versatile framework for learning unknown functions in a manner that permits one to utilize prior information about their properties. Although many different Gaussian process models are readily available when the input space is Euclidean, the choice is much more limited for Gaussian processes whose input space is an undirected graph. In this work, we leverage the stochastic… ▽ More

    Submitted 9 April, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  17. arXiv:2006.12648  [pdf, other

    cs.LG stat.ML

    Aligning Time Series on Incomparable Spaces

    Authors: Samuel Cohen, Giulia Luise, Alexander Terenin, Brandon Amos, Marc Peter Deisenroth

    Abstract: Dynamic time war** (DTW) is a useful method for aligning, comparing and combining time series, but it requires them to live in comparable spaces. In this work, we consider a setting in which time series live on different spaces without a sensible ground metric, causing DTW to become ill-defined. To alleviate this, we propose Gromov dynamic time war** (GDTW), a distance between time series on p… ▽ More

    Submitted 22 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  18. arXiv:2006.10160  [pdf, other

    stat.ML cs.LG

    Matérn Gaussian processes on Riemannian manifolds

    Authors: Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: Gaussian processes are an effective model class for learning unknown functions, particularly in settings where accurately representing predictive uncertainty is of key importance. Motivated by applications in the physical sciences, the widely-used Matérn class of Gaussian processes has recently been generalized to model functions whose domains are Riemannian manifolds, by re-expressing said proces… ▽ More

    Submitted 17 April, 2023; v1 submitted 17 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems, 2020

  19. arXiv:2002.09309  [pdf, other

    stat.ML cs.LG stat.CO

    Efficiently Sampling Functions from Gaussian Process Posteriors

    Authors: James T. Wilson, Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: Gaussian processes are the gold standard for many real-world modeling problems, especially in cases where a model's success hinges upon its ability to faithfully represent predictive uncertainty. These problems typically exist as parts of larger frameworks, wherein quantities of interest are ultimately defined by integrating over posterior distributions. These quantities are frequently intractable… ▽ More

    Submitted 16 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Machine Learning, 2020

  20. arXiv:1910.09349  [pdf, other

    stat.ML cs.LG

    Variational Integrator Networks for Physically Structured Embeddings

    Authors: Steindor Saemundsson, Alexander Terenin, Katja Hofmann, Marc Peter Deisenroth

    Abstract: Learning workable representations of dynamical systems is becoming an increasingly important problem in a number of application areas. By leveraging recent work connecting deep neural networks to systems of differential equations, we propose \emph{variational integrator networks}, a class of neural network architectures designed to preserve the geometric structure of physical systems. This class o… ▽ More

    Submitted 2 March, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Journal ref: Artificial Intelligence and Statistics, 2020

  21. arXiv:1906.02416  [pdf, other

    stat.ML cs.CL cs.IR cs.LG

    Sparse Parallel Training of Hierarchical Dirichlet Process Topic Models

    Authors: Alexander Terenin, Måns Magnusson, Leif Jonsson

    Abstract: To scale non-parametric extensions of probabilistic topic models such as Latent Dirichlet allocation to larger data sets, practitioners rely increasingly on parallel and distributed systems. In this work, we study data-parallel training for the hierarchical Dirichlet process (HDP) topic model. Based upon a representation of certain conditional distributions within an HDP, we propose a doubly spars… ▽ More

    Submitted 6 October, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Journal ref: Conference on Empirical Methods in Natural Language Processing, 2020

  22. arXiv:1711.06719  [pdf, other

    stat.ML cs.LG stat.CO

    Techniques for proving Asynchronous Convergence results for Markov Chain Monte Carlo methods

    Authors: Alexander Terenin, Eric P. Xing

    Abstract: Markov Chain Monte Carlo (MCMC) methods such as Gibbs sampling are finding widespread use in applied statistics and machine learning. These often lead to difficult computational problems, which are increasingly being solved on parallel and distributed systems such as compute clusters. Recent work has proposed running iterative algorithms such as gradient descent and MCMC in parallel asynchronously… ▽ More

    Submitted 3 June, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: Workshop on Advances in Approximate Bayesian Inference, 31st Conference on Neural Information Processing Systems, 2017

  23. GPU-accelerated Gibbs sampling: a case study of the Horseshoe Probit model

    Authors: Alexander Terenin, Shawfeng Dong, David Draper

    Abstract: Gibbs sampling is a widely used Markov chain Monte Carlo (MCMC) method for numerically approximating integrals of interest in Bayesian statistics and other mathematical sciences. Many implementations of MCMC methods do not extend easily to parallel computing environments, as their inherently sequential nature incurs a large synchronization cost. In the case study illustrated by this paper, we show… ▽ More

    Submitted 21 March, 2018; v1 submitted 15 August, 2016; originally announced August 2016.

    Journal ref: Statistics and Computing 29(2):301-310, 2019