Skip to main content

Showing 1–31 of 31 results for author: Paige, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01616  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra , et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2403.01272  [pdf, other

    cs.LG stat.ML

    Can a Confident Prior Replace a Cold Posterior?

    Authors: Martin Marek, Brooks Paige, Pavel Izmailov

    Abstract: Benchmark datasets used for image classification tend to have very low levels of label noise. When Bayesian neural networks are trained on these datasets, they often underfit, misrepresenting the aleatoric uncertainty of the data. A common solution is to cool the posterior, which improves fit to the training data but is challenging to interpret from a Bayesian perspective. We explore whether poste… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  3. arXiv:2402.03008  [pdf, other

    stat.ML cs.LG stat.CO

    Diffusive Gibbs Sampling

    Authors: Wenlin Chen, Mingtian Zhang, Brooks Paige, José Miguel Hernández-Lobato, David Barber

    Abstract: The inadequate mixing of conventional Markov Chain Monte Carlo (MCMC) methods for multi-modal distributions presents a significant challenge in practical applications such as Bayesian inference and molecular dynamics. Addressing this, we propose Diffusive Gibbs Sampling (DiGS), an innovative family of sampling methods designed for effective sampling from distributions characterized by distant and… ▽ More

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted for publication at ICML 2024. Code available: https://github.com/Wenlin-Chen/DiGS

  4. arXiv:2311.01198  [pdf, other

    cs.LG stat.ML

    Gaussian Processes on Cellular Complexes

    Authors: Mathieu Alain, So Takao, Brooks Paige, Marc Peter Deisenroth

    Abstract: In recent years, there has been considerable interest in develo** machine learning models on graphs in order to account for topological inductive biases. In particular, recent attention was given to Gaussian processes on such structures since they can additionally account for uncertainty. However, graphs are limited to modelling relations between two vertices. In this paper, we go beyond this dy… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  5. arXiv:2305.11650  [pdf, other

    stat.ML cs.LG

    Moment Matching Denoising Gibbs Sampling

    Authors: Mingtian Zhang, Alex Hawkins-Hooker, Brooks Paige, David Barber

    Abstract: Energy-Based Models (EBMs) offer a versatile framework for modeling complex data distributions. However, training and sampling from EBMs continue to pose significant challenges. The widely-used Denoising Score Matching (DSM) method for scalable EBM training suffers from inconsistency issues, causing the energy model to learn a `noisy' data distribution. In this work, we propose an efficient sampli… ▽ More

    Submitted 19 March, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  6. arXiv:2209.07396  [pdf, other

    stat.ML cs.LG

    Towards Healing the Blindness of Score Matching

    Authors: Mingtian Zhang, Oscar Key, Peter Hayes, David Barber, Brooks Paige, François-Xavier Briol

    Abstract: Score-based divergences have been widely used in machine learning and statistics applications. Despite their empirical success, a blindness problem has been observed when using these for multi-modal distributions. In this work, we discuss the blindness problem and propose a new family of divergences that can mitigate the blindness problem. We illustrate our proposed divergence in the context of de… ▽ More

    Submitted 15 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  7. arXiv:2205.14539  [pdf, other

    stat.ML cs.LG

    Improving VAE-based Representation Learning

    Authors: Mingtian Zhang, Tim Z. Xiao, Brooks Paige, David Barber

    Abstract: Latent variable models like the Variational Auto-Encoder (VAE) are commonly used to learn representations of images. However, for downstream tasks like semantic classification, the representations learned by VAE are less competitive than other non-latent variable models. This has led to some speculations that latent variable models may be fundamentally unsuitable for representation learning. In th… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  8. arXiv:2112.03235  [pdf, other

    cs.AI cs.CE cs.LG cs.MS

    Simulation Intelligence: Towards a New Generation of Scientific Methods

    Authors: Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

    Abstract: The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simul… ▽ More

    Submitted 27 November, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  9. arXiv:2111.04558  [pdf, other

    stat.ML cs.LG

    Fast and Scalable Spike and Slab Variable Selection in High-Dimensional Gaussian Processes

    Authors: Hugh Dance, Brooks Paige

    Abstract: Variable selection in Gaussian processes (GPs) is typically undertaken by thresholding the inverse lengthscales of automatic relevance determination kernels, but in high-dimensional datasets this approach can be unreliable. A more probabilistically principled alternative is to use spike and slab priors and infer a posterior probability of variable inclusion. However, existing implementations in GP… ▽ More

    Submitted 24 February, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022)

  10. arXiv:2106.05238  [pdf, other

    cs.LG cs.CV eess.SP stat.ML

    I Don't Need u: Identifiable Non-Linear ICA Without Side Information

    Authors: Matthew Willetts, Brooks Paige

    Abstract: In this paper, we investigate the algorithmic stability of unsupervised representation learning with deep generative models, as a function of repeated re-training on the same input data. Algorithms for learning low dimensional linear representations -- for example principal components analysis (PCA), or linear independent components analysis (ICA) -- come with guarantees that they will always reve… ▽ More

    Submitted 4 July, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 10 pages plus appendix

  11. arXiv:2012.11522  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    Barking up the right tree: an approach to search over molecule synthesis DAGs

    Authors: John Bradshaw, Brooks Paige, Matt J. Kusner, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: When designing new molecules with particular properties, it is not only important what to make but crucially how to make it. These instructions form a synthesis directed acyclic graph (DAG), describing how a large vocabulary of simple building blocks can be recursively combined through chemical reactions to create more complicated molecules of interest. In contrast, many current deep generative mo… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: To appear in Advances in Neural Information Processing Systems 2020

  12. arXiv:2012.02089  [pdf, other

    q-bio.BM cs.LG

    Bayesian Graph Neural Networks for Molecular Property Prediction

    Authors: George Lamb, Brooks Paige

    Abstract: Graph neural networks for molecular property prediction are frequently underspecified by data and fail to generalise to new scaffolds at test time. A potential solution is Bayesian learning, which can capture our uncertainty in the model parameters. This study benchmarks a set of Bayesian methods applied to a directed MPNN, using the QM9 regression dataset. We find that capturing uncertainty in bo… ▽ More

    Submitted 25 November, 2020; originally announced December 2020.

    Comments: Presented at NeurIPS 2020 Machine Learning for Molecules workshop

  13. arXiv:2011.12203  [pdf, other

    cs.LG cs.AI

    Making Graph Neural Networks Worth It for Low-Data Molecular Machine Learning

    Authors: Aneesh Pappu, Brooks Paige

    Abstract: Graph neural networks have become very popular for machine learning on molecules due to the expressive power of their learnt representations. However, molecular machine learning is a classically low-data regime and it isn't clear that graph neural networks can avoid overfitting in low-resource settings. In contrast, fingerprint methods are the traditional standard for low-data environments due to… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  14. arXiv:2010.02311  [pdf, other

    cs.LG stat.ML

    Goal-directed Generation of Discrete Structures with Conditional Generative Models

    Authors: Amina Mollaysa, Brooks Paige, Alexandros Kalousis

    Abstract: Despite recent advances, goal-directed generation of structured discrete data remains challenging. For problems such as program synthesis (generating source code) and materials design (generating molecules), finding examples which satisfy desired constraints or exhibit desired properties is difficult. In practice, expensive heuristic search or reinforcement learning algorithms are often employed.… ▽ More

    Submitted 23 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

  15. arXiv:2007.01179  [pdf, other

    cs.LG stat.ML

    Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

    Authors: Yuge Shi, Brooks Paige, Philip H. S. Torr, N. Siddharth

    Abstract: Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language. While it has proven effective for learning generalisable representations, the training of such models often requires a large amount of "related" multimodal data that shares commonality, which can be expensive to come by… ▽ More

    Submitted 21 April, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  16. arXiv:2002.07766  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Bijective Feature Maps for Linear ICA

    Authors: Alexander Camuto, Matthew Willetts, Brooks Paige, Chris Holmes, Stephen Roberts

    Abstract: Separating high-dimensional data like images into independent latent factors, i.e independent component analysis (ICA), remains an open research problem. As we show, existing probabilistic deep generative models (DGMs), which are tailor-made for image data, underperform on non-linear ICA tasks. To address this, we propose a DGM which combines bijective feature maps with a linear ICA model to learn… ▽ More

    Submitted 29 January, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 8 pages

    Journal ref: AISTATS 2021

  17. arXiv:1911.03393  [pdf, other

    stat.ML cs.LG

    Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models

    Authors: Yuge Shi, N. Siddharth, Brooks Paige, Philip H. S. Torr

    Abstract: Learning generative models that span multiple data modalities, such as vision and language, is often motivated by the desire to learn more useful, generalisable representations that faithfully capture common underlying factors between the modalities. In this work, we characterise successful learning of such models as the fulfillment of four criteria: i) implicit latent decomposition into shared an… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

  18. arXiv:1911.02624  [pdf, other

    cs.LG cs.NE cs.PL stat.ML

    Data Generation for Neural Programming by Example

    Authors: Judith Clymo, Haik Manukian, Nathanaël Fijalkow, Adrià Gascón, Brooks Paige

    Abstract: Programming by example is the problem of synthesizing a program from a small set of input / output pairs. Recent works applying machine learning methods to this task show promise, but are typically reliant on generating synthetic examples for training. A particular challenge lies in generating meaningful sets of inputs and outputs, which well-characterize a given program and accurately demonstrate… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

  19. arXiv:1906.05221  [pdf, other

    cs.LG physics.comp-ph stat.ML

    A Model to Search for Synthesizable Molecules

    Authors: John Bradshaw, Brooks Paige, Matt J. Kusner, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) re… ▽ More

    Submitted 4 December, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: To appear in Advances in Neural Information Processing Systems 2019

  20. arXiv:1809.10756  [pdf, other

    stat.ML cs.AI cs.LG cs.PL

    An Introduction to Probabilistic Programming

    Authors: Jan-Willem van de Meent, Brooks Paige, Hongseok Yang, Frank Wood

    Abstract: This book is a graduate-level introduction to probabilistic programming. It not only provides a thorough background for anyone wishing to use a probabilistic programming system, but also introduces the techniques needed to design and build these systems. It is aimed at people who have an undergraduate-level understanding of either or, ideally, both probabilistic machine learning and programming la… ▽ More

    Submitted 19 October, 2021; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Under review at Foundations and Trends in Machine Learning

  21. arXiv:1807.07155  [pdf, other

    econ.EM cs.CV

    Take a Look Around: Using Street View and Satellite Images to Estimate House Prices

    Authors: Stephen Law, Brooks Paige, Chris Russell

    Abstract: When an individual purchases a home, they simultaneously purchase its structural features, its accessibility to work, and the neighborhood amenities. Some amenities, such as air quality, are measurable while others, such as the prestige or the visual impression of a neighborhood, are difficult to quantify. Despite the well-known impacts intangible housing features have on house prices, limited att… ▽ More

    Submitted 21 October, 2019; v1 submitted 18 July, 2018; originally announced July 2018.

    Comments: published in ACM Transactions on Intelligent Systems and Technology (TIST) Volume 10 Issue 5, October 2019 Article No. 54

  22. arXiv:1805.10970  [pdf, other

    physics.chem-ph cs.LG stat.ML

    A Generative Model For Electron Paths

    Authors: John Bradshaw, Matt J. Kusner, Brooks Paige, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: Chemical reactions can be described as the stepwise redistribution of electrons in molecules. As such, reactions are often depicted using `arrow-pushing' diagrams which show this movement as a sequence of arrows. We propose an electron path prediction model (ELECTRO) to learn these sequences directly from raw reaction data. Instead of predicting product molecules directly from reactant molecules i… ▽ More

    Submitted 20 March, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

  23. arXiv:1804.02086  [pdf, other

    stat.ML cs.LG

    Structured Disentangled Representations

    Authors: Babak Esmaeili, Hao Wu, Sarthak Jain, Alican Bozkurt, N. Siddharth, Brooks Paige, Dana H. Brooks, Jennifer Dy, Jan-Willem van de Meent

    Abstract: Deep latent-variable models learn representations of high-dimensional data in an unsupervised manner. A number of recent efforts have focused on learning representations that disentangle statistically independent axes of variation by introducing modifications to the standard objective function. These approaches generally assume a simple diagonal Gaussian prior and as a result are not able to relia… ▽ More

    Submitted 12 December, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

  24. arXiv:1712.01664  [pdf, other

    stat.ML cs.LG

    Learning a Generative Model for Validity in Complex Discrete Structures

    Authors: David Janz, Jos van der Westhuizen, Brooks Paige, Matt J. Kusner, José Miguel Hernández-Lobato

    Abstract: Deep generative models have been successfully used to learn representations for high-dimensional discrete spaces by representing discrete objects as sequences and employing powerful sequence-based deep models. Unfortunately, these sequence-based models often produce invalid sequences: sequences which do not represent any underlying discrete structure; invalid sequences hinder the utility of such m… ▽ More

    Submitted 1 November, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: Conference paper at ICLR 2018. Code available online

  25. arXiv:1706.00400  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Disentangled Representations with Semi-Supervised Deep Generative Models

    Authors: N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

    Abstract: Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectur… ▽ More

    Submitted 13 November, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for publication at NIPS 2017

  26. arXiv:1611.07492  [pdf, other

    stat.ML cs.CV cs.LG

    Inducing Interpretable Representations with Variational Autoencoders

    Authors: N. Siddharth, Brooks Paige, Alban Desmaison, Jan-Willem Van de Meent, Frank Wood, Noah D. Goodman, Pushmeet Kohli, Philip H. S. Torr

    Abstract: We develop a framework for incorporating structured graphical models in the \emph{encoders} of variational autoencoders (VAEs) that allows us to induce interpretable representations through approximate variational inference. This allows us to both perform reasoning (e.g. classification) under the structural constraints of a given graphical model, and use deep generative models to deal with messy,… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  27. arXiv:1611.06863  [pdf, other

    stat.ML cs.LG

    Probabilistic structure discovery in time series data

    Authors: David Janz, Brooks Paige, Tom Rainforth, Jan-Willem van de Meent, Frank Wood

    Abstract: Existing methods for structure discovery in time series data construct interpretable, compositional kernels for Gaussian process regression models. While the learned Gaussian process model provides posterior mean and variance estimates, typically the structure is learned via a greedy optimization procedure. This restricts the space of possible solutions and leads to over-confident uncertainty esti… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  28. arXiv:1507.04635  [pdf, other

    stat.ML cs.AI

    Black-Box Policy Search with Probabilistic Programs

    Authors: Jan-Willem van de Meent, Brooks Paige, David Tolpin, Frank Wood

    Abstract: In this work, we explore how probabilistic programs can be used to represent policies in sequential decision problems. In this formulation, a probabilistic program is a black-box stochastic simulator for both the problem domain and the agent. We relate classic policy gradient techniques to recently introduced black-box variational methods which generalize to probabilistic program inference. We pre… ▽ More

    Submitted 4 August, 2016; v1 submitted 16 July, 2015; originally announced July 2015.

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (2016) 1195-1204

  29. arXiv:1502.07314  [pdf, other

    cs.AI

    Path Finding under Uncertainty through Probabilistic Inference

    Authors: David Tolpin, Brooks Paige, Jan Willem van de Meent, Frank Wood

    Abstract: We introduce a new approach to solving path-finding problems under uncertainty by representing them as probabilistic models and applying domain-independent inference algorithms to the models. This approach separates problem representation from the inference algorithm and provides a framework for efficient learning of path-finding policies. We evaluate the new approach on the Canadian Traveler Prob… ▽ More

    Submitted 8 June, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

  30. arXiv:1501.05677  [pdf, other

    cs.AI stat.ML

    Output-Sensitive Adaptive Metropolis-Hastings for Probabilistic Programs

    Authors: David Tolpin, Jan Willem van de Meent, Brooks Paige, Frank Wood

    Abstract: We introduce an adaptive output-sensitive Metropolis-Hastings algorithm for probabilistic models expressed as programs, Adaptive Lightweight Metropolis-Hastings (AdLMH). The algorithm extends Lightweight Metropolis-Hastings (LMH) by adjusting the probabilities of proposing random variables for modification to improve convergence of the program output. We show that AdLMH converges to the correct eq… ▽ More

    Submitted 5 May, 2015; v1 submitted 22 January, 2015; originally announced January 2015.

  31. arXiv:1403.0504  [pdf, other

    cs.AI cs.PL stat.ML

    A Compilation Target for Probabilistic Programming Languages

    Authors: Brooks Paige, Frank Wood

    Abstract: Forward inference techniques such as sequential Monte Carlo and particle Markov chain Monte Carlo for probabilistic programming can be implemented in any programming language by creative use of standardized operating system functionality including processes, forking, mutexes, and shared memory. Exploiting this we have defined, developed, and tested a probabilistic programming language intermediate… ▽ More

    Submitted 10 July, 2014; v1 submitted 3 March, 2014; originally announced March 2014.

    Comments: In Proceedings of the 31st International Conference on Machine Learning (ICML), 2014

    Journal ref: JMLR W&CP 32 (1) : 1935-1943, 2014