Skip to main content

Showing 1–21 of 21 results for author: Ruiz, F J R

.
  1. arXiv:2406.04824  [pdf, other

    cs.LG stat.ML

    FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch

    Authors: Virginia Aglietti, Ira Ktena, Jessica Schrouff, Eleni Sgouritsa, Francisco J. R. Ruiz, Alan Malek, Alexis Bellot, Silvia Chiappa

    Abstract: The sample efficiency of Bayesian optimization algorithms depends on carefully crafted acquisition functions (AFs) guiding the sequential collection of function evaluations. The best-performing AF can vary significantly across optimization problems, often requiring ad-hoc and problem-specific choices. This work tackles the challenge of designing novel AFs that perform well across a variety of expe… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2402.14396  [pdf, other

    quant-ph cs.LG

    Quantum Circuit Optimization with AlphaTensor

    Authors: Francisco J. R. Ruiz, Tuomas Laakkonen, Johannes Bausch, Matej Balog, Mohammadamin Barekatain, Francisco J. H. Heras, Alexander Novikov, Nathan Fitzpatrick, Bernardino Romera-Paredes, John van de Wetering, Alhussein Fawzi, Konstantinos Meichanetzidis, Pushmeet Kohli

    Abstract: A key challenge in realizing fault-tolerant quantum computers is circuit optimization. Focusing on the most expensive gates in fault-tolerant quantum computation (namely, the T gates), we address the problem of T-count optimization, i.e., minimizing the number of T gates that are needed to implement a given circuit. To achieve this, we develop AlphaTensor-Quantum, a method based on deep reinforcem… ▽ More

    Submitted 5 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 25 pages main paper + 19 pages appendix

  3. arXiv:2106.06189  [pdf, other

    stat.ML cs.LG cs.SI

    Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation

    Authors: Xiaohui Chen, Xu Han, Jia**g Hu, Francisco J. R. Ruiz, Li** Liu

    Abstract: A graph generative model defines a distribution over graphs. One type of generative model is constructed by autoregressive neural networks, which sequentially add nodes and edges to generate a graph. However, the likelihood of a graph under the autoregressive model is intractable, as there are numerous sequences leading to the given graph; this makes maximum likelihood estimation challenging. Inst… ▽ More

    Submitted 14 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  4. arXiv:2010.10436  [pdf, other

    stat.ML cs.LG math.ST

    VarGrad: A Low-Variance Gradient Estimator for Variational Inference

    Authors: Lorenz Richter, Ayman Boustati, Nikolas Nüsken, Francisco J. R. Ruiz, Ömer Deniz Akyildiz

    Abstract: We analyse the properties of an unbiased gradient estimator of the ELBO for variational inference, based on the score function method with leave-one-out control variates. We show that this gradient estimator can be obtained using a new loss, defined as the variance of the log-ratio between the exact posterior and the variational approximation, which we call the $\textit{log-variance loss}$. Under… ▽ More

    Submitted 29 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  5. arXiv:2010.01845  [pdf, other

    cs.LG stat.ML

    Unbiased Gradient Estimation for Variational Auto-Encoders using Coupled Markov Chains

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias, Taylan Cemgil, Arnaud Doucet

    Abstract: The variational auto-encoder (VAE) is a deep latent variable model that has two neural networks in an autoencoder-like architecture; one of them parameterizes the model's likelihood. Fitting its parameters via maximum likelihood (ML) is challenging since the computation of the marginal likelihood involves an intractable integral over the latent space; thus the VAE is trained instead by maximizing… ▽ More

    Submitted 2 June, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Journal ref: Conference on Uncertainty in Artificial Intelligence (UAI, 2021)

  6. arXiv:2009.03228  [pdf, other

    cs.LG cs.AI stat.ML

    Information Theoretic Meta Learning with Gaussian Processes

    Authors: Michalis K. Titsias, Francisco J. R. Ruiz, Sotirios Nikoloutsopoulos, Alexandre Galashov

    Abstract: We formulate meta learning using information theoretic concepts; namely, mutual information and the information bottleneck. The idea is to learn a stochastic representation or encoding of the task description, given by a training set, that is highly informative about predicting the validation set. By making use of variational approximations to the mutual information, we derive a general and tracta… ▽ More

    Submitted 5 July, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 15 pages, 2 figures

  7. arXiv:1910.04302  [pdf, other

    stat.ML cs.LG stat.ME

    Prescribed Generative Adversarial Networks

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei, Michalis K. Titsias

    Abstract: Generative adversarial networks (GANs) are a powerful approach to unsupervised learning. They have achieved state-of-the-art performance in the image domain. However, GANs are limited in two ways. They often learn distributions with low support---a phenomenon known as mode collapse---and they do not guarantee the existence of a probability density, which makes evaluating generalization using predi… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: Code for this paper can be found at https://github.com/adjidieng/PresGANs

  8. arXiv:1907.05545  [pdf, other

    cs.CL stat.ML

    The Dynamic Embedded Topic Model

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. For documents collected in sequence, dynamic topic models capture how these patterns vary over time. We develop the dynamic embedded topic model (D-ETM), a generative model of documents that combines dynamic latent Dirichlet allocation (D-LDA) and word embeddings. The D-ETM models each word with a categorical distribution par… ▽ More

    Submitted 10 October, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

  9. arXiv:1907.04907  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    Topic Modeling in Embedding Spaces

    Authors: Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

    Abstract: Topic modeling analyzes documents to learn meaningful patterns of words. However, existing topic models fail to learn interpretable topics when working with large and heavy-tailed vocabularies. To this end, we develop the Embedded Topic Model (ETM), a generative model of documents that marries traditional topic models with word embeddings. In particular, it models each word with a categorical dist… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: Code can be found at https://github.com/adjidieng/ETM

  10. arXiv:1905.04062  [pdf, other

    stat.ML cs.LG

    A Contrastive Divergence for Combining Variational Inference and MCMC

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias

    Abstract: We develop a method to combine Markov chain Monte Carlo (MCMC) and variational inference (VI), leveraging the advantages of both inference approaches. Specifically, we improve the variational distribution by running a few MCMC steps. To make inference tractable, we introduce the variational contrastive divergence (VCD), a new divergence that replaces the standard Kullback-Leibler (KL) divergence u… ▽ More

    Submitted 28 May, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: International Conference on Machine Learning (ICML 2019). 12 pages, 3 figures

  11. Poisson Multi-Bernoulli Map** Using Gibbs Sampling

    Authors: Maryam Fatemi, Karl Granström, Lennart Svensson, Francisco J. R. Ruiz, Lars Hammarstrand

    Abstract: This paper addresses the map** problem. Using a conjugate prior form, we derive the exact theoretical batch multi-object posterior density of the map given a set of measurements. The landmarks in the map are modeled as extended objects, and the measurements are described as a Poisson process, conditioned on the map. We use a Poisson process prior on the map and prove that the posterior distribut… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 14 pages, 6 figures

    Journal ref: IEEE Transactions on Signal Processing, Vol. 65, Issue 11, June 2017

  12. arXiv:1810.09261  [pdf, other

    eess.SP cs.IT cs.LG stat.ML

    Infinite Factorial Finite State Machine for Blind Multiuser Channel Estimation

    Authors: Francisco J. R. Ruiz, Isabel Valera, Lennart Svensson, Fernando Perez-Cruz

    Abstract: New communication standards need to deal with machine-to-machine communications, in which users may start or stop transmitting at any time in an asynchronous manner. Thus, the number of users is an unknown and time-varying parameter that needs to be accurately estimated in order to properly recover the symbols transmitted by all users in the system. In this paper, we address the problem of joint c… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: 15 pages, 15 figures

    Journal ref: IEEE Transactions on Cognitive Communications and Networking, June 2018, Vol 2, Issue 2, pages 177-191

  13. arXiv:1808.02078  [pdf, other

    stat.ML cs.LG

    Unbiased Implicit Variational Inference

    Authors: Michalis K. Titsias, Francisco J. R. Ruiz

    Abstract: We develop unbiased implicit variational inference (UIVI), a method that expands the applicability of variational inference by defining an expressive variational family. UIVI considers an implicit variational distribution obtained in a hierarchical manner using a simple reparameterizable distribution whose variational parameters are defined by arbitrarily flexible deep neural networks. Unlike prev… ▽ More

    Submitted 6 February, 2019; v1 submitted 6 August, 2018; originally announced August 2018.

    Comments: 9 pages, 3 figures

    Journal ref: Artificial Intelligence and Statistics (AISTATS 2019)

  14. arXiv:1802.04220  [pdf, other

    stat.ML cs.LG

    Augment and Reduce: Stochastic Inference for Large Categorical Distributions

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias, Adji B. Dieng, David M. Blei

    Abstract: Categorical distributions are ubiquitous in machine learning, e.g., in classification, language models, and recommendation systems. However, when the number of possible outcomes is very large, using categorical distributions becomes computationally expensive, as the complexity scales linearly with the number of outcomes. To address this problem, we propose augment and reduce (A&R), a method to all… ▽ More

    Submitted 7 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 11 pages, 2 figures

    Journal ref: Francisco J. R. Ruiz, Michalis K. Titsias, Adji B. Dieng, and David M. Blei. Augment and Reduce: Stochastic Inference for Large Categorical Distributions. International Conference on Machine Learning. Stockholm (Sweden), July 2018

  15. arXiv:1711.03560  [pdf, other

    stat.ML cs.LG econ.EM

    SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements

    Authors: Francisco J. R. Ruiz, Susan Athey, David M. Blei

    Abstract: We develop SHOPPER, a sequential probabilistic model of shop** data. SHOPPER uses interpretable components to model the forces that drive how a customer chooses products; in particular, we designed SHOPPER to capture how items interact with other items. We develop an efficient posterior inference algorithm to estimate these forces from large-scale data, and we analyze a large dataset from a majo… ▽ More

    Submitted 9 June, 2019; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: Published at Annals of Applied Statistics. 27 pages, 4 figures

  16. arXiv:1610.09037  [pdf, other

    stat.ME

    Model Criticism for Bayesian Causal Inference

    Authors: Dustin Tran, Francisco J. R. Ruiz, Susan Athey, David M. Blei

    Abstract: The goal of causal inference is to understand the outcome of alternative courses of action. However, all causal inference requires assumptions. Such assumptions can be more influential than in typical tasks for probabilistic modeling, and testing those assumptions is important to assess the validity of causal inference. We develop model criticism for Bayesian causal inference, building on the idea… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

  17. arXiv:1610.05683  [pdf, other

    stat.ML stat.ME

    Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

    Authors: Christian A. Naesseth, Francisco J. R. Ruiz, Scott W. Linderman, David M. Blei

    Abstract: Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization trick is applicable when we can simulate a random variable by applying a differentiable deterministic function on an auxiliary random variable whose distributi… ▽ More

    Submitted 12 February, 2020; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: An error in the von Mises distribution reparameterization in Table 2 has been corrected

  18. arXiv:1610.02287  [pdf, other

    stat.ML

    The Generalized Reparameterization Gradient

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias, David M. Blei

    Abstract: The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to optimize the variational objective. However, this technique does not easily apply to commonly used distributions such as beta or gamma without further approximations, and most practical applications of the reparameterization gradient fit Gaussian distributions. In this paper, we introduce the general… ▽ More

    Submitted 19 October, 2016; v1 submitted 7 October, 2016; originally announced October 2016.

    Comments: 16 pages, 15 figures, NIPS version

  19. arXiv:1608.00778  [pdf, other

    stat.ML cs.LG

    Exponential Family Embeddings

    Authors: Maja R. Rudolph, Francisco J. R. Ruiz, Stephan Mandt, David M. Blei

    Abstract: Word embeddings are a powerful approach for capturing semantic similarity among terms in a vocabulary. In this paper, we develop exponential family embeddings, a class of methods that extends the idea of word embeddings to other types of high-dimensional data. As examples, we studied neural data with real-valued observations, count data from a market basket analysis, and ratings data from a movie… ▽ More

    Submitted 21 November, 2016; v1 submitted 2 August, 2016; originally announced August 2016.

  20. arXiv:1603.01140  [pdf, other

    stat.ML

    Overdispersed Black-Box Variational Inference

    Authors: Francisco J. R. Ruiz, Michalis K. Titsias, David M. Blei

    Abstract: We introduce overdispersed black-box variational inference, a method to reduce the variance of the Monte Carlo estimator of the gradient in black-box variational inference. Instead of taking samples from the variational distribution, we use importance sampling to take samples from an overdispersed distribution in the same exponential family as the variational approximation. Our approach is general… ▽ More

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: 10 pages, 6 figures

  21. arXiv:1401.7620  [pdf, other

    stat.ML cs.LG

    Bayesian nonparametric comorbidity analysis of psychiatric disorders

    Authors: Francisco J. R. Ruiz, Isabel Valera, Carlos Blanco, Fernando Perez-Cruz

    Abstract: The analysis of comorbidity is an open and complex research field in the branch of psychiatry, where clinical experience and several studies suggest that the relation among the psychiatric disorders may have etiological and treatment implications. In this paper, we are interested in applying latent feature modeling to find the latent structure behind the psychiatric disorders that can help to exam… ▽ More

    Submitted 29 January, 2014; originally announced January 2014.

    Comments: Submitted to Journal of Machine Learning Research