Skip to main content

Showing 1–27 of 27 results for author: Lazaro-Gredilla, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19635  [pdf, other

    cs.LG cs.CV

    Model Predictive Simulation Using Structured Graphical Models and Transformers

    Authors: Xinghua Lou, Meet Dave, Shrinu Kushagra, Miguel Lazaro-Gredilla, Kevin Murphy

    Abstract: We propose an approach to simulating trajectories of multiple interacting agents (road users) based on transformers and probabilistic graphical models (PGMs), and apply it to the Waymo SimAgents challenge. The transformer baseline is based on the MTR model, which predicts multiple future trajectories conditioned on the past trajectories and static road layout features. We then improve upon these g… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Special Mention at the Waymo Sim Agents Challenge 2024

  2. arXiv:2406.17863  [pdf, other

    cs.AI stat.ML

    What type of inference is planning?

    Authors: Miguel Lázaro-Gredilla, Li Yang Ku, Kevin P. Murphy, Dileep George

    Abstract: Multiple types of inference are available for probabilistic graphical models, e.g., marginal, maximum-a-posteriori, and even marginal maximum-a-posteriori. Which one do researchers mean when they talk about "planning as inference"? There is no consistency in the literature, different types are used, and their ability to do planning is further entangled with specific approximations or additional co… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2401.05946  [pdf, other

    cs.LG cs.AI

    Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments

    Authors: Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel Lázaro-Gredilla

    Abstract: Despite their stellar performance on a wide range of tasks, including in-context tasks only revealed during inference, vanilla transformers and variants trained for next-token predictions (a) do not learn an explicit world model of their environment which can be flexibly queried and (b) cannot be used for planning or navigation. In this paper, we consider partially observed environments (POEs), wh… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  4. arXiv:2307.01201  [pdf, other

    cs.CL cs.AI

    Schema-learning and rebinding as mechanisms of in-context learning and emergence

    Authors: Sivaramakrishnan Swaminathan, Antoine Dedieu, Rajkumar Vasudeva Raju, Murray Shanahan, Miguel Lazaro-Gredilla, Dileep George

    Abstract: In-context learning (ICL) is one of the most powerful and most unexpected capabilities to emerge in recent transformer-based large language models (LLMs). Yet the mechanisms that underlie it are poorly understood. In this paper, we demonstrate that comparable ICL capabilities can be acquired by an alternative sequence prediction learning method using clone-structured causal graphs (CSCGs). Moreove… ▽ More

    Submitted 15 June, 2023; originally announced July 2023.

  5. arXiv:2303.07397  [pdf, other

    cs.LG cs.AI

    Fast exploration and learning of latent graphs with aliased observations

    Authors: Miguel Lazaro-Gredilla, Ishan Deshpande, Sivaramakrishnan Swaminathan, Meet Dave, Dileep George

    Abstract: We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as… ▽ More

    Submitted 25 September, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  6. arXiv:2302.07350  [pdf, other

    cs.AI cs.LG q-bio.NC

    Graph schemas as abstractions for transfer learning, inference, and planning

    Authors: J. Swaroop Guntupalli, Rajkumar Vasudeva Raju, Shrinu Kushagra, Carter Wendelken, Danny Sawyer, Ishan Deshpande, Guangyao Zhou, Miguel Lázaro-Gredilla, Dileep George

    Abstract: Transferring latent structure from one environment or problem to another is a mechanism by which humans and animals generalize with very little data. Inspired by cognitive and neurobiological insights, we propose graph schemas as a mechanism of abstraction for transfer learning. Graph schemas start with latent graph learning where perceptually aliased observations are disambiguated in the latent s… ▽ More

    Submitted 12 December, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 14 pages, 4 figures in main paper, 13 pages and 8 figures in appendix

  7. arXiv:2302.03744  [pdf, other

    cs.CV

    3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation

    Authors: Guangyao Zhou, Nishad Gothoskar, Lirui Wang, Joshua B. Tenenbaum, Dan Gutfreund, Miguel Lázaro-Gredilla, Dileep George, Vikash K. Mansinghka

    Abstract: The ability to perceive and understand 3D scenes is crucial for many applications in computer vision and robotics. Inverse graphics is an appealing approach to 3D scene understanding that aims to infer the 3D scene structure from 2D images. In this paper, we introduce probabilistic modeling to the inverse graphics framework to quantify uncertainty and achieve robustness in 6D pose estimation tasks… ▽ More

    Submitted 6 September, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: ICCV 2023 camera ready

  8. arXiv:2302.00099  [pdf, other

    cs.LG

    Learning noisy-OR Bayesian Networks with Max-Product Belief Propagation

    Authors: Antoine Dedieu, Guangyao Zhou, Dileep George, Miguel Lazaro-Gredilla

    Abstract: Noisy-OR Bayesian Networks (BNs) are a family of probabilistic graphical models which express rich statistical dependencies in binary data. Variational inference (VI) has been the main method proposed to learn noisy-OR BNs with complex latent structures (Jaakkola & Jordan, 1999; Ji et al., 2020; Buhai et al., 2020). However, the proposed VI approaches either (a) use a recognition network with stan… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  9. arXiv:2301.10289  [pdf, other

    cs.AI

    PushWorld: A benchmark for manipulation planning with tools and movable obstacles

    Authors: Ken Kansky, Skanda Vaidyanath, Scott Swingle, Xinghua Lou, Miguel Lazaro-Gredilla, Dileep George

    Abstract: While recent advances in artificial intelligence have achieved human-level performance in environments like Starcraft and Go, many physical reasoning tasks remain challenging for modern algorithms. To date, few algorithms have been evaluated on physical tasks that involve manipulating objects when movable obstacles are present and when tools must be used to perform the manipulation. To promote res… ▽ More

    Submitted 1 February, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  10. arXiv:2202.04110  [pdf, other

    cs.LG cs.AI stat.ML

    PGMax: Factor Graphs for Discrete Probabilistic Graphical Models and Loopy Belief Propagation in JAX

    Authors: Guangyao Zhou, Antoine Dedieu, Nishanth Kumar, Wolfgang Lehrach, Miguel Lázaro-Gredilla, Shrinu Kushagra, Dileep George

    Abstract: PGMax is an open-source Python package for (a) easily specifying discrete Probabilistic Graphical Models (PGMs) as factor graphs; and (b) automatically running efficient and scalable loopy belief propagation (LBP) in JAX. PGMax supports general factor graphs with tractable factors, and leverages modern accelerators like GPUs for inference. Compared with existing alternatives, PGMax obtains higher-… ▽ More

    Submitted 24 March, 2023; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Update authors list

  11. arXiv:2202.03697  [pdf, other

    cs.RO

    DURableVS: Data-efficient Unsupervised Recalibrating Visual Servoing via online learning in a structured generative model

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Yasemin Bekiroglu, Abhishek Agarwal, Joshua B. Tenenbaum, Vikash K. Mansinghka, Dileep George

    Abstract: Visual servoing enables robotic systems to perform accurate closed-loop control, which is required in many applications. However, existing methods either require precise calibration of the robot kinematic model and cameras or use neural architectures that require large amounts of data to train. In this work, we present a method for unsupervised learning of visual servoing that does not require any… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  12. arXiv:2112.03371  [pdf, other

    cs.LG cs.CV

    Graphical Models with Attention for Context-Specific Independence and an Application to Perceptual Grou**

    Authors: Guangyao Zhou, Wolfgang Lehrach, Antoine Dedieu, Miguel Lázaro-Gredilla, Dileep George

    Abstract: Discrete undirected graphical models, also known as Markov Random Fields (MRFs), can flexibly encode probabilistic interactions of multiple variables, and have enjoyed successful applications to a wide range of problems. However, a well-known yet little studied limitation of discrete MRFs is that they cannot capture context-specific independence (CSI). Existing methods require carefully developed… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

  13. arXiv:2111.02458  [pdf, other

    stat.ML cs.LG

    Perturb-and-max-product: Sampling and learning in discrete energy-based models

    Authors: Miguel Lazaro-Gredilla, Antoine Dedieu, Dileep George

    Abstract: Perturb-and-MAP offers an elegant approach to approximately sample from a energy-based model (EBM) by computing the maximum-a-posteriori (MAP) configuration of a perturbed version of the model. Sampling in turn enables learning. However, this line of research has been hindered by the general intractability of the MAP computation. Very few works venture outside tractable models, and when they do, t… ▽ More

    Submitted 5 November, 2021; v1 submitted 3 November, 2021; originally announced November 2021.

  14. arXiv:2012.01744  [pdf, other

    stat.ML cs.LG

    Sample-Efficient L0-L2 Constrained Structure Learning of Sparse Ising Models

    Authors: Antoine Dedieu, Miguel Lázaro-Gredilla, Dileep George

    Abstract: We consider the problem of learning the underlying graph of a sparse Ising model with $p$ nodes from $n$ i.i.d. samples. The most recent and best performing approaches combine an empirical loss (the logistic regression loss or the interaction screening loss) with a regularizer (an L1 penalty or an L1 constraint). This results in a convex problem that can be solved separately for each node of the g… ▽ More

    Submitted 15 September, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

  15. arXiv:2006.06803  [pdf, other

    stat.ML cs.LG

    Query Training: Learning a Worse Model to Infer Better Marginals in Undirected Graphical Models with Hidden Variables

    Authors: Miguel Lázaro-Gredilla, Wolfgang Lehrach, Nishad Gothoskar, Guangyao Zhou, Antoine Dedieu, Dileep George

    Abstract: Probabilistic graphical models (PGMs) provide a compact representation of knowledge that can be queried in a flexible way: after learning the parameters of a graphical model once, new probabilistic queries can be answered at test time without retraining. However, when using undirected PGMS with hidden variables, two sources of error typically compound in all but the simplest models (a) learning er… ▽ More

    Submitted 25 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  16. arXiv:2006.06620  [pdf, other

    cs.RO cs.AI cs.LG

    From proprioception to long-horizon planning in novel environments: A hierarchical RL model

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Dileep George

    Abstract: For an intelligent agent to flexibly and efficiently operate in complex environments, they must be able to reason at multiple levels of temporal, spatial, and conceptual abstraction. At the lower levels, the agent must interpret their proprioceptive inputs and control their muscles, and at the higher levels, the agent must select goals and plan how they will achieve those goals. It is clear that e… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  17. arXiv:2003.04474  [pdf, other

    cs.RO cs.AI

    Learning a generative model for robot control using visual feedback

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Abhishek Agarwal, Yasemin Bekiroglu, Dileep George

    Abstract: We introduce a novel formulation for incorporating visual feedback in controlling robots. We define a generative model from actions to image observations of features on the end-effector. Inference in the model allows us to infer the robot state corresponding to target locations of the features. This, in turn, guides motion of the robot and allows for matching the target locations of the features i… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  18. arXiv:2002.04021  [pdf, other

    cs.AI cs.RO

    A Model of Fast Concept Inference with Object-Factorized Cognitive Programs

    Authors: Daniel P. Sawyer, Miguel Lázaro-Gredilla, Dileep George

    Abstract: The ability of humans to quickly identify general concepts from a handful of images has proven difficult to emulate with robots. Recently, a computer architecture was developed that allows robots to mimic some aspects of this human ability by modeling concepts as cognitive programs using an instruction set of primitive cognitive functions. This allowed a robot to emulate human imagination by simul… ▽ More

    Submitted 18 June, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 7 pages, 4 figures, 5 tables, to be presented at CogSci 2020

  19. arXiv:1912.02893  [pdf, other

    cs.LG stat.ML

    Learning undirected models via query training

    Authors: Miguel Lazaro-Gredilla, Wolfgang Lehrach, Dileep George

    Abstract: Typical amortized inference in variational autoencoders is specialized for a single probabilistic query. Here we propose an inference network architecture that generalizes to unseen probabilistic queries. Instead of an encoder-decoder pair, we can train a single inference network directly from data, using a cost function that is stochastic not only over samples, but also over queries. We can use t… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  20. arXiv:1905.00507  [pdf, other

    stat.ML cs.LG

    Learning higher-order sequential structure with cloned HMMs

    Authors: Antoine Dedieu, Nishad Gothoskar, Scott Swingle, Wolfgang Lehrach, Miguel Lázaro-Gredilla, Dileep George

    Abstract: Variable order sequence modeling is an important problem in artificial and natural intelligence. While overcomplete Hidden Markov Models (HMMs), in theory, have the capacity to represent long-term temporal structure, they often fail to learn and converge to local minima. We show that by constraining HMMs with a simple sparsity structure inspired by biology, we can make it learn variable order sequ… ▽ More

    Submitted 15 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

  21. arXiv:1812.02788  [pdf, other

    cs.AI cs.LG

    Beyond imitation: Zero-shot task transfer on robots by learning concepts as cognitive programs

    Authors: Miguel Lázaro-Gredilla, Dianhuan Lin, J. Swaroop Guntupalli, Dileep George

    Abstract: Humans can infer concepts from image pairs and apply those in the physical world in a completely different setting, enabling tasks like IKEA assembly from diagrams. If robots could represent and infer high-level concepts, it would significantly improve their ability to understand our intent and to transfer tasks between different environments. To that end, we introduce a computational framework th… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

  22. arXiv:1808.01058  [pdf

    q-bio.NC cs.CV

    Cortical Microcircuits from a Generative Vision Model

    Authors: Dileep George, Alexander Lavin, J. Swaroop Guntupalli, David Mely, Nick Hay, Miguel Lazaro-Gredilla

    Abstract: Understanding the information processing roles of cortical circuits is an outstanding problem in neuroscience and artificial intelligence. The theoretical setting of Bayesian inference has been suggested as a framework for understanding cortical computation. Based on a recently published generative model for visual inference (George et al., 2017), we derive a family of anatomically instantiated an… ▽ More

    Submitted 2 August, 2018; originally announced August 2018.

  23. arXiv:1804.01712  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Variational Rejection Sampling

    Authors: Aditya Grover, Ramki Gummadi, Miguel Lazaro-Gredilla, Dale Schuurmans, Stefano Ermon

    Abstract: Learning latent variable models with stochastic variational inference is challenging when the approximate posterior is far from the true posterior, due to high variance in the gradient estimates. We propose a novel rejection sampling step that discards samples from the variational posterior which are assigned low likelihoods by the model. Our approach provides an arbitrarily accurate approximation… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

    Comments: AISTATS 2018

  24. arXiv:1706.04317  [pdf, other

    cs.AI

    Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics

    Authors: Ken Kansky, Tom Silver, David A. Mély, Mohamed Eldawy, Miguel Lázaro-Gredilla, Xinghua Lou, Nimrod Dorfman, Szymon Sidor, Scott Phoenix, Dileep George

    Abstract: The recent adaptation of deep neural network-based methods to reinforcement learning and planning domains has yielded remarkable progress on individual tasks. Nonetheless, progress on task-to-task transfer remains limited. In pursuit of efficient and robust generalization, we introduce the Schema Network, an object-oriented generative physics simulator capable of disentangling multiple causes of e… ▽ More

    Submitted 17 August, 2017; v1 submitted 14 June, 2017; originally announced June 2017.

  25. arXiv:1611.02252  [pdf, other

    cs.LG cs.AI stat.ML

    Hierarchical compositional feature learning

    Authors: Miguel Lázaro-Gredilla, Yi Liu, D. Scott Phoenix, Dileep George

    Abstract: We introduce the hierarchical compositional network (HCN), a directed generative model able to discover and disentangle, without supervision, the building blocks of a set of binary images. The building blocks are binary features defined hierarchically as a composition of some of the features in the layer immediately below, arranged in a particular manner. At a high level, HCN is similar to a sigmo… ▽ More

    Submitted 25 October, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: Removed the "under review" header from every page, no changes to content

  26. arXiv:1303.2823  [pdf, other

    cs.LG cs.IT stat.ML

    Gaussian Processes for Nonlinear Signal Processing

    Authors: Fernando Pérez-Cruz, Steven Van Vaerenbergh, Juan José Murillo-Fuentes, Miguel Lázaro-Gredilla, Ignacio Santamaria

    Abstract: Gaussian processes (GPs) are versatile tools that have been successfully employed to solve nonlinear estimation problems in machine learning, but that are rarely used in signal processing. In this tutorial, we present GPs for regression as a natural nonlinear extension to optimal Wiener filtering. After establishing their basic formulation, we discuss several important aspects and extensions, incl… ▽ More

    Submitted 27 September, 2013; v1 submitted 12 March, 2013; originally announced March 2013.

    Journal ref: IEEE Signal Processing Magazine, vol.30, no.4, pp.40-50, July 2013

  27. arXiv:1108.3372  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Overlap** Mixtures of Gaussian Processes for the Data Association Problem

    Authors: Miguel Lázaro-Gredilla, Steven Van Vaerenbergh, Neil Lawrence

    Abstract: In this work we introduce a mixture of GPs to address the data association problem, i.e. to label a group of observations according to the sources that generated them. Unlike several previously proposed GP mixtures, the novel mixture has the distinct characteristic of using no gating function to determine the association of samples and mixture components. Instead, all the GPs in the mixture are gl… ▽ More

    Submitted 16 August, 2011; originally announced August 2011.