Skip to main content

Showing 1–7 of 7 results for author: Vértes, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.04009  [pdf, other

    cs.LG

    Investigating the role of model-based learning in exploration and transfer

    Authors: Jacob Walker, Eszter Vértes, Yazhe Li, Gabriel Dulac-Arnold, Ankesh Anand, Théophane Weber, Jessica B. Hamrick

    Abstract: State of the art reinforcement learning has enabled training agents on tasks of ever increasing complexity. However, the current paradigm tends to favor training agents from scratch on every new task or on collections of tasks with a view towards generalizing to novel task configurations. The former suffers from poor data efficiency while the latter is difficult when test tasks are out-of-distribu… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  2. arXiv:2211.05039  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Active Acquisition for Multimodal Temporal Data: A Challenging Decision-Making Task

    Authors: Jannik Kossen, Cătălina Cangea, Eszter Vértes, Andrew Jaegle, Viorica Patraucean, Ira Ktena, Nenad Tomasev, Danielle Belgrave

    Abstract: We introduce a challenging decision-making task that we call active acquisition for multimodal temporal data (A2MT). In many real-world scenarios, input features are not readily available at test time and must instead be acquired at significant cost. With A2MT, we aim to learn agents that actively select which modalities of an input to acquire, trading off acquisition cost and predictive performan… ▽ More

    Submitted 3 July, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: Published in Transactions on Machine Learning Research. Previous version accepted to Foundation Models for Decision Making Workshop at NeurIPS 2022

  3. arXiv:2112.04153  [pdf, other

    cs.LG cs.AI

    Model-Value Inconsistency as a Signal for Epistemic Uncertainty

    Authors: Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero

    Abstract: Using a model of the environment and a value function, an agent can construct many estimates of a state's value, by unrolling the model for different lengths and bootstrap** with its value function. Our key insight is that one can treat this set of value estimates as a type of ensemble, which we call an \emph{implicit value ensemble} (IVE). Consequently, the discrepancy between these estimates c… ▽ More

    Submitted 29 June, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: The first three authors contributed equally. Accepted at ICML 2022

  4. arXiv:2111.01587  [pdf, other

    cs.LG cs.AI

    Procedural Generalization by Planning with Self-Supervised World Models

    Authors: Ankesh Anand, Jacob Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Théophane Weber, Jessica B. Hamrick

    Abstract: One of the key promises of model-based reinforcement learning is the ability to generalize using an internal model of the world to make predictions in novel environments and tasks. However, the generalization ability of model-based agents is not well understood because existing work has focused on model-free agents when benchmarking generalization. Here, we explicitly measure the generalization ab… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  5. arXiv:1906.09480  [pdf, other

    stat.ML cs.LG cs.NE q-bio.NC

    A neurally plausible model learns successor representations in partially observable environments

    Authors: Eszter Vertes, Maneesh Sahani

    Abstract: Animals need to devise strategies to maximize returns while interacting with their environment based on incoming noisy sensory observations. Task-relevant states, such as the agent's location within an environment or the presence of a predator, are often not directly observable but must be inferred using available sensory information. Successor representations (SR) have been proposed as a middle-g… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

  6. arXiv:1805.11051  [pdf, other

    stat.ML cs.LG

    Flexible and accurate inference and learning for deep generative models

    Authors: Eszter Vertes, Maneesh Sahani

    Abstract: We introduce a new approach to learning in hierarchical latent-variable generative models called the "distributed distributional code Helmholtz machine", which emphasises flexibility and accuracy in the inferential process. In common with the original Helmholtz machine and later variational autoencoder algorithms (but unlike adverserial methods) our approach learns an explicit inference or "recogn… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

  7. arXiv:1701.00568  [pdf, other

    physics.soc-ph cs.SI q-bio.QM

    Versatility of nodal affiliation to communities

    Authors: Maxwell Shinn, Rafael Romero-Garcia, Jakob Seidlitz, František Váša, Petra E. Vértes, Edward Bullmore

    Abstract: Graph theoretical analysis of the community structure of networks attempts to identify the communities (or modules) to which each node affiliates. However, this is in most cases an ill-posed problem, as the affiliation of a node to a single community is often ambiguous. Previous solutions have attempted to identify all of the communities to which each node affiliates. Instead of taking this approa… ▽ More

    Submitted 2 January, 2017; originally announced January 2017.

    Journal ref: Scientific Reports 7, Article number: 4273 (2017)