Skip to main content

Showing 1–33 of 33 results for author: Barrett, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16424  [pdf, other

    cs.AI cs.LG

    Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization

    Authors: Felix Chalumeau, Refiloe Shabe, Noah de Nicola, Arnu Pretorius, Thomas D. Barrett, Nathan Grinsztajn

    Abstract: Combinatorial Optimization is crucial to numerous real-world applications, yet still presents challenges due to its (NP-)hard nature. Amongst existing approaches, heuristics often offer the best trade-off between quality and scalability, making them suitable for industrial use. While Reinforcement Learning (RL) offers a flexible framework for designing heuristics, its adoption over handcrafted heu… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2405.15840  [pdf, other

    q-bio.QM cs.LG

    Learning the Language of Protein Structure

    Authors: Benoit Gaujac, Jérémie Donà, Liviu Copoiu, Timothy Atkinson, Thomas Pierrot, Thomas D. Barrett

    Abstract: Representation learning and \emph{de novo} generation of proteins are pivotal computational biology tasks. Whilst natural language processing (NLP) techniques have proven highly effective for protein sequence modelling, structure modelling presents a complex challenge, primarily due to its continuous and three-dimensional nature. Motivated by this discrepancy, we introduce an approach using a vect… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2405.03162  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Advancing Multimodal Medical Capabilities of Gemini

    Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng , et al. (22 additional authors not shown)

    Abstract: Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's multimodal models, we develop several models within the new Med-Gemini family that inherit core capabilities of Gemini and are optimized for medical use via fine-tuning with 2D and 3D radiology, histop… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  4. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2311.18260  [pdf, other

    eess.IV cs.CL cs.CV cs.LG

    Consensus, dissensus and synergy between clinicians and specialist foundation models in radiology report generation

    Authors: Ryutaro Tanno, David G. T. Barrett, Andrew Sellergren, Sumedh Ghaisas, Sumanth Dathathri, Abigail See, Johannes Welbl, Karan Singhal, Shekoofeh Azizi, Tao Tu, Mike Schaekermann, Rhys May, Roy Lee, SiWai Man, Zahra Ahmed, Sara Mahdavi, Yossi Matias, Joelle Barral, Ali Eslami, Danielle Belgrave, Vivek Natarajan, Shravya Shetty, Pushmeet Kohli, Po-Sen Huang, Alan Karthikesalingam , et al. (1 additional authors not shown)

    Abstract: Radiology reports are an instrumental part of modern medicine, informing key clinical decisions such as diagnosis and treatment. The worldwide shortage of radiologists, however, restricts access to expert care and imposes heavy workloads, contributing to avoidable errors and delays in report delivery. While recent progress in automated report generation with vision-language models offer clear pote… ▽ More

    Submitted 20 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  6. arXiv:2311.17371  [pdf, other

    cs.CL cs.AI

    Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

    Authors: Andries Smit, Paul Duckworth, Nathan Grinsztajn, Thomas D. Barrett, Arnu Pretorius

    Abstract: Recent advancements in large language models (LLMs) underscore their potential for responding to inquiries in various domains. However, ensuring that generative agents provide accurate and reliable answers remains an ongoing challenge. In this context, multi-agent debate (MAD) has emerged as a promising strategy for enhancing the truthfulness of LLMs. We benchmark a range of debating and prompting… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 2 pages, 13 figures

  7. arXiv:2311.13569  [pdf, other

    cs.LG cs.AI

    Combinatorial Optimization with Policy Adaptation using Latent Space Search

    Authors: Felix Chalumeau, Shikha Surana, Clement Bonnet, Nathan Grinsztajn, Arnu Pretorius, Alexandre Laterre, Thomas D. Barrett

    Abstract: Combinatorial Optimization underpins many real-world applications and yet, designing performant algorithms to solve these complex, typically NP-hard, problems remains a significant research challenge. Reinforcement Learning (RL) provides a versatile framework for designing heuristics across a broad spectrum of problem domains. However, despite notable progress, RL has not yet supplanted industrial… ▽ More

    Submitted 28 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Fix typo in formula and add a reference

  8. Skin Deep: Investigating Subjectivity in Skin Tone Annotations for Computer Vision Benchmark Datasets

    Authors: Teanna Barrett, Quan Ze Chen, Amy X. Zhang

    Abstract: To investigate the well-observed racial disparities in computer vision systems that analyze images of humans, researchers have turned to skin tone as more objective annotation than race metadata for fairness performance evaluations. However, the current state of skin tone annotation procedures is highly varied. For instance, researchers use a range of untested scales and skin tone categories, have… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: To appear in FAcct '23

  9. arXiv:2211.02799  [pdf

    cs.CV cs.AI

    Evaluating Novel Mask-RCNN Architectures for Ear Mask Segmentation

    Authors: Saurav K. Aryal, Teanna Barrett, Gloria Washington

    Abstract: The human ear is generally universal, collectible, distinct, and permanent. Ear-based biometric recognition is a niche and recent approach that is being explored. For any ear-based biometric algorithm to perform well, ear detection and segmentation need to be accurately performed. While significant work has been done in existing literature for bounding boxes, a lack of approaches output a segmenta… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted into ICCBS 2022

  10. arXiv:2210.03475  [pdf, other

    cs.AI cs.LG

    Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization

    Authors: Nathan Grinsztajn, Daniel Furelos-Blanco, Shikha Surana, Clément Bonnet, Thomas D. Barrett

    Abstract: Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it removes the need for expert knowledge or pre-solved instances. However, it is unrealistic to expect an agent to solve these (often NP-)hard problems in a single shot at inference due to their inherent complexity. Thus, leading approaches often implement additional search strategies, from stochastic samp… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

  11. arXiv:2209.13083  [pdf, other

    cs.LG stat.ML

    Why neural networks find simple solutions: the many regularizers of geometric complexity

    Authors: Benoit Dherin, Michael Munn, Mihaela Rosca, David G. T. Barrett

    Abstract: In many contexts, simpler models are preferable to more complex models and the control of this model complexity is the goal for many methods in machine learning such as regularization, hyperparameter tuning and architecture design. In deep learning, it has been difficult to understand the underlying mechanisms of complexity control, since many traditional measures are not naturally suitable for de… ▽ More

    Submitted 23 December, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted as a NeurIPS 2022 paper

  12. arXiv:2206.06758  [pdf, other

    cs.MA cs.DM cs.LG

    Universally Expressive Communication in Multi-Agent Reinforcement Learning

    Authors: Matthew Morris, Thomas D. Barrett, Arnu Pretorius

    Abstract: Allowing agents to share information through communication is crucial for solving complex tasks in multi-agent reinforcement learning. In this work, we consider the question of whether a given communication protocol can express an arbitrary policy. By observing that many existing protocols can be viewed as instances of graph neural networks (GNNs), we demonstrate the equivalence of joint action se… ▽ More

    Submitted 13 January, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Published in NeurIPS 2022

    MSC Class: 68T07; 68T42; 68R10 (Primary) 68T20; 05C15 (Secondary) ACM Class: I.2.11; I.2.6; I.2.8

  13. arXiv:2205.14345  [pdf, other

    cs.LG

    Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories

    Authors: Christopher W. F. Parsonson, Alexandre Laterre, Thomas D. Barrett

    Abstract: Combinatorial optimisation problems framed as mixed integer linear programmes (MILPs) are ubiquitous across a range of real-world applications. The canonical branch-and-bound algorithm seeks to exactly solve MILPs by constructing a search tree of increasingly constrained sub-problems. In practice, its solving time performance is dependent on heuristics, such as the choice of the next variable to c… ▽ More

    Submitted 5 December, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: Accepted to AAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence

    Journal ref: AAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

  14. arXiv:2205.14105  [pdf, other

    cs.LG cs.AI

    Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration

    Authors: Thomas D. Barrett, Christopher W. F. Parsonson, Alexandre Laterre

    Abstract: From logistics to the natural sciences, combinatorial optimisation on graphs underpins numerous real-world applications. Reinforcement learning (RL) has shown particular promise in this setting as it can adapt to specific problem structures and does not require pre-solved instances for these, often NP-hard, problems. However, state-of-the-art (SOTA) approaches typically suffer from severe scalabil… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  15. arXiv:2111.15090  [pdf, other

    cs.LG stat.ML

    The Geometric Occam's Razor Implicit in Deep Learning

    Authors: Benoit Dherin, Michael Munn, David G. T. Barrett

    Abstract: In over-parameterized deep neural networks there can be many possible parameter configurations that fit the training data exactly. However, the properties of these interpolating solutions are poorly understood. We argue that over-parameterized neural networks trained with stochastic gradient descent are subject to a Geometric Occam's Razor; that is, these networks are implicitly regularized by the… ▽ More

    Submitted 30 November, 2021; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted as a NeurIPS 2021 workshop paper (OPT2021)

  16. arXiv:2111.00206  [pdf, other

    cs.LG cs.AI

    One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning

    Authors: Clément Bonnet, Paul Caron, Thomas Barrett, Ian Davies, Alexandre Laterre

    Abstract: Self-tuning algorithms that adapt the learning process online encourage more effective and robust learning. Among all the methods available, meta-gradients have emerged as a promising approach. They leverage the differentiability of the learning rule with respect to some hyper-parameters to adapt them in an online fashion. Although meta-gradients can be accumulated over multiple learning steps to… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 14 pages, 6 figures, 2 tables

  17. arXiv:2109.12606  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph quant-ph

    Autoregressive neural-network wavefunctions for ab initio quantum chemistry

    Authors: Thomas D. Barrett, Aleksei Malyshev, A. I. Lvovsky

    Abstract: In recent years, neural network quantum states (NNQS) have emerged as powerful tools for the study of quantum many-body systems. Electronic structure calculations are one such canonical many-body problem that have attracted significant research efforts spanning multiple decades, whilst only recently being attempted with NNQS. However, the complex non-local interactions and high sample complexity a… ▽ More

    Submitted 25 January, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, plus Methods and Supplementary Information

  18. arXiv:2105.13922  [pdf, other

    stat.ML cs.LG

    Discretization Drift in Two-Player Games

    Authors: Mihaela Rosca, Yan Wu, Benoit Dherin, David G. T. Barrett

    Abstract: Gradient-based methods for two-player games produce rich dynamics that can solve challenging problems, yet can be difficult to stabilize and understand. Part of this complexity originates from the discrete update steps given by simultaneous or alternating gradient descent, which causes each player to drift away from the continuous gradient flow -- a phenomenon we call discretization drift. Using b… ▽ More

    Submitted 1 July, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  19. arXiv:2105.08702  [pdf

    cs.SE

    Component Based Solutions Under Architecture

    Authors: T. A. Barrett, H. A. Proper

    Abstract: Many of today's applications have an, almost tangible, monolithic nature. They are built as 'islands', purporting to be self contained, offering little or nothing in the way of integration with other applications. In the past, being large and self-contained may have eliminated the need to interact with other solutions to some extent. However, in the business environments of today the interaction w… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

  20. arXiv:2101.12176  [pdf, other

    cs.LG stat.ML

    On the Origin of Implicit Regularization in Stochastic Gradient Descent

    Authors: Samuel L. Smith, Benoit Dherin, David G. T. Barrett, Soham De

    Abstract: For infinitesimal learning rates, stochastic gradient descent (SGD) follows the path of gradient flow on the full batch loss function. However moderately large learning rates can achieve higher test accuracies, and this generalization benefit is not explained by convergence bounds, since the learning rate which maximizes test accuracy is often larger than the learning rate which minimizes training… ▽ More

    Submitted 28 January, 2021; originally announced January 2021.

    Comments: Accepted as a conference paper at ICLR 2021

  21. arXiv:2009.11162  [pdf, other

    cs.LG stat.ML

    Implicit Gradient Regularization

    Authors: David G. T. Barrett, Benoit Dherin

    Abstract: Gradient descent can be surprisingly good at optimizing deep neural networks without overfitting and without explicit regularization. We find that the discrete steps of gradient descent implicitly regularize models by penalizing gradient descent trajectories that have large loss gradients. We call this Implicit Gradient Regularization (IGR) and we use backward error analysis to calculate the size… ▽ More

    Submitted 18 July, 2022; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Correction to formula A.14 in Appendix A.1 and update to the acknowledgments

    Journal ref: Published as a conference paper at ICLR 2021

  22. arXiv:2002.06991  [pdf, other

    cs.LG stat.ML

    Learning Group Structure and Disentangled Representations of Dynamical Environments

    Authors: Robin Quessard, Thomas D. Barrett, William R. Clements

    Abstract: Learning disentangled representations is a key step towards effectively discovering and modelling the underlying structure of environments. In the natural sciences, physics has found great success by describing the universe in terms of symmetry preserving transformations. Inspired by this formalism, we propose a framework, built upon the theory of group representation, for learning representations… ▽ More

    Submitted 25 October, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Accepted to NeurIPS 2020

  23. arXiv:1912.12256  [pdf, ps, other

    cs.ET eess.SP physics.optics

    Backpropagation through nonlinear units for all-optical training of neural networks

    Authors: Xianxin Guo, Thomas D. Barrett, Zhiming M. Wang, A. I. Lvovsky

    Abstract: Backpropagation through nonlinear neurons is an outstanding challenge to the field of optical neural networks and the major conceptual barrier to all-optical training schemes. Each neuron is required to exhibit a directionally dependent response to propagating optical signals, with the backwards response conditioned on the forward signal, which is highly non-trivial to implement optically. We prop… ▽ More

    Submitted 8 October, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: Error fixed in Fig.1

    Journal ref: Photonics Research 9, B71-B80 (2021)

  24. arXiv:1909.04063  [pdf, other

    cs.LG cs.AI stat.ML

    Exploratory Combinatorial Optimization with Reinforcement Learning

    Authors: Thomas D. Barrett, William R. Clements, Jakob N. Foerster, A. I. Lvovsky

    Abstract: Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset or ordering of vertices that maximize some objective function must be found. With such tasks often NP-hard and analytically intractable, reinforcement learning (RL) has shown promise as a framework with which efficient heuristic methods to tackle these problems can be learned. Previous works construc… ▽ More

    Submitted 31 January, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: In Proceedings of the 34th National Conference on Artificial Intelligence, AAAI 2020

    Journal ref: Proceedings of Thirty-fourth AAAI conference on artificial intelligence, 3243-3250 (2020)

  25. arXiv:1902.00120  [pdf, other

    cs.AI

    Learning to Make Analogies by Contrasting Abstract Relational Structure

    Authors: Felix Hill, Adam Santoro, David G. T. Barrett, Ari S. Morcos, Timothy Lillicrap

    Abstract: Analogical reasoning has been a principal focus of various waves of AI research. Analogy is particularly challenging for machines because it requires relational structures to be represented such that they can be flexibly applied across diverse domains of experience. Here, we study how analogical reasoning can be induced in neural networks that learn to perceive and reason about raw visual data. We… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

  26. arXiv:1810.13373  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG stat.ML

    Analyzing biological and artificial neural networks: challenges with opportunities for synergy?

    Authors: David G. T. Barrett, Ari S. Morcos, Jakob H. Macke

    Abstract: Deep neural networks (DNNs) transform stimuli across multiple processing stages to produce representations that can be used to solve complex tasks, such as object recognition in images. However, a full understanding of how they achieve this remains elusive. The complexity of biological neural networks substantially exceeds the complexity of DNNs, making it even more challenging to understand the r… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

  27. arXiv:1807.04225  [pdf, other

    cs.LG stat.ML

    Measuring abstract reasoning in neural networks

    Authors: David G. T. Barrett, Felix Hill, Adam Santoro, Ari S. Morcos, Timothy Lillicrap

    Abstract: Whether neural networks can learn abstract reasoning or whether they merely rely on superficial statistics is a topic of recent debate. Here, we propose a dataset and challenge designed to probe abstract reasoning, inspired by a well-known human IQ test. To succeed at this challenge, models must cope with various generalisation `regimes' in which the training and test data differ in clearly-define… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: ICML 2018

  28. arXiv:1806.02215  [pdf, other

    cs.LG cs.AI stat.ML

    Spectral Inference Networks: Unifying Deep and Spectral Learning

    Authors: David Pfau, Stig Petersen, Ashish Agarwal, David G. T. Barrett, Kimberly L. Stachenfeld

    Abstract: We present Spectral Inference Networks, a framework for learning eigenfunctions of linear operators by stochastic optimization. Spectral Inference Networks generalize Slow Feature Analysis to generic symmetric operators, and are closely related to Variational Monte Carlo methods from computational physics. As such, they can be a powerful tool for unsupervised representation learning from video or… ▽ More

    Submitted 16 January, 2020; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Fixed typo in math in section 4

    Journal ref: Seventh International Conference on Learning Representations (ICLR 2019)

  29. arXiv:1804.08663  [pdf, other

    eess.AS cs.SD

    A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment

    Authors: Megan M. Willi, Stephanie A. Borrie, Tyson S. Barrett, Ming Tu, Visar Berisha

    Abstract: Acoustic-prosodic entrainment describes the tendency of humans to align or adapt their speech acoustics to each other in conversation. This alignment of spoken behavior has important implications for conversational success. However, modeling the subtle nature of entrainment in spoken dialogue continues to pose a challenge. In this paper, we propose a straightforward definition for local entrainmen… ▽ More

    Submitted 12 July, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

  30. arXiv:1803.06959  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    On the importance of single directions for generalization

    Authors: Ari S. Morcos, David G. T. Barrett, Neil C. Rabinowitz, Matthew Botvinick

    Abstract: Despite their ability to memorize large datasets, deep neural networks often achieve good generalization performance. However, the differences between the learned solutions of networks which generalize and those which do not remain unclear. Additionally, the tuning properties of single directions (defined as the activation of a single unit or some linear combination of units in response to some in… ▽ More

    Submitted 22 May, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: ICLR 2018 conference paper; added additional methodological details

  31. arXiv:1711.08378  [pdf

    cs.AI

    Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017

    Authors: M. Botvinick, D. G. T. Barrett, P. Battaglia, N. de Freitas, D. Kumaran, J. Z Leibo, T. Lillicrap, J. Modayil, S. Mohamed, N. C. Rabinowitz, D. J. Rezende, A. Santoro, T. Schaul, C. Summerfield, G. Wayne, T. Weber, D. Wierstra, S. Legg, D. Hassabis

    Abstract: We agree with Lake and colleagues on their list of key ingredients for building humanlike intelligence, including the idea that model-based reasoning is essential. However, we favor an approach that centers on one additional ingredient: autonomy. In particular, we aim toward agents that can both build and exploit their own internal models, with minimal human hand-engineering. We believe an approac… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

  32. arXiv:1706.08606  [pdf, other

    stat.ML cs.CV cs.LG

    Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study

    Authors: Samuel Ritter, David G. T. Barrett, Adam Santoro, Matt M. Botvinick

    Abstract: Deep neural networks (DNNs) have achieved unprecedented performance on a wide range of complex tasks, rapidly outpacing our understanding of the nature of their solutions. This has caused a recent surge of interest in methods for rendering modern neural systems more interpretable. In this work, we propose to address the interpretability problem in modern DNNs using the rich history of problem desc… ▽ More

    Submitted 29 June, 2017; v1 submitted 26 June, 2017; originally announced June 2017.

    Comments: ICML 2017

  33. arXiv:1706.01427  [pdf, other

    cs.CL cs.LG

    A simple neural network module for relational reasoning

    Authors: Adam Santoro, David Raposo, David G. T. Barrett, Mateusz Malinowski, Razvan Pascanu, Peter Battaglia, Timothy Lillicrap

    Abstract: Relational reasoning is a central component of generally intelligent behavior, but has proven difficult for neural networks to learn. In this paper we describe how to use Relation Networks (RNs) as a simple plug-and-play module to solve problems that fundamentally hinge on relational reasoning. We tested RN-augmented networks on three tasks: visual question answering using a challenging dataset ca… ▽ More

    Submitted 5 June, 2017; originally announced June 2017.