Multigrid Neural Memory

Huynh, Tri; Maire, Michael; Walter, Matthew R.

arXiv:1906.05948 (cs)

[Submitted on 13 Jun 2019 (v1), last revised 15 Aug 2020 (this version, v4)]

Abstract: We introduce a novel approach to endowing neural networks with emergent, long-term, large-scale memory. Distinct from strategies that connect neural networks to external memory banks via intricately crafted controllers and hand-designed attentional mechanisms, our memory is internal, distributed, co-located alongside computation, and implicitly addressed, while being drastically simpler than prior efforts. Architecting networks with multigrid structure and connectivity, while distributing memory cells alongside computation throughout this topology, we observe the emergence of coherent memory subsystems. Our hierarchical spatial organization, parameterized convolutionally, permits efficient instantiation of large-capacity memories, while multigrid topology provides short internal routing pathways, allowing convolutional networks to efficiently approximate the behavior of fully connected networks. Such networks have an implicit capacity for internal attention; augmented with memory, they learn to read and write specific memory locations in a dynamic data-dependent manner. We demonstrate these capabilities on exploration and mapping tasks, where our network is able to self-organize and retain long-term memory for trajectories of thousands of time steps. On tasks decoupled from any notion of spatial geometry: sorting, associative recall, and question answering, our design functions as a truly generic memory and yields excellent results.
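The claim that multigrid topology provides short internal routing pathways can be illustrated with a small connectivity sketch. This is not the authors' implementation: it ignores learned weights and memory cells entirely and tracks only which grid cells could be influenced by a single source cell. It assumes 3x3 neighborhoods, a pyramid of grids that halve in side length down to 1x1, nearest-neighbor upsampling, and 2x2 pooling between adjacent scales. All function names are hypothetical.

```python
# Connectivity-only sketch (assumed setup, not the paper's code): count how
# many layers it takes for one corner cell to influence the whole finest grid.

def dilate(grid):
    """Spread reachability to each cell's 3x3 neighborhood (a 3x3 conv's footprint)."""
    n = len(grid)
    return [[any(grid[a][b]
                 for a in range(max(0, i - 1), min(n, i + 2))
                 for b in range(max(0, j - 1), min(n, j + 2)))
             for j in range(n)] for i in range(n)]

def upsample(grid):
    """Nearest-neighbor upsampling to twice the side length."""
    n = len(grid) * 2
    return [[grid[i // 2][j // 2] for j in range(n)] for i in range(n)]

def downsample(grid):
    """2x2 pooling: a coarse cell is reachable if any of its four children is."""
    n = len(grid) // 2
    return [[grid[2 * i][2 * j] or grid[2 * i][2 * j + 1]
             or grid[2 * i + 1][2 * j] or grid[2 * i + 1][2 * j + 1]
             for j in range(n)] for i in range(n)]

def union(a, b):
    """Elementwise OR of two equally sized boolean grids."""
    return [[x or y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def multigrid_step(pyramid):
    """One layer: each scale merges its finer and coarser neighbors, then dilates."""
    new = []
    for s, grid in enumerate(pyramid):
        combined = grid
        if s > 0:
            combined = union(combined, downsample(pyramid[s - 1]))
        if s + 1 < len(pyramid):
            combined = union(combined, upsample(pyramid[s + 1]))
        new.append(dilate(combined))
    return new

def layers_to_cover(side, use_pyramid):
    """Layers needed for cell (0, 0) of the finest grid to reach every finest cell.

    `side` is assumed to be a power of two. With use_pyramid=False only the
    finest grid exists, which models an ordinary single-scale 3x3 conv stack.
    """
    sizes = [side]
    while use_pyramid and sizes[-1] > 1:
        sizes.append(sizes[-1] // 2)
    pyramid = [[[False] * n for _ in range(n)] for n in sizes]
    pyramid[0][0][0] = True
    layers = 0
    while not all(all(row) for row in pyramid[0]):
        pyramid = multigrid_step(pyramid)
        layers += 1
    return layers

if __name__ == "__main__":
    print("single grid:", layers_to_cover(32, use_pyramid=False))
    print("multigrid:  ", layers_to_cover(32, use_pyramid=True))
```

Under these assumptions the single-grid count grows linearly with the side length, while the pyramid count is bounded by roughly twice the number of scales, since a signal can travel down to the coarsest grid and back up.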

Comments:	ICML 2020; Project Website: this http URL
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1906.05948 [cs.LG]
	(or arXiv:1906.05948v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.05948

Submission history

From: Tri Huynh
[v1] Thu, 13 Jun 2019 22:10:01 UTC (720 KB)
[v2] Wed, 25 Sep 2019 21:18:45 UTC (816 KB)
[v3] Thu, 27 Feb 2020 22:26:05 UTC (1,119 KB)
[v4] Sat, 15 Aug 2020 21:12:15 UTC (1,133 KB)
