Skip to main content

Showing 1–12 of 12 results for author: Bartunov, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.12795  [pdf, other

    cs.LG cs.AI stat.ML

    Equilibrium Aggregation: Encoding Sets via Optimization

    Authors: Sergey Bartunov, Fabian B. Fuchs, Timothy Lillicrap

    Abstract: Processing sets or other unordered, potentially variable-sized inputs in neural networks is usually handled by aggregating a number of input tensors into a single representation. While a number of aggregation methods already exist from simple sum pooling to multi-head attention, they are limited in their representational power both from theoretical and empirical perspectives. On the search of a pr… ▽ More

    Submitted 3 July, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Published at UAI 2022

  2. arXiv:2107.10201  [pdf, other

    math.OC cs.LG

    Learning a Large Neighborhood Search Algorithm for Mixed Integer Programs

    Authors: Nicolas Sonnerat, Pengming Wang, Ira Ktena, Sergey Bartunov, Vinod Nair

    Abstract: Large Neighborhood Search (LNS) is a combinatorial optimization heuristic that starts with an assignment of values for the variables to be optimized, and iteratively improves it by searching a large neighborhood around the current assignment. In this paper we consider a learning-based LNS approach for mixed integer programs (MIPs). We train a Neural Diving model to represent a probability distribu… ▽ More

    Submitted 20 May, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

  3. arXiv:2105.02769  [pdf, other

    cs.CV cs.LG

    Computer-Aided Design as Language

    Authors: Yaroslav Ganin, Sergey Bartunov, Yujia Li, Ethan Keller, Stefano Saliceti

    Abstract: Computer-Aided Design (CAD) applications are used in manufacturing to model everything from coffee mugs to sports cars. These programs are complex and require years of training and experience to master. A component of all CAD models particularly difficult to make are the highly structured 2D sketches that lie at the heart of every 3D construction. In this work, we propose a machine learning model… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 24 pages, 11 figures, 3 tables

  4. arXiv:2012.13349  [pdf, other

    math.OC cs.AI cs.DM cs.LG cs.NE

    Solving Mixed Integer Programs Using Neural Networks

    Authors: Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O'Donoghue, Nicolas Sonnerat, Christian Tjandraatmadja, Pengming Wang, Ravichandra Addanki, Tharindi Hapuarachchi, Thomas Keck, James Keeling, Pushmeet Kohli, Ira Ktena, Yujia Li, Oriol Vinyals, Yori Zwols

    Abstract: Mixed Integer Programming (MIP) solvers rely on an array of sophisticated heuristics developed with decades of research to solve large-scale MIP instances encountered in practice. Machine learning offers to automatically construct better heuristics from data by exploiting shared structure among instances in the data. This paper applies learning to the two key sub-tasks of a MIP solver, generating… ▽ More

    Submitted 29 July, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  5. arXiv:1910.02720  [pdf, other

    stat.ML cs.LG cs.NE

    Meta-Learning Deep Energy-Based Memory Models

    Authors: Sergey Bartunov, Jack W Rae, Simon Osindero, Timothy P Lillicrap

    Abstract: We study the problem of learning associative memory -- a system which is able to retrieve a remembered pattern based on its distorted or incomplete version. Attractor networks provide a sound model of associative memory: patterns are stored as attractors of the network dynamics and associative retrieval is performed by running the dynamics starting from a query pattern until it converges to an att… ▽ More

    Submitted 20 April, 2021; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: ICLR 2020

  6. arXiv:1906.04304  [pdf, other

    cs.LG cs.DB cs.DS stat.ML

    Meta-Learning Neural Bloom Filters

    Authors: Jack W Rae, Sergey Bartunov, Timothy P Lillicrap

    Abstract: There has been a recent trend in training neural networks to replace data structures that have been crafted by hand, with an aim for faster execution, better accuracy, or greater compression. In this setting, a neural data structure is instantiated by training a network over many epochs of its inputs until convergence. In applications where inputs arrive at high throughput, or are ephemeral, train… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: International Conference on Machine Learning 2019

  7. arXiv:1807.04587  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures

    Authors: Sergey Bartunov, Adam Santoro, Blake A. Richards, Luke Marris, Geoffrey E. Hinton, Timothy Lillicrap

    Abstract: The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The recent success of deep networks in machine learning and AI, however, has inspired proposals for understanding how the brain might learn across multiple layers, and hence how it might approximate BP. As of yet, none of these proposals have been rigorously evaluated on tasks where BP-guided deep learning has… ▽ More

    Submitted 20 November, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: NIPS 2018. Version 2 contains more experimental data including best hyperparameters found

  8. arXiv:1711.08330  [pdf, other

    cs.DB stat.ML

    Adaptive Cardinality Estimation

    Authors: Oleg Ivanov, Sergey Bartunov

    Abstract: In this paper we address cardinality estimation problem which is an important subproblem in query optimization. Query optimization is a part of every relational DBMS responsible for finding the best way of the execution for the given query. These ways are called plans. The execution time of different plans may differ by several orders, so query optimizer has a great influence on the whole DBMS per… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: 12 pages, 11 figures, 1 table

  9. arXiv:1708.04782  [pdf, other

    cs.LG cs.AI

    StarCraft II: A New Challenge for Reinforcement Learning

    Authors: Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

    Abstract: This paper introduces SC2LE (StarCraft II Learning Environment), a reinforcement learning environment based on the StarCraft II game. This domain poses a new grand challenge for reinforcement learning, representing a more difficult class of problems than considered in most prior work. It is a multi-agent problem with multiple players interacting; there is imperfect information due to a partially o… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Collaboration between DeepMind & Blizzard. 20 pages, 9 figures, 2 tables

  10. arXiv:1612.02192  [pdf, other

    stat.ML cs.LG

    Fast Adaptation in Generative Models with Generative Matching Networks

    Authors: Sergey Bartunov, Dmitry P. Vetrov

    Abstract: Despite recent advances, the remaining bottlenecks in deep generative models are necessity of extensive training and difficulties with generalization from small number of training examples. We develop a new generative model called Generative Matching Network which is inspired by the recently proposed matching networks for one-shot learning in discriminative tasks. By conditioning on the additional… ▽ More

    Submitted 5 September, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    ACM Class: I.2.6; I.5

  11. arXiv:1605.06065  [pdf, other

    cs.LG

    One-shot Learning with Memory-Augmented Neural Networks

    Authors: Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, Timothy Lillicrap

    Abstract: Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently relearn their parameters to adequately incorporate the new information wi… ▽ More

    Submitted 19 May, 2016; originally announced May 2016.

    Comments: 13 pages, 8 figures

  12. arXiv:1502.07257  [pdf, other

    cs.CL

    Breaking Sticks and Ambiguities with Adaptive Skip-gram

    Authors: Sergey Bartunov, Dmitry Kondrashkin, Anton Osokin, Dmitry Vetrov

    Abstract: Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to overc… ▽ More

    Submitted 15 November, 2015; v1 submitted 25 February, 2015; originally announced February 2015.