Showing 1–16 of 16 results for author: Bartunov, S

  1. arXiv:2202.12795  [pdf, other]

    cs.LG cs.AI stat.ML

    Equilibrium Aggregation: Encoding Sets via Optimization

    Authors: Sergey Bartunov, Fabian B. Fuchs, Timothy Lillicrap

    Abstract: Processing sets or other unordered, potentially variable-sized inputs in neural networks is usually handled by aggregating a number of input tensors into a single representation. While a number of aggregation methods already exist from simple sum pooling to multi-head attention, they are limited in their representational power both from theoretical and empirical perspectives. On the search of a pr…

    Submitted 3 July, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Published at UAI 2022

  2. arXiv:2107.10201  [pdf, other]

    math.OC cs.LG

    Learning a Large Neighborhood Search Algorithm for Mixed Integer Programs

    Authors: Nicolas Sonnerat, Pengming Wang, Ira Ktena, Sergey Bartunov, Vinod Nair

    Abstract: Large Neighborhood Search (LNS) is a combinatorial optimization heuristic that starts with an assignment of values for the variables to be optimized, and iteratively improves it by searching a large neighborhood around the current assignment. In this paper we consider a learning-based LNS approach for mixed integer programs (MIPs). We train a Neural Diving model to represent a probability distribu…

    Submitted 20 May, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

  3. arXiv:2105.02769  [pdf, other]

    cs.CV cs.LG

    Computer-Aided Design as Language

    Authors: Yaroslav Ganin, Sergey Bartunov, Yujia Li, Ethan Keller, Stefano Saliceti

    Abstract: Computer-Aided Design (CAD) applications are used in manufacturing to model everything from coffee mugs to sports cars. These programs are complex and require years of training and experience to master. A component of all CAD models particularly difficult to make are the highly structured 2D sketches that lie at the heart of every 3D construction. In this work, we propose a machine learning model…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 24 pages, 11 figures, 3 tables

  4. arXiv:2012.13349  [pdf, other]

    math.OC cs.AI cs.DM cs.LG cs.NE

    Solving Mixed Integer Programs Using Neural Networks

    Authors: Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O'Donoghue, Nicolas Sonnerat, Christian Tjandraatmadja, Pengming Wang, Ravichandra Addanki, Tharindi Hapuarachchi, Thomas Keck, James Keeling, Pushmeet Kohli, Ira Ktena, Yujia Li, Oriol Vinyals, Yori Zwols

    Abstract: Mixed Integer Programming (MIP) solvers rely on an array of sophisticated heuristics developed with decades of research to solve large-scale MIP instances encountered in practice. Machine learning offers to automatically construct better heuristics from data by exploiting shared structure among instances in the data. This paper applies learning to the two key sub-tasks of a MIP solver, generating…

    Submitted 29 July, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  5. arXiv:1910.02720  [pdf, other]

    stat.ML cs.LG cs.NE

    Meta-Learning Deep Energy-Based Memory Models

    Authors: Sergey Bartunov, Jack W Rae, Simon Osindero, Timothy P Lillicrap

    Abstract: We study the problem of learning associative memory -- a system which is able to retrieve a remembered pattern based on its distorted or incomplete version. Attractor networks provide a sound model of associative memory: patterns are stored as attractors of the network dynamics and associative retrieval is performed by running the dynamics starting from a query pattern until it converges to an att…

    Submitted 20 April, 2021; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: ICLR 2020

  6. arXiv:1906.04304  [pdf, other]

    cs.LG cs.DB cs.DS stat.ML

    Meta-Learning Neural Bloom Filters

    Authors: Jack W Rae, Sergey Bartunov, Timothy P Lillicrap

    Abstract: There has been a recent trend in training neural networks to replace data structures that have been crafted by hand, with an aim for faster execution, better accuracy, or greater compression. In this setting, a neural data structure is instantiated by training a network over many epochs of its inputs until convergence. In applications where inputs arrive at high throughput, or are ephemeral, train…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: International Conference on Machine Learning 2019

  7. arXiv:1807.04587  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures

    Authors: Sergey Bartunov, Adam Santoro, Blake A. Richards, Luke Marris, Geoffrey E. Hinton, Timothy Lillicrap

    Abstract: The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The recent success of deep networks in machine learning and AI, however, has inspired proposals for understanding how the brain might learn across multiple layers, and hence how it might approximate BP. As of yet, none of these proposals have been rigorously evaluated on tasks where BP-guided deep learning has…

    Submitted 20 November, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: NIPS 2018. Version 2 contains more experimental data including best hyperparameters found

  8. arXiv:1711.08330  [pdf, other]

    cs.DB stat.ML

    Adaptive Cardinality Estimation

    Authors: Oleg Ivanov, Sergey Bartunov

    Abstract: In this paper we address cardinality estimation problem which is an important subproblem in query optimization. Query optimization is a part of every relational DBMS responsible for finding the best way of the execution for the given query. These ways are called plans. The execution time of different plans may differ by several orders, so query optimizer has a great influence on the whole DBMS per…

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: 12 pages, 11 figures, 1 table

  9. arXiv:1708.04782  [pdf, other]

    cs.LG cs.AI

    StarCraft II: A New Challenge for Reinforcement Learning

    Authors: Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing

    Abstract: This paper introduces SC2LE (StarCraft II Learning Environment), a reinforcement learning environment based on the StarCraft II game. This domain poses a new grand challenge for reinforcement learning, representing a more difficult class of problems than considered in most prior work. It is a multi-agent problem with multiple players interacting; there is imperfect information due to a partially o…

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Collaboration between DeepMind & Blizzard. 20 pages, 9 figures, 2 tables

  10. arXiv:1612.02192  [pdf, other]

    stat.ML cs.LG

    Fast Adaptation in Generative Models with Generative Matching Networks

    Authors: Sergey Bartunov, Dmitry P. Vetrov

    Abstract: Despite recent advances, the remaining bottlenecks in deep generative models are necessity of extensive training and difficulties with generalization from small number of training examples. We develop a new generative model called Generative Matching Network which is inspired by the recently proposed matching networks for one-shot learning in discriminative tasks. By conditioning on the additional…

    Submitted 5 September, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    ACM Class: I.2.6; I.5

  11. arXiv:1605.06065  [pdf, other]

    cs.LG

    One-shot Learning with Memory-Augmented Neural Networks

    Authors: Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, Timothy Lillicrap

    Abstract: Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently relearn their parameters to adequately incorporate the new information wi…

    Submitted 19 May, 2016; originally announced May 2016.

    Comments: 13 pages, 8 figures

  12. arXiv:1502.07257  [pdf, other]

    cs.CL

    Breaking Sticks and Ambiguities with Adaptive Skip-gram

    Authors: Sergey Bartunov, Dmitry Kondrashkin, Anton Osokin, Dmitry Vetrov

    Abstract: Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to overc…

    Submitted 15 November, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

  13. Thermonuclear Burning Regimes and the Use of SNe Ia in Cosmology

    Authors: E. I. Sorokina, S. I. Blinnikov, O. S. Bartunov

    Abstract: The calculations of the light curves of thermonuclear supernovae are carried out by a method of multi-group radiation hydrodynamics. The effects of spectral lines and expansion opacity are taken into account. The predictions for UBVI fluxes are given. The values of rise time for B and V bands found in our calculations are in good agreement with the observed values. We explain why our results for…

    Submitted 2 October, 1999; v1 submitted 30 June, 1999; originally announced June 1999.

    Comments: 20 pages, 5 figures, presented at the conference "Astronomy at the Eve of the New Century", Puschino, May 17-22, 1999. A few references and a table added, typos corrected

    Journal ref: Astron.Lett.26:67,2000

  14. A comparative modeling of supernova 1993J

    Authors: S. I. Blinnikov, R. Eastman, O. S. Bartunov, V. A. Popolitov, S. E. Woosley

    Abstract: The light curve of Supernova (SN) 1993J is calculated using two approaches to radiation transport as exemplified by the two computer codes, STELLA and EDDINGTON. Particular attention is paid to shock breakout and the photometry in the U, B, and V bands during the first 120 days. The hydrodynamical model, the explosion of a 13 Msun star which had lost most of its hydrogenic envelope to a companio…

    Submitted 6 November, 1997; originally announced November 1997.

    Comments: 43 pages with 22 eps figures, aaspp4.sty + epsf.sty, Accepted by ApJ, to appear in March 20, 1998 issue, Vol. 496

    Report number: MPA-1049

  15. arXiv:astro-ph/9611191  [pdf, ps, other]

    astro-ph

    The rate of Supernovae from the combined sample of five searches

    Authors: E. Cappellaro, M. Turatto, D. Yu. Tsvetkov, O. S. Bartunov, C. Pollas, R. Evans, M. Hamuy

    Abstract: With the purpose to obtain new estimates of the rate of supernovae we joined the logs of five SN searches, namely the Asiago, Crimea, Calán-Tololo and OCA photographic surveys and the visual search by Evans (the sample counts 110 SNe). We found that the most prolific galaxies are late spirals in which most SNe are of type II (0.88 SNu). SN Ib/c are rarer than SN Ia (0.16 and 0.24 SNu, respecti…

    Submitted 22 November, 1996; originally announced November 1996.

    Comments: 12 pages, Latex, 5 figures, postscript file also available at http://www.astro.it/supern/preprints.html

    Journal ref: Astron.Astrophys. 322 (1997) 431-441

  16. arXiv:astro-ph/9302017  [pdf, ps, other]

    astro-ph

    The Rate of Supernovae. II. the Selection Effects and the Frequencies Per Unit Blue Luminosity

    Authors: E. Cappellaro, M. Turatto, Benetti, D. Yu. Tsvetkov, O. S. Bartunov, I. N. Makarova

    Abstract: We present new estimates of the observed rates of SNe determined with the control time method applied to the files of observations of two long term, photographic SN searches carried out at the Asiago and Sternberg Observatories. Our calculations are applied to a galaxy sample extracted from RC3, in which 65 SNe have been discovered. This relatively large number of SNe has been redistribute…

    Submitted 25 February, 1993; originally announced February 1993.

    Comments: 9 pages, 3 figures (available upon request), Astron.Astrophys LaTeX style format, OAPD-203