Showing 1–4 of 4 results for author: Ganev, I

Searching in archive cs.
  1. arXiv:2210.17216  [pdf, other]

    cs.LG math.RT

    Symmetries, flat minima, and the conserved quantities of gradient flow

    Authors: Bo Zhao, Iordan Ganev, Robin Walters, Rose Yu, Nima Dehmamy

    Abstract: Empirical studies of the loss landscape of deep networks have revealed that many local minima are connected through low-loss valleys. Yet, little is known about the theoretical origin of such valleys. We present a general framework for finding continuous symmetries in the parameter space, which carve out low-loss valleys. Our framework uses equivariances of the activation functions and can be appl…

    Submitted 23 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: To appear at ICLR 2023

  2. arXiv:2207.12773  [pdf, other]

    cs.LG math.RT

    Quiver neural networks

    Authors: Iordan Ganev, Robin Walters

    Abstract: We develop a uniform theoretical approach towards the analysis of various neural network connectivity architectures by introducing the notion of a quiver neural network. Inspired by quiver representation theory in mathematics, this approach gives a compact way to capture elaborate data flows in complex network architectures. As an application, we use parameter space symmetries to prove a lossless…

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Preliminary version, comments welcome

  3. arXiv:2107.02550  [pdf, other]

    cs.LG math.RT

    Universal approximation and model compression for radial neural networks

    Authors: Iordan Ganev, Twan van Laarhoven, Robin Walters

    Abstract: We introduce a class of fully-connected neural networks whose activation functions, rather than being pointwise, rescale feature vectors by a function depending only on their norm. We call such networks radial neural networks, extending previous work on rotation equivariant networks that considers rescaling activations in less generality. We prove universal approximation theorems for radial neural…

    Submitted 16 February, 2023; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 44 pages

  4. arXiv:2010.06277  [pdf, other]

    cs.AR cs.DC

    PIUMA: Programmable Integrated Unified Memory Architecture

    Authors: Sriram Aananthakrishnan, Nesreen K. Ahmed, Vincent Cave, Marcelo Cintra, Yigit Demir, Kristof Du Bois, Stijn Eyerman, Joshua B. Fryman, Ivan Ganev, Wim Heirman, Hans-Christian Hoppe, Jason Howard, Ibrahim Hur, MidhunChandra Kodiyath, Samkit Jain, Daniel S. Klowden, Marek M. Landowski, Laurent Montigny, Ankit More, Przemyslaw Ossowski, Robert Pawlowski, Nick Pepperling, Fabrizio Petrini, Mariusz Sikora, Balasubramanian Seshasayee, et al. (6 additional authors not shown)

    Abstract: High performance large scale graph analytics is essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from inefficient resource usage and bad scaling on graph workloads. To enable efficient and scalable graph analysis, Intel developed the Programmable Integrated Unified Memory Architecture (PIUMA). PIUMA consists of many multi-threaded cores, fine-…

    Submitted 13 October, 2020; originally announced October 2020.