Skip to main content

Showing 1–7 of 7 results for author: Kaba, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.09016  [pdf, other

    cs.LG stat.ML

    Symmetry Breaking and Equivariant Neural Networks

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Using symmetry as an inductive bias in deep learning has been proven to be a principled approach for sample-efficient model design. However, the relationship between symmetry and the imperative for equivariance in neural networks is not always obvious. Here, we analyze a key limitation that arises in equivariant functions: their incapacity to break symmetry at the level of individual data samples.… ▽ More

    Submitted 22 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages, 2 figures, Symmetry and Geometry in Neural Representations

  2. arXiv:2310.01647  [pdf, other

    cs.LG

    Equivariant Adaptation of Large Pretrained Models

    Authors: Arnab Kumar Mondal, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Sai Rajeswar, Siamak Ravanbakhsh

    Abstract: Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to higher sample efficiency and more accurate and robust predictions. However, redesigning each component of prevalent deep neural network architectures to achieve chosen equivariance is a difficult problem and can result in a computationally expensive network during… ▽ More

    Submitted 29 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures. Accepted to NeurIPS 2023

  3. arXiv:2309.03139  [pdf, other

    cs.LG

    Using Multiple Vector Channels Improves E(n)-Equivariant Graph Neural Networks

    Authors: Daniel Levy, Sékou-Oumar Kaba, Carmelo Gonzales, Santiago Miret, Siamak Ravanbakhsh

    Abstract: We present a natural extension to E(n)-equivariant graph neural networks that uses multiple equivariant vectors per node. We formulate the extension and show that it improves performance across different physical systems benchmark tasks, with minimal differences in runtime or number of parameters. The proposed multichannel EGNN outperforms the standard singlechannel EGNN on N-body charged particle… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  4. arXiv:2211.15420  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Equivariant Networks for Crystal Structures

    Authors: Sékou-Oumar Kaba, Siamak Ravanbakhsh

    Abstract: Supervised learning with deep models has tremendous potential for applications in materials science. Recently, graph neural networks have been used in this context, drawing direct inspiration from models for molecules. However, materials are typically much more structured than molecules, which is a feature that these models do not leverage. In this work, we introduce a class of models that are equ… ▽ More

    Submitted 15 January, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures + appendix

  5. arXiv:2211.06489  [pdf, other

    cs.LG cs.AI

    Equivariance with Learned Canonicalization Functions

    Authors: Sékou-Oumar Kaba, Arnab Kumar Mondal, Yan Zhang, Yoshua Bengio, Siamak Ravanbakhsh

    Abstract: Symmetry-based neural networks often constrain the architecture in order to achieve invariance or equivariance to a group of transformations. In this paper, we propose an alternative that avoids this architectural constraint by learning to produce canonical representations of the data. These canonicalization functions can readily be plugged into non-equivariant backbone architectures. We offer exp… ▽ More

    Submitted 7 July, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 21 pages, 5 figures

  6. arXiv:2111.14712  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Prediction of Large Magnetic Moment Materials With Graph Neural Networks and Random Forests

    Authors: Sékou-Oumar Kaba, Benjamin Groleau-Paré, Marc-Antoine Gauthier, André-Marie Tremblay, Simon Verret, Chloé Gauvin-Ndiaye

    Abstract: Magnetic materials are crucial components of many technologies that could drive the ecological transition, including electric motors, wind turbine generators and magnetic refrigeration systems. Discovering materials with large magnetic moments is therefore an increasing priority. Here, using state-of-the-art machine learning methods, we scan the Inorganic Crystal Structure Database (ICSD) of hundr… ▽ More

    Submitted 17 April, 2023; v1 submitted 29 November, 2021; originally announced November 2021.

    ACM Class: J.2

    Journal ref: Phys. Rev. Mater., 7:044407, Apr 2023

  7. arXiv:2011.09468  [pdf, other

    cs.LG math.DS stat.ML

    Gradient Starvation: A Learning Proclivity in Neural Networks

    Authors: Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

    Abstract: We identify and formalize a fundamental gradient descent phenomenon resulting in a learning proclivity in over-parameterized neural networks. Gradient Starvation arises when cross-entropy loss is minimized by capturing only a subset of features relevant for the task, despite the presence of other predictive features that fail to be discovered. This work provides a theoretical explanation for the e… ▽ More

    Submitted 24 November, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: Proceeding of NeurIPS 2021