Showing 1–6 of 6 results for author: Joudaki, A

  1. arXiv:2312.03865  [pdf, other]

    cs.LG q-bio.GN

    Learning Genomic Sequence Representations using Graph Neural Networks over De Bruijn Graphs

    Authors: Kacper Kapuśniak, Manuel Burger, Gunnar Rätsch, Amir Joudaki

    Abstract: The rapid expansion of genomic sequence data calls for new methods to achieve robust sequence representations. Existing techniques often neglect intricate structural details, emphasizing mainly contextual information. To address this, we developed k-mer embeddings that merge contextual and structural string information by enhancing De Bruijn graphs with structural similarity connections. Subsequen…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Poster at "NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)"

  2. arXiv:2310.02012  [pdf, other]

    cs.LG cs.AI

    Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

    Authors: Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand

    Abstract: Normalization layers are one of the key building blocks for deep neural networks. Several theoretical studies have shown that batch normalization improves the signal propagation, by avoiding the representations from becoming collinear across the layers. However, results on mean-field theory of batch normalization also conclude that this benefit comes at the expense of exploding gradients in depth.…

    Submitted 3 October, 2023; originally announced October 2023.

  3. arXiv:2305.18399  [pdf, other]

    cs.LG cs.AI stat.ML

    On the impact of activation and normalization in obtaining isometric embeddings at initialization

    Authors: Amir Joudaki, Hadi Daneshmand, Francis Bach

    Abstract: In this paper, we explore the structure of the penultimate Gram matrix in deep neural networks, which contains the pairwise inner products of outputs corresponding to a batch of inputs. In several architectures it has been observed that this Gram matrix becomes degenerate with depth at initialization, which dramatically slows training. Normalization layers, such as batch or layer normalization, pl…

    Submitted 17 November, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  4. arXiv:2205.13076  [pdf, other]

    cs.LG cs.AI math.ST

    On Bridging the Gap between Mean Field and Finite Width in Deep Random Neural Networks with Batch Normalization

    Authors: Amir Joudaki, Hadi Daneshmand, Francis Bach

    Abstract: Mean field theory is widely used in the theoretical studies of neural networks. In this paper, we analyze the role of depth in the concentration of mean-field predictions, specifically for deep multilayer perceptron (MLP) with batch normalization (BN) at initialization. By scaling the network width to infinity, it is postulated that the mean-field predictions suffer from layer-wise errors that amp…

    Submitted 20 February, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  5. arXiv:2106.03970  [pdf, other]

    stat.ML cs.AI cs.LG

    Batch Normalization Orthogonalizes Representations in Deep Random Networks

    Authors: Hadi Daneshmand, Amir Joudaki, Francis Bach

    Abstract: This paper underlines a subtle property of batch-normalization (BN): Successive batch normalizations with random linear transformations make hidden representations increasingly orthogonal across layers of a deep neural network. We establish a non-asymptotic characterization of the interplay between depth, width, and the orthogonality of deep representations. More precisely, under a mild assumption…

    Submitted 7 June, 2021; originally announced June 2021.

  6. arXiv:1312.0803  [pdf, ps, other]

    cs.CG

    Nonlinear Dimensionality Reduction via Path-Based Isometric Mapping

    Authors: Amir Najafi, Amir Joudaki, Emad Fatemizadeh

    Abstract: Nonlinear dimensionality reduction methods have demonstrated top-notch performance in many pattern recognition and image classification tasks. Despite their popularity, they suffer from highly expensive time and memory requirements, which render them inapplicable to large-scale datasets. To leverage such cases we propose a new method called "Path-Based Isomap". Similar to Isomap, we exploit geodes…

    Submitted 6 April, 2014; v1 submitted 3 December, 2013; originally announced December 2013.

    Comments: (29) pages, (12) figures