Skip to main content

Showing 1–17 of 17 results for author: Seddik, M E A

.
  1. arXiv:2405.14088  [pdf, other

    cs.LG cs.AI stat.ML

    High-dimensional Learning with Noisy Labels

    Authors: Aymane El Firdoussi, Mohamed El Amine Seddik

    Abstract: This paper provides theoretical insights into high-dimensional binary classification with class-conditional noisy labels. Specifically, we study the behavior of a linear classifier with a label noisiness aware loss function, when both the dimension of data $p$ and the sample size $n$ are large and comparable. Relying on random matrix theory by supposing a Gaussian mixture data model, the performan… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.05090  [pdf, other

    cs.LG cs.AI cs.CL

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Authors: Mohamed El Amine Seddik, Suei-Wen Chen, Soufiane Hayou, Pierre Youssef, Merouane Debbah

    Abstract: The phenomenon of model collapse, introduced in (Shumailov et al., 2023), refers to the deterioration in performance that occurs when new models are trained on synthetic data generated from previously trained models. This recursive training loop makes the tails of the original distribution disappear, thereby making future-generation models forget about the initial (real) distribution. With the aim… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  3. arXiv:2404.04291  [pdf, other

    cs.LG

    Investigating Regularization of Self-Play Language Models

    Authors: Reda Alami, Abdalgader Abubaker, Mastane Achab, Mohamed El Amine Seddik, Salem Lahlou

    Abstract: This paper explores the effects of various forms of regularization in the context of language model alignment via self-play. While both reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO) require to collect costly human-annotated pairwise preferences, the self-play fine-tuning (SPIN) approach replaces the rejected answers by data generated from the previous i… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  4. arXiv:2402.10677  [pdf, other

    stat.ML cs.LG math.PR

    Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model

    Authors: Hugo Lebeau, Mohamed El Amine Seddik, José Henrique de Morais Goulart

    Abstract: We study the estimation of a planted signal hidden in a recently introduced nested matrix-tensor model, which is an extension of the classical spiked rank-one tensor model, motivated by multi-view clustering. Prior work has theoretically examined the performance of a tensor-based approach, which relies on finding a best rank-one approximation, a problem known to be computationally hard. A tractabl… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2401.05224  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Do Vision and Language Encoders Represent the World Similarly?

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

    Abstract: Aligned text-image encoders such as CLIP have become the de facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a central question: does an alignment exist between uni-modal vision and language encoders since they fundamentally represent the same physical world? Analyzing the latent spaces structure… ▽ More

    Submitted 22 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted CVPR 2024

  6. arXiv:2310.18717  [pdf, other

    stat.ML cs.LG

    On the Accuracy of Hotelling-Type Asymmetric Tensor Deflation: A Random Tensor Analysis

    Authors: Mohamed El Amine Seddik, Maxime Guillaud, Alexis Decurninge, José Henrique de Morais Goulart

    Abstract: This work introduces an asymptotic study of Hotelling-type tensor deflation in the presence of noise, in the regime of large tensor dimensions. Specifically, we consider a low-rank asymmetric tensor model of the form $\sum_{i=1}^r β_i{\mathcal{A}}_i + {\mathcal{W}}$ where $β_i\geq 0$ and the ${\mathcal{A}}_i$'s are unit-norm rank-one tensors such that… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at IEEE CAMSAP 2023. See also companion paper arXiv:2304.10248 for the symmetric case. arXiv admin note: text overlap with arXiv:2211.09004

  7. arXiv:2305.19992  [pdf, ps, other

    stat.ML cs.LG

    A Nested Matrix-Tensor Model for Noisy Multi-view Clustering

    Authors: Mohamed El Amine Seddik, Mastane Achab, Henrique Goulart, Merouane Debbah

    Abstract: In this paper, we propose a nested matrix-tensor model which extends the spiked rank-one tensor model of order three. This model is particularly motivated by a multi-view clustering problem in which multiple noisy observations of each data point are acquired, with potentially non-uniform variances along the views. In this case, data can be naturally represented by an order-three tensor where the v… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  8. arXiv:2304.10248  [pdf, ps, other

    stat.ML cs.LG

    Hotelling Deflation on Large Symmetric Spiked Tensors

    Authors: Mohamed El Amine Seddik, José Henrique de Morais Goulart, Maxime Guillaud

    Abstract: This paper studies the deflation algorithm when applied to estimate a low-rank symmetric spike contained in a large tensor corrupted by additive Gaussian noise. Specifically, we provide a precise characterization of the large-dimensional performance of deflation in terms of the alignments of the vectors obtained by successive rank-1 approximation and of their estimated weights, assuming non-trivia… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 4 pages, 1 figure

  9. arXiv:2302.05798  [pdf, ps, other

    stat.ML math.PR math.ST

    Optimizing Orthogonalized Tensor Deflation via Random Tensor Theory

    Authors: Mohamed El Amine Seddik, Mohammed Mahfoud, Merouane Debbah

    Abstract: This paper tackles the problem of recovering a low-rank signal tensor with possibly correlated components from a random noisy tensor, or so-called spiked tensor model. When the underlying components are orthogonal, they can be recovered efficiently using tensor deflation which consists of successive rank-one approximations, while non-orthogonal components may alter the tensor deflation mechanism,… ▽ More

    Submitted 16 March, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  10. arXiv:2211.09004  [pdf, other

    math.ST math.PR stat.ML

    On the Accuracy of Hotelling-Type Tensor Deflation: A Random Tensor Analysis

    Authors: Mohamed El Amine Seddik, Maxime Guillaud, Alexis Decurninge

    Abstract: Leveraging on recent advances in random tensor theory, we consider in this paper a rank-$r$ asymmetric spiked tensor model of the form $\sum_{i=1}^r β_i A_i + W$ where $β_i\geq 0$ and the $A_i$'s are rank-one tensors such that $\langle A_i, A_j \rangle\in [0, 1]$ for $i\neq j$, based on which we provide an asymptotic study of Hotelling-type tensor deflation in the large dimensional regime. Specifi… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  11. arXiv:2112.12348  [pdf, other

    math.PR math.SP stat.ML

    When Random Tensors meet Random Matrices

    Authors: Mohamed El Amine Seddik, Maxime Guillaud, Romain Couillet

    Abstract: Relying on random matrix theory (RMT), this paper studies asymmetric order-$d$ spiked tensor models with Gaussian noise. Using the variational definition of the singular vectors and values of (Lim, 2005), we show that the analysis of the considered model boils down to the analysis of an equivalent spiked symmetric \textit{block-wise} random matrix, that is constructed from \textit{contractions} of… ▽ More

    Submitted 19 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

  12. arXiv:2109.01785  [pdf, other

    cs.LG cs.SI stat.ML

    Node Feature Kernels Increase Graph Convolutional Network Robustness

    Authors: Mohamed El Amine Seddik, Changmin Wu, Johannes F. Lutzeyer, Michalis Vazirgiannis

    Abstract: The robustness of the much-used Graph Convolutional Networks (GCNs) to perturbations of their input is becoming a topic of increasing importance. In this paper, the random GCN is introduced for which a random matrix theory analysis is possible. This analysis suggests that if the graph is sufficiently perturbed, or in the extreme case random, then the GCN fails to benefit from the node features. It… ▽ More

    Submitted 21 February, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures

  13. arXiv:2102.09321  [pdf, other

    cs.CV

    Deep Miner: A Deep and Multi-branch Network which Mines Rich and Diverse Features for Person Re-identification

    Authors: Abdallah Benzine, Mohamed El Amine Seddik, Julien Desmarais

    Abstract: Most recent person re-identification approaches are based on the use of deep convolutional neural networks (CNNs). These networks, although effective in multiple tasks such as classification or object detection, tend to focus on the most discriminative part of an object rather than retrieving all its relevant features. This behavior penalizes the performance of a CNN for the re-identification task… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  14. arXiv:2001.08370  [pdf, other

    cs.LG stat.ML

    Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures

    Authors: Mohamed El Amine Seddik, Cosme Louart, Mohamed Tamaazousti, Romain Couillet

    Abstract: This paper shows that deep learning (DL) representations of data produced by generative adversarial nets (GANs) are random vectors which fall within the class of so-called \textit{concentrated} random vectors. Further exploiting the fact that Gram matrices, of the type $G = X^T X$ with $X=[x_1,\ldots,x_n]\in \mathbb{R}^{p\times n}$ and $x_i$ independent concentrated random vectors from a mixture m… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

  15. arXiv:1904.02672  [pdf, other

    cs.CV

    Deep Multi-class Adversarial Specularity Removal

    Authors: John Lin, Mohamed El Amine Seddik, Mohamed Tamaazousti, Youssef Tamaazousti, Adrien Bartoli

    Abstract: We propose a novel learning approach, in the form of a fully-convolutional neural network (CNN), which automatically and consistently removes specular highlights from a single image by generating its diffuse component. To train the generative network, we define an adversarial loss on a discriminative network as in the GAN framework and combined it with a content loss. In contrast to existing GAN a… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

  16. arXiv:1902.10467  [pdf, other

    cs.CV

    Generative Collaborative Networks for Single Image Super-Resolution

    Authors: Mohamed El Amine Seddik, Mohamed Tamaazousti, John Lin

    Abstract: A common issue of deep neural networks-based methods for the problem of Single Image Super-Resolution (SISR), is the recovery of finer texture details when super-resolving at large upscaling factors. This issue is particularly related to the choice of the objective loss function. In particular, recent works proposed the use of a VGG loss which consists in minimizing the error between the generated… ▽ More

    Submitted 12 March, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

  17. arXiv:1712.09708  [pdf, other

    cs.CV cs.LG

    Learning More Universal Representations for Transfer-Learning

    Authors: Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot, Mohamed El Amine Seddik, Mohamed Tamaazousti

    Abstract: A representation is supposed universal if it encodes any element of the visual world (e.g., objects, scenes) in any configuration (e.g., scale, context). While not expecting pure universal representations, the goal in the literature is to improve the universality level, starting from a representation with a certain level. To do so, the state-of-the-art consists in learning CNN-based representation… ▽ More

    Submitted 2 September, 2018; v1 submitted 27 December, 2017; originally announced December 2017.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)