Skip to main content

Showing 1–10 of 10 results for author: Nabli, A

.
  1. arXiv:2406.02613  [pdf, other

    cs.LG cs.AI

    ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training

    Authors: Adel Nabli, Louis Fournier, Pierre Erbacher, Louis Serrano, Eugene Belilovsky, Edouard Oyallon

    Abstract: Training Large Language Models (LLMs) relies heavily on distributed implementations, employing multiple GPUs to compute stochastic gradients on model replicas in parallel. However, synchronizing gradients in data parallel settings induces a communication overhead increasing with the number of distributed workers, which can impede the efficiency gains of parallelization. To address this challenge,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.17517  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average

    Authors: Louis Fournier, Adel Nabli, Masih Aminbeidokhti, Marco Pedersoli, Eugene Belilovsky, Edouard Oyallon

    Abstract: The performance of deep neural networks is enhanced by ensemble methods, which average the output of several models. However, this comes at an increased cost at inference. Weight averaging methods aim at balancing the generalization of ensembling and the inference speed of a single model by averaging the parameters of an ensemble of models. Yet, naive averaging results in poor performance as model… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2306.08289  [pdf, other

    cs.LG cs.AI cs.DC

    $\textbf{A}^2\textbf{CiD}^2$: Accelerating Asynchronous Communication in Decentralized Deep Learning

    Authors: Adel Nabli, Eugene Belilovsky, Edouard Oyallon

    Abstract: Distributed training of Deep Learning models has been critical to many recent successes in the field. Current standard methods primarily rely on synchronous centralized algorithms which induce major communication bottlenecks and synchronization locks at scale. Decentralized asynchronous algorithms are emerging as a potential alternative but their practical applicability still lags. In order to mit… ▽ More

    Submitted 6 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems, Dec 2023, New Orleans, United States

  4. arXiv:2208.00779  [pdf, ps, other

    math.OC cs.AI cs.DC

    DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization

    Authors: Adel Nabli, Edouard Oyallon

    Abstract: This work introduces DADAO: the first decentralized, accelerated, asynchronous, primal, first-order algorithm to minimize a sum of $L$-smooth and $μ$-strongly convex functions distributed over a given network of size $n$. Our key insight is based on modeling the local gradient updates and gossip communication procedures with separate independent Poisson Point Processes. This allows us to decoupl… ▽ More

    Submitted 6 December, 2023; v1 submitted 26 July, 2022; originally announced August 2022.

    Comments: International Conference on Machine Learning, Jul 2023, Honolulu, United States

  5. arXiv:2204.05148  [pdf, other

    cs.AI

    Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning

    Authors: Robin Algayres, Adel Nabli, Benoit Sagot, Emmanuel Dupoux

    Abstract: We introduce a simple neural encoder architecture that can be trained using an unsupervised contrastive learning objective which gets its positive samples from data-augmented k-Nearest Neighbors search. We show that when built on top of recent self-supervised audio representations, this method can be applied iteratively and yield competitive SSE as evaluated on two tasks: query-by-example of rando… ▽ More

    Submitted 21 October, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Interspeech 2022 New version on 10/21/23 with appendix data and gitlab link

  6. arXiv:2007.03151  [pdf, other

    cs.LG cs.GT stat.ML

    Curriculum learning for multilevel budgeted combinatorial problems

    Authors: Adel Nabli, Margarida Carvalho

    Abstract: Learning heuristics for combinatorial optimization problems through graph neural networks have recently shown promising results on some classic NP-hard problems. These are single-level optimization problems with only one player. Multilevel combinatorial optimization problems are their generalization, encompassing situations with multiple players taking decisions sequentially. By framing them in a… ▽ More

    Submitted 26 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020, December 2020

  7. arXiv:2007.02370  [pdf, ps, other

    cs.CC cs.DM cs.GT

    Complexity of the Multilevel Critical Node Problem

    Authors: Adel Nabli, Margarida Carvalho, Pierre Hosteins

    Abstract: In this work, we analyze a sequential game played in a graph called the Multilevel Critical Node problem (MCN). A defender and an attacker are the players of this game. The defender starts by preventively interdicting vertices (vaccination) from being attacked. Then, the attacker infects a subset of non-vaccinated vertices and, finally, the defender reacts with a protection strategy. We provide th… ▽ More

    Submitted 2 October, 2020; v1 submitted 5 July, 2020; originally announced July 2020.

  8. arXiv:1203.3589  [pdf

    cs.DB cs.IR

    Building MultiView Analyst Profile From Multidimensional Query Logs: From Consensual to Conflicting Preferences

    Authors: Eya Ben Ahmed, Ahlem Nabli, Faïez Gargouri

    Abstract: In order to provide suitable results to the analyst needs, user preferences summarization is widely used in several domains. In this paper, we introduce a new approach for user profile construction from OLAP query logs. The key idea is to learn the user's preferences by drawing the evidence from OLAP logs. In fact, the analyst preferences are clustered into three main pools : (i) consensual or non… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: 8 pages

    Journal ref: IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 1, No 2, January 2012 ISSN (Online): 1694-0814 www.IJCSI.org

  9. arXiv:1112.5957   

    cs.DB

    Usage Des Mesures Pour La Génération Des Règles d'Associations Cycliques

    Authors: Eya Ben Ahmed, Ahlem Nabli, Faïez Gargouri

    Abstract: The online analytical processing (OLAP) does not provide any explanation of correlations discovered between data. Thus, the coupling of OLAP and data mining, especially association rules, is considered as an efficient solution to this problem. In this context, we mainly focus on a particular class of association rules which is the cyclic association rules. These rules aimed to discover patterns th… ▽ More

    Submitted 9 September, 2012; v1 submitted 27 December, 2011; originally announced December 2011.

    Comments: 18 pages, 3 figures; 7 ème journées Francophones sur les Entrepôts de données et l'Analyse en ligne (EDA'2011)

  10. arXiv:1107.1779  [pdf

    cs.DB

    A Survey of User-Centric Data Warehouses: From Personalization to Recommendation

    Authors: Eya Ben Ahmed, Ahlem Nabli, Faïez Gargouri

    Abstract: Providing a customized support for the OLAP brings tremendous challenges to the OLAP technology. Standing at the crossroads of the preferences and the data warehouse, two emerging trends are pointed out; namely: (i) the personalization and (ii) the recommendation. Although the panoply of the proposed approaches, the user-centric data warehouse community issues have not been addressed yet. In this… ▽ More

    Submitted 9 July, 2011; originally announced July 2011.

    Comments: 13 pages, 3 figures, 1 table

    Journal ref: The International Journal of Database Management Systems (IJDMS), May 2011, Volume 3, Number 2