Skip to main content

Showing 1–7 of 7 results for author: Dimitriadis, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07813  [pdf, other

    cs.LG cs.CV

    Localizing Task Information for Improved Model Merging and Compression

    Authors: Ke Wang, Nikolaos Dimitriadis, Guillermo Ortiz-Jimenez, François Fleuret, Pascal Frossard

    Abstract: Model merging and task arithmetic have emerged as promising scalable approaches to merge multiple single-task checkpoints to one multi-task model, but their applicability is reduced by significant performance loss. Previous works have linked these drops to interference in the weight space and erasure of important task-specific features. Instead, in this work we show that the information required t… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted ICML 2024; The first two authors contributed equally to this work; Project website: https://tall-masks.github.io

  2. arXiv:2306.08021  [pdf, other

    cs.LG cs.AI

    Flexible Channel Dimensions for Differentiable Architecture Search

    Authors: Ahmet Caner Yüzügüler, Nikolaos Dimitriadis, Pascal Frossard

    Abstract: Finding optimal channel dimensions (i.e., the number of filters in DNN layers) is essential to design DNNs that perform well under computational resource constraints. Recent work in neural architecture search aims at automating the optimization of the DNN model implementation. However, existing neural architecture search methods for channel dimensions rely on fixed search spaces, which prevents ac… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2304.10857  [pdf, other

    cs.LG

    SequeL: A Continual Learning Library in PyTorch and JAX

    Authors: Nikolaos Dimitriadis, Francois Fleuret, Pascal Frossard

    Abstract: Continual Learning is an important and challenging problem in machine learning, where models must adapt to a continuous stream of new data without forgetting previously acquired knowledge. While existing frameworks are built on PyTorch, the rising popularity of JAX might lead to divergent codebases, ultimately hindering reproducibility and progress. To address this problem, we introduce SequeL, a… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 7 pages, 1 figure, 4 code listings

  4. arXiv:2210.09759  [pdf, other

    cs.LG

    Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models

    Authors: Nikolaos Dimitriadis, Pascal Frossard, François Fleuret

    Abstract: In Multi-Task Learning (MTL), tasks may compete and limit the performance achieved on each other, rather than guiding the optimization to a solution, superior to all its single-task trained counterparts. Since there is often not a unique solution optimal for all tasks, practitioners have to balance tradeoffs between tasks' performance, and resort to optimality in the Pareto sense. Most MTL methodo… ▽ More

    Submitted 14 June, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted ICML 2023

  5. arXiv:2203.12412  [pdf, other

    cs.LG cs.AI cs.CV

    U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search

    Authors: Ahmet Caner Yüzügüler, Nikolaos Dimitriadis, Pascal Frossard

    Abstract: Optimizing resource utilization in target platforms is key to achieving high performance during DNN inference. While optimizations have been proposed for inference latency, memory footprint, and energy consumption, prior hardware-aware neural architecture search (NAS) methods have omitted resource utilization, preventing DNNs to take full advantage of the target inference platforms. Modeling resou… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  6. arXiv:2203.01016  [pdf, other

    cs.LG

    The Theoretical Expressiveness of Maxpooling

    Authors: Kyle Matoba, Nikolaos Dimitriadis, François Fleuret

    Abstract: Over the decade since deep neural networks became state of the art image classifiers there has been a tendency towards less use of max pooling: the function that takes the largest of nearby pixels in an image. Since max pooling featured prominently in earlier generations of image classifiers, we wish to understand this trend, and whether it is justified. We develop a theoretical framework analyzin… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 31 pages, 6 figures

  7. arXiv:2011.07643  [pdf, other

    cs.LG eess.IV

    Advances in the training, pruning and enforcement of shape constraints of Morphological Neural Networks using Tropical Algebra

    Authors: Nikolaos Dimitriadis, Petros Maragos

    Abstract: In this paper we study an emerging class of neural networks based on the morphological operators of dilation and erosion. We explore these networks mathematically from a tropical geometry perspective as well as mathematical morphology. Our contributions are threefold. First, we examine the training of morphological networks via Difference-of-Convex programming methods and extend a binary morpholog… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: 15 pages