Skip to main content

Showing 1–13 of 13 results for author: Pomponi, J

.
  1. arXiv:2405.02330  [pdf, other

    cs.IT cs.AI cs.LG

    Adaptive Semantic Token Selection for AI-native Goal-oriented Communications

    Authors: Alessio Devoto, Simone Petruzzi, Jary Pomponi, Paolo Di Lorenzo, Simone Scardapane

    Abstract: In this paper, we propose a novel design for AI-native goal-oriented communications, exploiting transformer neural networks under dynamic inference constraints on bandwidth and computation. Transformers have become the standard architecture for pretraining large-scale vision and text models, and preliminary results have shown promising performance also in deep joint source-channel coding (JSCC). H… ▽ More

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: 5 pages

    MSC Class: 94A40

  2. Conditional computation in neural networks: principles and research trends

    Authors: Simone Scardapane, Alessandro Baiocchi, Alessio Devoto, Valerio Marsocci, Pasquale Minervini, Jary Pomponi

    Abstract: This article summarizes principles and ideas from the emerging area of applying \textit{conditional computation} methods to the design of neural networks. In particular, we focus on neural networks that can dynamically activate or de-activate parts of their computational graph conditionally on their input. Examples include the dynamic selection of, e.g., input tokens, layers (or sets of layers), a… ▽ More

    Submitted 8 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Journal ref: Intelligenza Artificiale, vol. Pre-press, pp. 1-16, 2024

  3. arXiv:2402.01262  [pdf, other

    cs.LG cs.CV

    Class incremental learning with probability dampening and cascaded gated classifier

    Authors: Jary Pomponi, Alessio Devoto, Simone Scardapane

    Abstract: Humans are capable of acquiring new knowledge and transferring learned knowledge into different domains, incurring a small forgetting. The same ability, called Continual Learning, is challenging to achieve when operating with neural networks due to the forgetting affecting past learned tasks when learning new ones. This forgetting can be mitigated by replaying stored samples from past tasks, but a… ▽ More

    Submitted 23 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Previously called "Cascaded Scaling Classifier: class incremental learning with probability scaling ". The official code is available https://github.com/jaryP/CIL-Margin-Dampening-Gated-Classifier

  4. arXiv:2401.13330  [pdf, other

    cs.LG cs.CV

    NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks

    Authors: Matteo Gambella, Jary Pomponi, Simone Scardapane, Manuel Roveri

    Abstract: Early Exit Neural Networks (EENNs) endow astandard Deep Neural Network (DNN) with Early Exit Classifiers (EECs), to provide predictions at intermediate points of the processing when enough confidence in classification is achieved. This leads to many benefits in terms of effectiveness and efficiency. Currently, the design of EENNs is carried out manually by experts, a complex and time-consuming tas… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2208.02048  [pdf, other

    cs.LG stat.ML

    Centroids Matching: an efficient Continual Learning approach operating in the embedding space

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) occurs when a neural network loses the information previously learned while training on a set of samples from a different distribution, i.e., a new task. Existing approaches have achieved remarkable results in mitigating CF, especially in a scenario called task incremental learning. However, this scenario is not realistic, and limited work has been done to achieve good… ▽ More

    Submitted 10 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: Submitted to Transactions on Machine Learning Research (TMLR)

  6. arXiv:2202.05694  [pdf, other

    cs.LG stat.ML

    Continual Learning with Invertible Generative Models

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 27 December, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.02443

  7. Pixle: a fast and effective black-box attack based on rearranging pixels

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Recent research has found that neural networks are vulnerable to several types of adversarial attacks, where the input samples are modified in such a way that the model produces a wrong prediction that misclassifies the adversarial sample. In this paper we focus on black-box adversarial attacks, that can be performed without knowing the inner structure of the attacked model, nor the training proce… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  8. Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: In this paper, we propose a novel ensembling technique for deep neural networks, which is able to drastically reduce the required memory compared to alternative approaches. In particular, we propose to extract multiple sub-networks from a single, untrained neural network by solving an end-to-end optimization task combining differentiable scaling over the original architecture, with multiple regula… ▽ More

    Submitted 17 September, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Article accepted at Neural Networks

  9. arXiv:2104.00405  [pdf, other

    cs.LG cs.AI cs.CV

    Avalanche: an End-to-End Library for Continual Learning

    Authors: Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuzzolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adrian Popescu, Christopher Kanan, Joost van de Weijer , et al. (3 additional authors not shown)

    Abstract: Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have witnessed a renewed and fast-growing interest in continual learning, especially within the deep learning community. However, algorithmic solutions are often difficult to re-implement, evaluate and port across different settings, where even results on standa… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Official Website: https://avalanche.continualai.org

  10. arXiv:2007.02443  [pdf, other

    stat.ML cs.CV cs.LG

    Pseudo-Rehearsal for Continual Learning with Normalizing Flows

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 5 August, 2021; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: A preliminary unpublished version of this work was presented in the LifelongML workshop, at ICML 2020

  11. Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Bayesian Neural Networks (BNNs) are trained to optimize an entire distribution over their weights instead of a single set, having significant advantages in terms of, e.g., interpretability, multi-task learning, and calibration. Because of the intractability of the resulting optimization problem, most BNNs are either sampled through Monte Carlo methods, or trained by minimizing a suitable Evidence… ▽ More

    Submitted 30 September, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  12. arXiv:1911.11717  [pdf, other

    physics.data-an cs.LG hep-ex nucl-ex physics.ins-det

    DeepRICH: Learning Deeply Cherenkov Detectors

    Authors: Cristiano Fanelli, Jary Pomponi

    Abstract: Imaging Cherenkov detectors are largely used for particle identification (PID) in nuclear and particle physics experiments, where develo** fast reconstruction algorithms is becoming of paramount importance to allow for near real time calibration and data quality control, as well as to speed up offline analysis of large amount of data. In this paper we present DeepRICH, a novel deep learning algo… ▽ More

    Submitted 18 December, 2019; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: 14 pages, 9 figures, preprint

    Report number: JLAB-PHY-20-3179

    Journal ref: 2020 Mach. Learn.: Sci. Technol. 1 015010

  13. Efficient Continual Learning in Neural Networks with Embedding Regularization

    Authors: Jary Pomponi, Simone Scardapane, Vincenzo Lomonaco, Aurelio Uncini

    Abstract: Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered either the progressive increase in the size of the networks, or have tried to regularize the network behavior to equalize it with respect to previously observed t… ▽ More

    Submitted 11 February, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Journal ref: Neurocomputing, 397, pp. 139-148, 2020