Skip to main content

Showing 1–50 of 94 results for author: Bacciu, D

.
  1. arXiv:2406.09835  [pdf, other

    cs.LG

    I Know How: Combining Prior Policies to Solve New Tasks

    Authors: Malio Li, Elia Piccoli, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Multi-Task Reinforcement Learning aims at develo** agents that are able to continually evolve and adapt to new scenarios. However, this goal is challenging to achieve due to the phenomenon of catastrophic forgetting and the high demand of computational resources. Learning from scratch for each new task is not a viable or sustainable option, and thus agents should be able to collect and exploit p… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 7 pages, Conference on Games (CoG) 2024

  2. arXiv:2406.02740  [pdf, other

    cs.LG

    Long Range Propagation on Continuous-Time Dynamic Graphs

    Authors: Alessio Gravina, Giulio Lovisotto, Claudio Gallicchio, Davide Bacciu, Claas Grohnfeldt

    Abstract: Learning Continuous-Time Dynamic Graphs (C-TDGs) requires accurately modeling spatio-temporal information on streams of irregularly sampled events. While many methods have been proposed recently, we find that most message passing-, recurrent- or self-attention-based methods perform poorly on long-range tasks. These tasks require correlating information that occurred "far" away from the current eve… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024 (https://openreview.net/forum?id=gVg8V9isul)

  3. arXiv:2406.00394  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Causal Abstractions of Linear Structural Causal Models

    Authors: Riccardo Massidda, Sara Magliacane, Davide Bacciu

    Abstract: The need for modelling causal knowledge at different levels of granularity arises in several settings. Causal Abstraction provides a framework for formalizing this problem by relating two Structural Causal Models at different levels of detail. Despite increasing interest in applying causal abstraction, e.g. in the interpretability of large machine learning models, the graphical and parametrical co… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.17163  [pdf, other

    cs.LG eess.SY

    Injecting Hamiltonian Architectural Bias into Deep Graph Networks for Long-Range Propagation

    Authors: Simon Heilig, Alessio Gravina, Alessandro Trenta, Claudio Gallicchio, Davide Bacciu

    Abstract: The dynamics of information diffusion within graphs is a critical open issue that heavily influences graph representation learning, especially when considering long-range propagation. This calls for principled approaches that control and regulate the degree of propagation and dissipation of information throughout the neural flow. Motivated by this, we introduce (port-)Hamiltonian Deep Graph Networ… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2405.01009  [pdf, other

    cs.LG

    Tackling Graph Oversquashing by Global and Local Non-Dissipativity

    Authors: Alessio Gravina, Moshe Eliasof, Claudio Gallicchio, Davide Bacciu, Carola-Bibiane Schönlieb

    Abstract: A common problem in Message-Passing Neural Networks is oversquashing -- the limited ability to facilitate effective information flow between distant nodes. Oversquashing is attributed to the exponential decay in information transmission as node distances increase. This paper introduces a novel perspective to address oversquashing, leveraging properties of global and local non-dissipativity, that e… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  6. arXiv:2404.19508  [pdf, other

    cs.LG

    Temporal Graph ODEs for Irregularly-Sampled Time Series

    Authors: Alessio Gravina, Daniele Zambon, Davide Bacciu, Cesare Alippi

    Abstract: Modern graph representation learning works mostly under the assumption of dealing with regularly sampled temporal graph snapshots, which is far from realistic, e.g., social networks and physical systems are characterized by continuous dynamics and sporadic observations. To address this limitation, we introduce the Temporal Graph Ordinary Differential Equation (TG-ODE) framework, which learns both… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Preprint. Accepted at IJCAI 2024

  7. arXiv:2404.14909  [pdf, other

    cs.LG hep-th

    MultiSTOP: Solving Functional Equations with Reinforcement Learning

    Authors: Alessandro Trenta, Davide Bacciu, Andrea Cossu, Pietro Ferrero

    Abstract: We develop MultiSTOP, a Reinforcement Learning framework for solving functional equations in physics. This new methodology produces actual numerical solutions instead of bounds on them. We extend the original BootSTOP algorithm by adding multiple constraints derived from domain-specific knowledge, even in integral form, to improve the accuracy of the solution. We investigate a particular equation… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 Workshop on AI4DifferentialEquations In Science

  8. arXiv:2404.07817  [pdf, other

    cs.LG cs.AI

    Calibration of Continual Learning Models

    Authors: Lanpei Li, Elia Piccoli, Andrea Cossu, Davide Bacciu, Vincenzo Lomonaco

    Abstract: Continual Learning (CL) focuses on maximizing the predictive performance of a model across a non-stationary stream of data. Unfortunately, CL models tend to forget previous knowledge, thus often underperforming when compared with an offline model trained jointly on the entire data stream. Given that any CL model will eventually make mistakes, it is of crucial importance to build calibrated CL mode… ▽ More

    Submitted 12 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted at CLVISION workshop, CVPR 2024

  9. arXiv:2403.13130  [pdf, other

    cs.CL cs.AI cs.LG

    Self-generated Replay Memories for Continual Neural Machine Translation

    Authors: Michele Resta, Davide Bacciu

    Abstract: Modern Neural Machine Translation systems exhibit strong performance in several different languages and are constantly improving. Their ability to learn continuously is, however, still severely limited by the catastrophic forgetting issue. In this work, we leverage a key property of encoder-decoder Transformers, i.e. their generative ability, to propose a novel approach to continually learning Neu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted at NAACL 2024

  10. arXiv:2403.11292  [pdf, other

    cs.LG cs.AI

    Multi-Relational Graph Neural Network for Out-of-Domain Link Prediction

    Authors: Asma Sattar, Georgios Deligiorgis, Marco Trincavelli, Davide Bacciu

    Abstract: Dynamic multi-relational graphs are an expressive relational representation for data enclosing entities and relations of different types, and where relationships are allowed to vary in time. Addressing predictive tasks over such data requires the ability to find structure embeddings that capture the diversity of the relationships involved, as well as their dynamic evolution. In this work, we estab… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures, 3 Tables, conference [accepted in IEEE WCCI 2024]

  11. arXiv:2403.07015  [pdf, other

    cs.LG

    Adaptive Hyperparameter Optimization for Continual Learning Scenarios

    Authors: Rudy Semola, Julio Hurtado, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Hyperparameter selection in continual learning scenarios is a challenging and underexplored aspect, especially in practical non-stationary environments. Traditional approaches, such as grid searches with held-out validation data from all tasks, are unrealistic for building accurate lifelong learning systems. This paper aims to explore the role of hyperparameter selection in continual learning and… ▽ More

    Submitted 19 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  12. arXiv:2402.09030  [pdf, other

    cs.RO

    Awareness in robotics: An early perspective from the viewpoint of the EIC Pathfinder Challenge "Awareness Inside''

    Authors: Cosimo Della Santina, Carlos Hernandez Corbato, Burak Sisman, Luis A. Leiva, Ioannis Arapakis, Michalis Vakalellis, Jean Vanderdonckt, Luis Fernando D'Haro, Guido Manzi, Cristina Becchio, Aïda Elamrani, Mohsen Alirezaei, Ginevra Castellano, Dimos V. Dimarogonas, Arabinda Ghosh, Sofie Haesaert, Sadegh Soudjani, Sybert Stroeve, Paul Verschure, Davide Bacciu, Ophelia Deroy, Bahador Bahrami, Claudio Gallicchio, Sabine Hauert, Ricardo Sanz , et al. (6 additional authors not shown)

    Abstract: Consciousness has been historically a heavily debated topic in engineering, science, and philosophy. On the contrary, awareness had less success in raising the interest of scholars in the past. However, things are changing as more and more researchers are getting interested in answering questions concerning what awareness is and how it can be artificially generated. The landscape is rapidly evolvi… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  13. arXiv:2312.17397  [pdf, other

    cs.LG q-bio.BM

    Classifier-free graph diffusion for molecular property targeting

    Authors: Matteo Ninniri, Marco Podda, Davide Bacciu

    Abstract: This work focuses on the task of property targeting: that is, generating molecules conditioned on target chemical properties to expedite candidate screening for novel drug and materials development. DiGress is a recent diffusion model for molecular graphs whose distinctive feature is allowing property targeting through classifier-based (CB) guidance. While CB guidance may work to generate molecula… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted to GCLR workshop (AAAI '24)

  14. arXiv:2312.06256  [pdf, other

    cs.LG eess.SY

    Neural Autoencoder-Based Structure-Preserving Model Order Reduction and Control Design for High-Dimensional Physical Systems

    Authors: Marco Lepri, Davide Bacciu, Cosimo Della Santina

    Abstract: This work concerns control-oriented and structure-preserving learning of low-dimensional approximations of high-dimensional physical systems, with a focus on mechanical systems. We investigate the integration of neural autoencoders in model order reduction, while at the same time preserving Hamiltonian or Lagrangian structures. We focus on extensively evaluating the considered methodology by perfo… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 11 pages, 14 Figures

  15. arXiv:2309.08406  [pdf, other

    cs.LG stat.ML

    Constraint-Free Structure Learning with Smooth Acyclic Orientations

    Authors: Riccardo Massidda, Francesco Landolfi, Martina Cinquini, Davide Bacciu

    Abstract: The structure learning problem consists of fitting data generated by a Directed Acyclic Graph (DAG) to correctly reconstruct its arcs. In this context, differentiable approaches constrain or regularize the optimization problem using a continuous relaxation of the acyclicity property. The computational cost of evaluating graph acyclicity is cubic on the number of nodes and significantly affects sca… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  16. Modeling Edge Features with Deep Bayesian Graph Networks

    Authors: Daniele Atzeni, Federico Errica, Davide Bacciu, Alessio Micheli

    Abstract: We propose an extension of the Contextual Graph Markov Model, a deep and probabilistic machine learning model for graphs, to model the distribution of edge features. Our approach is architectural, as we introduce an additional Bayesian network map** edge features into discrete states to be used by the original model. In doing so, we are also able to build richer graph representations even in the… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Releasing pre-print version to comply with TAILOR project requirements

  17. arXiv:2307.14928  [pdf, other

    cs.SD cs.LG eess.AS

    Graph-based Polyphonic Multitrack Music Generation

    Authors: Emanuele Cosenza, Andrea Valenti, Davide Bacciu

    Abstract: Graphs can be leveraged to model polyphonic multitrack symbolic music, where notes, chords and entire sections may be linked at different levels of the musical hierarchy by tonal and rhythmic relationships. Nonetheless, there is a lack of works that consider graph representations in the context of deep learning systems for music generation. This paper bridges this gap by introducing a novel graph… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  18. Deep learning for dynamic graphs: models and benchmarks

    Authors: Alessio Gravina, Davide Bacciu

    Abstract: Recent progress in research on Deep Graph Networks (DGNs) has led to a maturation of the domain of learning on graphs. Despite the growth of this research field, there are still important challenges that are yet unsolved. Specifically, there is an urge of making DGNs suitable for predictive tasks on realworld systems of interconnected entities, which evolve over time. With the aim of fostering res… ▽ More

    Submitted 9 April, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Preprint version. Accepted at IEEE TNNLS (https://ieeexplore.ieee.org/document/10490120)

  19. arXiv:2306.10724  [pdf, other

    cs.LG

    Partial Hypernetworks for Continual Learning

    Authors: Hamed Hemati, Vincenzo Lomonaco, Davide Bacciu, Damian Borth

    Abstract: Hypernetworks mitigate forgetting in continual learning (CL) by generating task-dependent weights and penalizing weight changes at a meta-model level. Unfortunately, generating all weights is not only computationally expensive for larger architectures, but also, it is not well understood whether generating all model weights is necessary. Inspired by latent replay methods in CL, we propose partial… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  20. arXiv:2306.07218  [pdf, other

    cs.LG cs.AI

    A Protocol for Continual Explanation of SHAP

    Authors: Andrea Cossu, Francesco Spinnato, Riccardo Guidotti, Davide Bacciu

    Abstract: Continual Learning trains models on a stream of data, with the aim of learning new information without forgetting previous knowledge. Given the dynamic nature of such environments, explaining the predictions of these models can be challenging. We study the behavior of SHAP values explanations in Continual Learning and propose an evaluation protocol to robustly assess the change of explanations in… ▽ More

    Submitted 20 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: ESANN 2023, 6 pages, added link to code

  21. arXiv:2306.06064  [pdf, other

    cs.NE cs.LG

    Neural Algorithmic Reasoning for Combinatorial Optimisation

    Authors: Dobrik Georgiev, Danilo Numeroso, Davide Bacciu, Pietro Liò

    Abstract: Solving NP-hard/complete combinatorial problems with neural networks is a challenging research area that aims to surpass classical approximate algorithms. The long-term objective is to outperform hand-designed heuristics for NP-hard/complete problems by learning to generate superior solutions solely from training data. Current neural-based methods for solving CO problems often overlook the inheren… ▽ More

    Submitted 13 February, 2024; v1 submitted 18 May, 2023; originally announced June 2023.

  22. arXiv:2305.16396  [pdf

    cs.LG math.OC

    ADLER -- An efficient Hessian-based strategy for adaptive learning rate

    Authors: Dario Balboni, Davide Bacciu

    Abstract: We derive a sound positive semi-definite approximation of the Hessian of deep models for which Hessian-vector products are easily computable. This enables us to provide an adaptive SGD learning rate strategy based on the minimization of the local quadratic approximation, which requires just twice the computation of a single SGD run, but performs comparably with grid search on SGD learning rates on… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 6 pages, 4 figures

  23. arXiv:2303.15888  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Projected Latent Distillation for Data-Agnostic Consolidation in Distributed Continual Learning

    Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu, Joost van de Weijer

    Abstract: Distributed learning on the edge often comprises self-centered devices (SCD) which learn local tasks independently and are unwilling to contribute to the performance of other SDCs. How do we achieve forward transfer at zero cost for the single SCDs? We formalize this problem as a Distributed Continual Learning scenario, where SCD adapt to local tasks and a CL model consolidates the knowledge from… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  24. arXiv:2302.04496  [pdf, other

    cs.LG cs.DS

    Dual Algorithmic Reasoning

    Authors: Danilo Numeroso, Davide Bacciu, Petar Veličković

    Abstract: Neural Algorithmic Reasoning is an emerging area of machine learning which seeks to infuse algorithmic computation in neural networks, typically by training neural models to approximate steps of classical algorithms. In this context, much of the current work has focused on learning reachability and shortest path graph algorithms, showing that joint learning on similar algorithms is beneficial for… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: To appear at ICLR 2023. 16 pages, 9 figures

  25. arXiv:2301.11396  [pdf, other

    cs.LG

    Class-Incremental Learning with Repetition

    Authors: Hamed Hemati, Andrea Cossu, Antonio Carta, Julio Hurtado, Lorenzo Pellegrini, Davide Bacciu, Vincenzo Lomonaco, Damian Borth

    Abstract: Real-world data streams naturally include the repetition of previous concepts. From a Continual Learning (CL) perspective, repetition is a property of the environment and, unlike replay, cannot be controlled by the agent. Nowadays, the Class-Incremental (CI) scenario represents the leading test-bed for assessing and comparing CL strategies. This scenario type is very easy to use, but it never allo… ▽ More

    Submitted 19 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023 19 pages

  26. arXiv:2301.09496  [pdf, other

    cs.LG

    ECGAN: Self-supervised generative adversarial network for electrocardiography

    Authors: Lorenzo Simone, Davide Bacciu

    Abstract: High-quality synthetic data can support the development of effective predictive models for biomedical tasks, especially in rare diseases or when subject to compelling privacy constraints. These limitations, for instance, negatively impact open access to electrocardiography datasets about arrhythmias. This work introduces a self-supervised approach to the generation of synthetic electrocardiography… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  27. arXiv:2211.12270  [pdf, other

    cs.AI

    Causal Abstraction with Soft Interventions

    Authors: Riccardo Massidda, Atticus Geiger, Thomas Icard, Davide Bacciu

    Abstract: Causal abstraction provides a theory describing how several causal models can represent the same system at different levels of detail. Existing theoretical proposals limit the analysis of abstract models to "hard" interventions fixing causal variables to be constant values. In this work, we extend causal abstraction to "soft" interventions, which assign possibly non-constant functions to variables… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  28. arXiv:2210.09789  [pdf, other

    cs.LG

    Anti-Symmetric DGN: a stable architecture for Deep Graph Networks

    Authors: Alessio Gravina, Davide Bacciu, Claudio Gallicchio

    Abstract: Deep Graph Networks (DGNs) currently dominate the research landscape of learning from graphs, due to their efficiency and ability to implement an adaptive message-passing scheme between the nodes. However, DGNs are typically limited in their ability to propagate and preserve long-term dependencies between nodes, i.e., they suffer from the over-squashing phenomena. This reduces their effectiveness,… ▽ More

    Submitted 8 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted at ICLR 2023 (https://openreview.net/forum?id=J3Y7cgZOOS)

  29. arXiv:2210.02095  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    ChemAlgebra: Algebraic Reasoning on Chemical Reactions

    Authors: Andrea Valenti, Davide Bacciu, Antonio Vergari

    Abstract: While showing impressive performance on various kinds of learning tasks, it is yet unclear whether deep learning models have the ability to robustly tackle reasoning tasks. than by learning the underlying reasoning process that is actually required to solve the tasks. Measuring the robustness of reasoning in machine learning models is challenging as one needs to provide a task that cannot be easil… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  30. arXiv:2209.05336  [pdf, other

    cs.LG

    Modular Representations for Weak Disentanglement

    Authors: Andrea Valenti, Davide Bacciu

    Abstract: The recently introduced weakly disentangled representations proposed to relax some constraints of the previous definitions of disentanglement, in exchange for more flexibility. However, at the moment, weak disentanglement can only be achieved by increasing the amount of supervision as the number of factors of variations of the data increase. In this paper, we introduce modular representations for… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted at ESANN2022

  31. Generalizing Downsampling from Regular Data to Graphs

    Authors: Davide Bacciu, Alessio Conte, Francesco Landolfi

    Abstract: Downsampling produces coarsened, multi-resolution representations of data and it is used, for example, to produce lossy compression and visualization of large images, reduce computational costs, and boost deep neural representation learning. Unfortunately, due to their lack of a regular structure, there is still no consensus on how downsampling should apply to graphs and linked data. Indeed reduct… ▽ More

    Submitted 1 December, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted at AAAI 2023; Extended version with proofs; 15 pages, 3 figures, 5 tables

  32. arXiv:2207.01145  [pdf, other

    cs.LG

    Memory Population in Continual Learning via Outlier Elimination

    Authors: Julio Hurtado, Alain Raymond-Saez, Vladimir Araujo, Vincenzo Lomonaco, Alvaro Soto, Davide Bacciu

    Abstract: Catastrophic forgetting, the phenomenon of forgetting previously learned tasks when learning a new one, is a major hurdle in develo** continual learning algorithms. A popular method to alleviate forgetting is to use a memory buffer, which stores a subset of previously learned task examples for use during training on new tasks. The de facto method of filling memory is by randomly selecting previo… ▽ More

    Submitted 3 October, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

  33. arXiv:2207.00200  [pdf, other

    cs.LG cs.CV

    Studying the impact of magnitude pruning on contrastive learning methods

    Authors: Francesco Corti, Rahim Entezari, Sara Hooker, Davide Bacciu, Olga Saukh

    Abstract: We study the impact of different pruning techniques on the representation learned by deep neural networks trained with contrastive loss functions. Our work finds that at high sparsity levels, contrastive learning results in a higher number of misclassified examples relative to models trained with traditional cross-entropy loss. To understand this pronounced difference, we use metrics such as the n… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  34. arXiv:2207.00010  [pdf, other

    cs.LG cs.AI cs.HC

    Continual Learning for Human State Monitoring

    Authors: Federico Matteoni, Andrea Cossu, Claudio Gallicchio, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Continual Learning (CL) on time series data represents a promising but under-studied avenue for real-world applications. We propose two new CL benchmarks for Human State Monitoring. We carefully designed the benchmarks to mirror real-world environments in which new subjects are continuously added. We conducted an empirical evaluation to assess the ability of popular CL strategies to mitigate forge… ▽ More

    Submitted 11 July, 2022; v1 submitted 29 June, 2022; originally announced July 2022.

    Comments: 6 pages, 4 figures, 2 tables, Accepted as oral at ESANN 2022

  35. arXiv:2206.11849  [pdf, other

    cs.LG cs.AI cs.CV

    Sample Condensation in Online Continual Learning

    Authors: Mattia Sangermano, Antonio Carta, Andrea Cossu, Davide Bacciu

    Abstract: Online Continual learning is a challenging learning scenario where the model must learn from a non-stationary stream of data where each sample is seen only once. The main challenge is to incrementally learn while avoiding catastrophic forgetting, namely the problem of forgetting previously acquired knowledge while learning from new data. A popular solution in these scenario is to use a small memor… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted as a conference paper at 2022 International Joint Conference on Neural Networks (IJCNN 2022). Part of 2022 IEEE World Congress on Computational Intelligence (IEEE WCCI 2022)

  36. arXiv:2206.11087  [pdf, ps, other

    cs.NE cs.LG

    Federated Adaptation of Reservoirs via Intrinsic Plasticity

    Authors: Valerio De Caro, Claudio Gallicchio, Davide Bacciu

    Abstract: We propose a novel algorithm for performing federated learning with Echo State Networks (ESNs) in a client-server scenario. In particular, our proposal focuses on the adaptation of reservoirs by combining Intrinsic Plasticity with Federated Averaging. The former is a gradient-based method for adapting the reservoir's non-linearity in a local and unsupervised manner, while the latter provides the f… ▽ More

    Submitted 25 May, 2022; originally announced June 2022.

    Comments: 6 pages

  37. arXiv:2206.06957  [pdf, other

    cs.LG cs.AI cs.DC

    Continual-Learning-as-a-Service (CLaaS): On-Demand Efficient Adaptation of Predictive Models

    Authors: Rudy Semola, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Predictive machine learning models nowadays are often updated in a stateless and expensive way. The two main future trends for companies that want to build machine learning-based applications and systems are real-time inference and continual updating. Unfortunately, both trends require a mature infrastructure that is hard and costly to realize on-premise. This paper defines a novel software servic… ▽ More

    Submitted 21 July, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

  38. arXiv:2205.10056  [pdf, other

    cs.LG

    Leveraging Relational Information for Learning Weakly Disentangled Representations

    Authors: Andrea Valenti, Davide Bacciu

    Abstract: Disentanglement is a difficult property to enforce in neural representations. This might be due, in part, to a formalization of the disentanglement problem that focuses too heavily on separating relevant factors of variation of the data in single isolated dimensions of the neural representation. We argue that such a definition might be too restrictive and not necessarily beneficial in terms of dow… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted at WCCI2022

  39. arXiv:2205.09357  [pdf, other

    cs.LG cs.AI

    Continual Pre-Training Mitigates Forgetting in Language and Vision

    Authors: Andrea Cossu, Tinne Tuytelaars, Antonio Carta, Lucia Passaro, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Pre-trained models are nowadays a fundamental component of machine learning research. In continual learning, they are commonly used to initialize the model before training on the stream of non-stationary data. However, pre-training is rarely applied during continual learning. We formalize and investigate the characteristics of the continual pre-training scenario in both language and vision environ… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: under review

  40. arXiv:2205.08935  [pdf, other

    cs.CV cs.LG cs.NE

    Deep Features for CBIR with Scarce Data using Hebbian Learning

    Authors: Gabriele Lagani, Davide Bacciu, Claudio Gallicchio, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato

    Abstract: Features extracted from Deep Neural Networks (DNNs) have proven to be very effective in the context of Content Based Image Retrieval (CBIR). In recent work, biologically inspired \textit{Hebbian} learning algorithms have shown promises for DNN training. In this contribution, we study the performance of such algorithms in the development of feature extractors for CBIR tasks. Specifically, we consid… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: 6 Pages, 1 Figure, 2 Tables, Submitted at CBMI 2022

  41. arXiv:2204.08938  [pdf, other

    cs.AI cs.LG

    Learning heuristics for A*

    Authors: Danilo Numeroso, Davide Bacciu, Petar Veličković

    Abstract: Path finding in graphs is one of the most studied classes of problems in computer science. In this context, search algorithms are often extended with heuristics for a more efficient search of target nodes. In this work we combine recent advancements in Neural Algorithmic Reasoning to learn efficient heuristic functions for path finding problems on graphs. At training time, we exploit multi-task le… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 7 pages, 2 figures. To appear at the ICLR 2022 GroundedML Workshop

  42. Practical Recommendations for Replay-based Continual Learning Methods

    Authors: Gabriele Merlin, Vincenzo Lomonaco, Andrea Cossu, Antonio Carta, Davide Bacciu

    Abstract: Continual Learning requires the model to learn from a stream of dynamic, non-stationary data without forgetting previous knowledge. Several approaches have been developed in the literature to tackle the Continual Learning challenge. Among them, Replay approaches have empirically proved to be the most effective ones. Replay operates by saving some samples in memory which are then used to rehearse k… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Journal ref: ICIAP 2022 Workshops

  43. arXiv:2202.13657  [pdf, other

    cs.LG cs.AI cs.CV

    Avalanche RL: a Continual Reinforcement Learning Library

    Authors: Nicolò Lucchesi, Antonio Carta, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Continual Reinforcement Learning (CRL) is a challenging setting where an agent learns to interact with an environment that is constantly changing over time (the stream of experiences). In this paper, we describe Avalanche RL, a library for Continual Reinforcement Learning which allows to easily train agents on a continuous stream of tasks. Avalanche RL is based on PyTorch and supports any OpenAI G… ▽ More

    Submitted 24 March, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: Presented at the 21st International Conference on Image Analysis and Processing (ICIAP 2021)

  44. arXiv:2202.01645  [pdf, other

    cs.AI cs.HC

    AI-as-a-Service Toolkit for Human-Centered Intelligence in Autonomous Driving

    Authors: Valerio De Caro, Saira Bano, Achilles Machumilane, Alberto Gotta, Pietro Cassará, Antonio Carta, Rudy Semola, Christos Sardianos, Christos Chronis, Iraklis Varlamis, Konstantinos Tserpes, Vincenzo Lomonaco, Claudio Gallicchio, Davide Bacciu

    Abstract: This paper presents a proof-of-concept implementation of the AI-as-a-Service toolkit developed within the H2020 TEACHING project and designed to implement an autonomous driving personalization system according to the output of an automatic driver's stress recognition algorithm, both of them realizing a Cyber-Physical System of Systems. In addition, we implemented a data-gathering subsystem to coll… ▽ More

    Submitted 9 February, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

  45. arXiv:2112.06511  [pdf, other

    cs.LG cs.AI cs.CV

    Ex-Model: Continual Learning from a Stream of Trained Models

    Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Learning continually from non-stationary data streams is a challenging research topic of growing popularity in the last few years. Being able to learn, adapt, and generalize continually in an efficient, effective, and scalable way is fundamental for a sustainable development of Artificial Intelligent systems. However, an agent-centric view of continual learning requires learning directly from raw… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

  46. arXiv:2112.02925  [pdf, other

    cs.LG cs.AI

    Is Class-Incremental Enough for Continual Learning?

    Authors: Andrea Cossu, Gabriele Graffieti, Lorenzo Pellegrini, Davide Maltoni, Davide Bacciu, Antonio Carta, Vincenzo Lomonaco

    Abstract: The ability of a model to learn continually can be empirically assessed in different continual learning scenarios. Each scenario defines the constraints and the opportunities of the learning environment. Here, we challenge the current trend in the continual learning literature to experiment mainly on class-incremental scenarios, where classes present in one experience are never revisited. We posit… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Under review

  47. arXiv:2111.02133  [pdf, other

    cs.DC cs.LG eess.SY

    Predictive Auto-scaling with OpenStack Monasca

    Authors: Giacomo Lanciano, Filippo Galli, Tommaso Cucinotta, Davide Bacciu, Andrea Passarella

    Abstract: Cloud auto-scaling mechanisms are typically based on reactive automation rules that scale a cluster whenever some metric, e.g., the average CPU usage among instances, exceeds a predefined threshold. Tuning these rules becomes particularly cumbersome when scaling-up a cluster involves non-negligible times to bootstrap new instances, as it happens frequently in production cloud services. To deal w… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted at 2021 IEEE/ACM 14th International Conference on Utility and Cloud Computing (UCC'21)

  48. arXiv:2110.01677  [pdf, ps, other

    cs.LG cs.AI

    Inductive learning for product assortment graph completion

    Authors: Haris Dukic, Georgios Deligiorgis, Pierpaolo Sepe, Davide Bacciu, Marco Trincavelli

    Abstract: Global retailers have assortments that contain hundreds of thousands of products that can be linked by several types of relationships like style compatibility, "bought together", "watched together", etc. Graphs are a natural representation for assortments, where products are nodes and relations are edges. Relations like style compatibility are often produced by a manual process and therefore do no… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 6 pages

    MSC Class: 68T07 ACM Class: I.2; I.5

    Journal ref: ESANN 2021 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning

  49. arXiv:2107.08396  [pdf, ps, other

    cs.LG

    GraphGen-Redux: a Fast and Lightweight Recurrent Model for labeled Graph Generation

    Authors: Marco Podda, Davide Bacciu

    Abstract: The problem of labeled graph generation is gaining attention in the Deep Learning community. The task is challenging due to the sparse and discrete nature of graph spaces. Several approaches have been proposed in the literature, most of which require to transform the graphs into sequences that encode their structure and labels and to learn the distribution of such sequences through an auto-regress… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

  50. arXiv:2107.06543  [pdf, other

    cs.AI cs.LG

    TEACHING -- Trustworthy autonomous cyber-physical applications through human-centred intelligence

    Authors: Davide Bacciu, Siranush Akarmazyan, Eric Armengaud, Manlio Bacco, George Bravos, Calogero Calandra, Emanuele Carlini, Antonio Carta, Pietro Cassara, Massimo Coppola, Charalampos Davalas, Patrizio Dazzi, Maria Carmela Degennaro, Daniele Di Sarli, Jürgen Dobaj, Claudio Gallicchio, Sylvain Girbal, Alberto Gotta, Riccardo Groppo, Vincenzo Lomonaco, Georg Macher, Daniele Mazzei, Gabriele Mencagli, Dimitrios Michail, Alessio Micheli , et al. (10 additional authors not shown)

    Abstract: This paper discusses the perspective of the H2020 TEACHING project on the next generation of autonomous applications running in a distributed and highly heterogeneous environment comprising both virtual and physical resources spanning the edge-cloud continuum. TEACHING puts forward a human-centred vision leveraging the physiological, emotional, and cognitive state of the users as a driver for the… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.