Skip to main content

Showing 1–50 of 73 results for author: Rodola, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15057  [pdf, other

    cs.LG

    Latent Space Translation via Inverse Relative Projection

    Authors: Valentino Maiorca, Luca Moschella, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: The emergence of similar representations between independently trained neural models has sparked significant interest in the representation learning community, leading to the development of various methods to obtain communication between latent spaces. "Latent space communication" can be achieved in two ways: i) by independently map** the original spaces to a shared or relative one; ii) by direc… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.00664, arXiv:2406.11014

  2. arXiv:2406.14183  [pdf, other

    cs.LG

    Latent Functional Maps

    Authors: Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural models learn data representations that lie on low-dimensional manifolds, yet modeling the relation between these representational spaces is an ongoing challenge. By integrating spectral geometry principles into neural modeling, we show that this problem can be better addressed in the functional domain, mitigating complexity, while enhancing interpretability and performances on downstream ta… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.17897  [pdf, other

    cs.LG

    $C^2M^3$: Cycle-Consistent Multi-Model Merging

    Authors: Donato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà

    Abstract: In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, our method optimizes for the permutations of network neurons globally across all layers. This allows us to enforce cycle consistency of the permutations when merging $N \geq 3$ models, allowing circular compositions of permutations to be computed without accumulatin… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages, 16 figures

  4. arXiv:2405.12895  [pdf, other

    cs.GR cs.CV

    Implicit-ARAP: Efficient Handle-Guided Deformation of High-Resolution Meshes and Neural Fields via Local Patch Meshing

    Authors: Daniele Baieri, Filippo Maggioli, Zorah Lähner, Simone Melzi, Emanuele Rodolà

    Abstract: In this work, we present the local patch mesh representation for neural signed distance fields. This technique allows to discretize local regions of the level sets of an input SDF by projecting and deforming flat patch meshes onto the level set surface, using exclusively the SDF information and its gradient. Our analysis reveals this method to be more accurate than the standard marching cubes algo… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 12 pages, 16 figures

    MSC Class: 68U05 ACM Class: I.3.5; I.2.6

  5. arXiv:2405.09062  [pdf, other

    cs.SD cs.LG eess.AS

    Naturalistic Music Decoding from EEG Data via Latent Diffusion Models

    Authors: Emilian Postolache, Natalia Polouliakh, Hiroaki Kitano, Akima Connelly, Emanuele Rodolà, Luca Cosmo, Taketo Akama

    Abstract: In this article, we explore the potential of using latent diffusion models, a family of powerful generative models, for the task of reconstructing naturalistic music from electroencephalogram (EEG) recordings. Unlike simpler music with limited timbres, such as MIDI-generated tunes or monophonic pieces, the focus here is on intricate music featuring a diverse array of instruments, voices, and effec… ▽ More

    Submitted 3 July, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  6. arXiv:2404.16969  [pdf, other

    cs.SD cs.LG eess.AS

    COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations

    Authors: Ruben Ciranni, Emilian Postolache, Giorgio Mariani, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

    Abstract: We present COCOLA (Coherence-Oriented Contrastive Learning for Audio), a contrastive learning method for musical audio representations that captures the harmonic and rhythmic coherence between samples. Our method operates at the level of stems (or their combinations) composing music tracks and allows the objective evaluation of compositional models for music in the task of accompaniment generation… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Demo page: https://github.com/gladia-research-group/cocola

  7. arXiv:2404.12917  [pdf, other

    cs.LG cs.AI cs.CV

    Zero-Shot Stitching in Reinforcement Learning using Relative Representations

    Authors: Antonio Pio Ricciardi, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Visual Reinforcement Learning is a popular and powerful framework that takes full advantage of the Deep Learning breakthrough. However, it is also known that variations in the input (e.g., different colors of the panorama due to the season of the year) or the task (e.g., changing the speed limit for a car to respect) could require complete retraining of the agents. In this work, we leverage recent… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures, 4 tables

    MSC Class: 68T07 ACM Class: I.2.6

  8. arXiv:2403.11706  [pdf, other

    cs.SD cs.LG eess.AS

    Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models

    Authors: Emilian Postolache, Giorgio Mariani, Luca Cosmo, Emmanouil Benetos, Emanuele Rodolà

    Abstract: Multi-Source Diffusion Models (MSDM) allow for compositional musical generation tasks: generating a set of coherent sources, creating accompaniments, and performing source separation. Despite their versatility, they require estimating the joint distribution over the sources, necessitating pre-separated musical data, which is rarely available, and fixing the number and type of sources at training t… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at ICASSP 2024

  9. arXiv:2403.05154  [pdf, other

    cs.CV cs.GR

    GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting

    Authors: Francesco Palandra, Andrea Sanchietti, Daniele Baieri, Emanuele Rodolà

    Abstract: We present GSEdit, a pipeline for text-guided 3D object editing based on Gaussian Splatting models. Our method enables the editing of the style and appearance of 3D objects without altering their main details, all in a matter of minutes on consumer hardware. We tackle the problem by leveraging Gaussian splatting to represent 3D scenes, and we optimize the model while progressively varying the imag… ▽ More

    Submitted 21 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

    MSC Class: 68T45 ACM Class: I.2.10; I.3.8

  10. arXiv:2311.07345  [pdf, other

    eess.AS cs.SD

    Zero-Shot Duet Singing Voices Separation with Diffusion Models

    Authors: Chin-Yun Yu, Emilian Postolache, Emanuele Rodolà, György Fazekas

    Abstract: In recent studies, diffusion models have shown promise as priors for solving audio inverse problems. These models allow us to sample from the posterior distribution of a target signal given an observed signal by manipulating the diffusion process. However, when separating audio sources of the same type, such as duet singing voices, the prior learned by the diffusion process may not be sufficient t… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 9 pages, 1 figure. Published at Sound Demixing Workshop 2023

  11. arXiv:2311.06547  [pdf, other

    cs.LG

    From Charts to Atlas: Merging Latent Spaces into One

    Authors: Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

    Abstract: Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative rep… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: To appear in the NeurReps workshop @ NeurIPS 2023

  12. arXiv:2311.06284  [pdf, other

    physics.comp-ph cs.GR physics.flu-dyn

    Efficient Generation of Multimodal Fluid Simulation Data

    Authors: Daniele Baieri, Donato Crisostomi, Stefano Esposito, Filippo Maggioli, Emanuele Rodolà

    Abstract: In this work, we introduce an efficient generation procedure to produce synthetic multi-modal datasets of fluid simulations. The procedure can reproduce the dynamics of fluid flows and allows for exploring and learning various properties of their complex behavior, from distinct perspectives and modalities. We employ our framework to generate a set of thoughtfully designed training datasets, which… ▽ More

    Submitted 8 March, 2024; v1 submitted 30 October, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures

    MSC Class: 68U20 ACM Class: I.2.6; I.3; I.6.3

  13. arXiv:2311.00664  [pdf, other

    cs.LG

    Latent Space Translation via Semantic Alignment

    Authors: Valentino Maiorca, Luca Moschella, Antonio Norelli, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: While different neural models often exhibit latent spaces that are alike when exposed to semantically related data, this intrinsic similarity is not always immediately discernible. Towards a better understanding of this phenomenon, our work shows how representations learned from these neural modules can be translated between different pre-trained networks via simpler transformations than previousl… ▽ More

    Submitted 11 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023. 21 pages, 13 figures, 8 tables

  14. arXiv:2310.15247  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis

    Authors: Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss

    Abstract: Sound design involves creatively selecting, recording, and editing sound effects for various media like cinema, video games, and virtual/augmented reality. One of the most time-consuming steps when designing sound is synchronizing audio with video. In some cases, environmental recordings from video shoots are available, which can aid in the process. However, in video games and animations, no refer… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  15. arXiv:2310.01211  [pdf, other

    cs.LG

    From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

    Authors: Irene Cannistraci, Luca Moschella, Marco Fumero, Valentino Maiorca, Emanuele Rodolà

    Abstract: It has been observed that representations learned by distinct neural networks conceal structural similarities when the models are trained under similar inductive biases. From a geometric perspective, identifying the classes of transformations and the related invariances that connect these representations is fundamental to unlocking applications, such as merging, stitching, and reusing different ne… ▽ More

    Submitted 20 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 41 pages, 14 figures and 31 tables

  16. arXiv:2307.16456  [pdf, other

    cs.CL

    Camoscio: an Italian Instruction-tuned LLaMA

    Authors: Andrea Santilli, Emanuele Rodolà

    Abstract: In recent years Large Language Models (LLMs) have increased the state of the art on several natural language processing tasks. However, their accessibility is often limited to paid API services, posing challenges for researchers in conducting extensive investigations. On the other hand, while some open-source models have been proposed by the community, they are typically English-centric or multili… ▽ More

    Submitted 18 December, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: Published at CLiC-it 2023

  17. arXiv:2307.13608  [pdf, other

    q-bio.BM cs.LG

    Geometric Epitope and Paratope Prediction

    Authors: Marco Pegoraro, Clémentine Dominé, Emanuele Rodolà, Petar Veličković, Andreea Deac

    Abstract: Antibody-antigen interactions play a crucial role in identifying and neutralizing harmful foreign molecules. In this paper, we investigate the optimal representation for predicting the binding sites in the two molecules and emphasize the importance of geometric information. Specifically, we compare different geometric deep learning methods applied to proteins' inner (I-GEP) and outer (O-GEP) struc… ▽ More

    Submitted 28 May, 2023; originally announced July 2023.

  18. arXiv:2307.01037  [pdf, other

    stat.ME cs.LG

    Vector Quantile Regression on Manifolds

    Authors: Marco Pegoraro, Sanketh Vedula, Aviv A. Rosenberg, Irene Tallini, Emanuele Rodolà, Alex M. Bronstein

    Abstract: Quantile regression (QR) is a statistical tool for distribution-free estimation of conditional quantiles of a target variable given explanatory features. QR is limited by the assumption that the target distribution is univariate and defined on an Euclidean domain. Although the notion of quantiles was recently extended to multi-variate distributions, QR for multi-variate distributions on manifolds… ▽ More

    Submitted 7 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  19. arXiv:2306.14457  [pdf, other

    cs.CL

    Fauno: The Italian Large Language Model that will leave you senza parole!

    Authors: Andrea Bacciu, Giovanni Trappolini, Andrea Santilli, Emanuele Rodolà, Fabrizio Silvestri

    Abstract: This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno inc… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  20. Accelerating Transformer Inference for Translation via Parallel Decoding

    Authors: Andrea Santilli, Silvio Severino, Emilian Postolache, Valentino Maiorca, Michele Mancusi, Riccardo Marin, Emanuele Rodolà

    Abstract: Autoregressive decoding limits the efficiency of transformers for Machine Translation (MT). The community proposed specific network architectures and learning-based methods to solve this issue, which are expensive and require changes to the MT model, trading inference speed at the cost of the translation quality. In this paper, we propose to address the problem from the point of view of decoding a… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 main conference

  21. arXiv:2305.09274  [pdf, other

    cs.GR cs.CG

    ReMatching: Low-Resolution Representations for Scalable Shape Correspondence

    Authors: Filippo Maggioli, Daniele Baieri, Emanuele Rodolà, Simone Melzi

    Abstract: We introduce \emph{ReMatching}, a novel shape correspondence solution based on the functional maps framework. Our method, by exploiting a new and appropriate \emph{re}-meshing paradigm, can target shape-\emph{matching} tasks even on meshes counting millions of vertices, where the original functional maps does not apply or requires a massive computational cost. The core of our procedure is a time-e… ▽ More

    Submitted 11 March, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

    MSC Class: 68U05 (Primary) ACM Class: I.3

  22. arXiv:2305.01447  [pdf, other

    cs.MM cs.CL cs.CV cs.DB cs.IR

    Multimodal Neural Databases

    Authors: Giovanni Trappolini, Andrea Santilli, Emanuele Rodolà, Alon Halevy, Fabrizio Silvestri

    Abstract: The rise in loosely-structured data available through text, images, and other modalities has called for new ways of querying them. Multimedia Information Retrieval has filled this gap and has witnessed exciting progress in recent years. Tasks such as search and retrieval of extensive multimedia archives have undergone massive performance improvements, driven to a large extent by recent development… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Journal ref: SIGIR 2023: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

  23. arXiv:2304.07939  [pdf, other

    cs.LG

    Leveraging sparse and shared feature activations for disentangled representation learning

    Authors: Marco Fumero, Florian Wenzel, Luca Zancato, Alessandro Achille, Emanuele Rodolà, Stefano Soatto, Bernhard Schölkopf, Francesco Locatello

    Abstract: Recovering the latent factors of variation of high dimensional data has so far focused on simple synthetic settings. Mostly building on unsupervised and weakly-supervised objectives, prior work missed out on the positive implications for representation learning on real world data. In this work, we propose to leverage knowledge extracted from a diversified set of supervised tasks to learn a common… ▽ More

    Submitted 12 December, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

  24. arXiv:2303.09871  [pdf, other

    cs.CV cs.GR

    Fluid Dynamics Network: Topology-Agnostic 4D Reconstruction via Fluid Dynamics Priors

    Authors: Daniele Baieri, Stefano Esposito, Filippo Maggioli, Emanuele Rodolà

    Abstract: Representing 3D surfaces as level sets of continuous functions over $\mathbb{R}^3$ is the common denominator of neural implicit representations, which recently enabled remarkable progress in geometric deep learning and computer vision tasks. In order to represent 3D motion within this framework, it is often assumed (either explicitly or implicitly) that the transformations which a surface may unde… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 9 pages, 10 figures

    ACM Class: I.2.6; I.3.7; I.3.5

  25. arXiv:2303.00721  [pdf, other

    cs.LG cs.AI

    Bootstrap** Parallel Anchors for Relative Representations

    Authors: Irene Cannistraci, Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Emanuele Rodolà

    Abstract: The use of relative representations for latent embeddings has shown potential in enabling latent space communication and zero-shot model stitching across a wide range of applications. Nevertheless, relative representations rely on a certain amount of parallel anchors to be given as input, which can be impractical to obtain in certain scenarios. To overcome this limitation, we propose an optimizati… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 9 pages, 7 tables

    MSC Class: 68T07 ACM Class: I.2.6

  26. arXiv:2302.02257  [pdf, other

    cs.SD cs.LG eess.AS

    Multi-Source Diffusion Models for Simultaneous Music Generation and Separation

    Authors: Giorgio Mariani, Irene Tallini, Emilian Postolache, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

    Abstract: In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e., generating a mixture, separating the sources), we also introduce and experiment on the partial generation task of source imputation, where we generate… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: ICLR 2024 oral presentation. Demo page: https://gladia-research-group.github.io/multi-source-diffusion-models/

  27. arXiv:2301.08562  [pdf, other

    cs.LG cs.SD eess.AS

    Latent Autoregressive Source Separation

    Authors: Emilian Postolache, Giorgio Mariani, Michele Mancusi, Andrea Santilli, Luca Cosmo, Emanuele Rodolà

    Abstract: Autoregressive models have achieved impressive results over a wide range of domains in terms of generation quality and downstream task performance. In the continuous domain, a key factor behind this success is the usage of quantized latent spaces (e.g., obtained via VQ-VAE autoencoders), which allow for dimensionality reduction and faster inference times. However, using existing pre-trained models… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  28. arXiv:2301.03345  [pdf, other

    cs.LG

    Latent Spectral Regularization for Continual Learning

    Authors: Emanuele Frascaroli, Riccardo Benaglia, Matteo Boschini, Luca Moschella, Cosimo Fiorini, Emanuele Rodolà, Simone Calderara

    Abstract: While biological intelligence grows organically as new knowledge is gathered throughout life, Artificial Neural Networks forget catastrophically whenever they face a changing training data distribution. Rehearsal-based Continual Learning (CL) approaches have been established as a versatile and reliable solution to overcome this limitation; however, sudden input disruptions and memory constraints a… ▽ More

    Submitted 7 August, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures

  29. arXiv:2211.14604  [pdf, other

    cs.CV cs.GR

    Reduced Representation of Deformation Fields for Effective Non-rigid Shape Matching

    Authors: Ramana Sundararaman, Riccardo Marin, Emanuele Rodola, Maks Ovsjanikov

    Abstract: In this work we present a novel approach for computing correspondences between non-rigid objects, by exploiting a reduced representation of deformation fields. Different from existing works that represent deformation fields by training a general-purpose neural network, we advocate for an approximation based on mesh-free methods. By letting the network learn deformation parameters at a sparse set o… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  30. arXiv:2210.01738  [pdf, other

    cs.LG cs.AI cs.CV

    ASIF: Coupled Data Turns Unimodal Models to Multimodal Without Training

    Authors: Antonio Norelli, Marco Fumero, Valentino Maiorca, Luca Moschella, Emanuele Rodolà, Francesco Locatello

    Abstract: CLIP proved that aligning visual and language spaces is key to solving many vision tasks without explicit training, but required to train image and text encoders from scratch on a huge dataset. LiT improved this by only training the text encoder and using a pre-trained vision network. In this paper, we show that a common space can be created without any training at all, using single-domain encoder… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 17 pages

  31. arXiv:2209.15430  [pdf, other

    cs.LG cs.AI

    Relative representations enable zero-shot latent space communication

    Authors: Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural networks embed the geometric structure of a data manifold lying in a high-dimensional space into latent representations. Ideally, the distribution of the data points in the latent space should depend only on the task, the data, the loss, and other architecture-specific constraints. However, factors such as the random weights initialization, training hyperparameters, or other sources of rand… ▽ More

    Submitted 7 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 notable top 5%, 26 pages, 11 figures, 18 tables

    MSC Class: 68T07 ACM Class: I.2.6

  32. arXiv:2209.09688  [pdf, other

    cs.LG cs.CR

    Sparse Vicious Attacks on Graph Neural Networks

    Authors: Giovanni Trappolini, Valentino Maiorca, Silvio Severino, Emanuele Rodolà, Fabrizio Silvestri, Gabriele Tolomei

    Abstract: Graph Neural Networks (GNNs) have proven to be successful in several predictive modeling tasks for graph-structured data. Amongst those tasks, link prediction is one of the fundamental problems for many real-world applications, such as recommender systems. However, GNNs are not immune to adversarial attacks, i.e., carefully crafted malicious examples that are designed to fool the predictive mo… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  33. arXiv:2206.10885  [pdf, other

    cs.CV cs.GR cs.LG

    KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time Rendering

    Authors: Stefano Esposito, Daniele Baieri, Stefan Zellmann, André Hinkenjann, Emanuele Rodolà

    Abstract: NeRF-based techniques fit wide and deep multi-layer perceptrons (MLPs) to a continuous radiance field that can be rendered from any unseen viewpoint. However, the lack of surface and normals definition and high rendering times limit their usage in typical computer graphics applications. Such limitations have recently been overcome separately, but solving them together remains an open problem. We p… ▽ More

    Submitted 21 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 9 pages, 8 figures

    ACM Class: I.2.10; I.3.7; I.2.6

  34. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  35. arXiv:2206.03695  [pdf, other

    cs.LG cs.AI

    Metric Based Few-Shot Graph Classification

    Authors: Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Many modern deep-learning techniques do not work without enormous datasets. At the same time, several fields demand methods working in scarcity of data. This problem is even more complex when the samples have varying structures, as in the case of graphs. Graph representation learning techniques have recently proven successful in a variety of domains. Nevertheless, the employed architectures perfor… ▽ More

    Submitted 4 January, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: To appear in Learning on Graphs (LoG) 2022

  36. arXiv:2205.14938  [pdf, other

    cs.LG cs.CV

    Spectral Maps for Learning on Subgraphs

    Authors: Marco Pegoraro, Riccardo Marin, Arianna Rampini, Simone Melzi, Luca Cosmo, Emanuele Rodolà

    Abstract: In graph learning, maps between graphs and their subgraphs frequently arise. For instance, when coarsening or rewiring operations are present along the pipeline, one needs to keep track of the corresponding nodes between the original and modified graphs. Classically, these maps are represented as binary node-to-node correspondence matrices and used as-is to transfer node-wise features between the… ▽ More

    Submitted 31 January, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  37. arXiv:2203.10554  [pdf, other

    cs.CV

    3D Human Pose Estimation Using Möbius Graph Convolutional Networks

    Authors: Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof

    Abstract: 3D human pose estimation is fundamental to understanding human behavior. Recently, promising results have been achieved by graph convolutional networks (GCNs), which achieve state-of-the-art performance and provide rather light-weight architectures. However, a major limitation of GCNs is their inability to encode all the transformations between joints explicitly. To address this issue, we propose… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

  38. arXiv:2201.10222  [pdf, other

    cs.LG cs.AI cs.CL physics.hist-ph

    Explanatory Learning: Beyond Empiricism in Neural Networks

    Authors: Antonio Norelli, Giorgio Mariani, Luca Moschella, Andrea Santilli, Giambattista Parascandolo, Simone Melzi, Emanuele Rodolà

    Abstract: We introduce Explanatory Learning (EL), a framework to let machines use existing knowledge buried in symbolic sequences -- e.g. explanations written in hieroglyphic -- by autonomously learning to interpret them. In EL, the burden of interpreting symbols is not left to humans or rigid human-coded compilers, as done in Program Synthesis. Rather, EL calls for a learned interpreter, built upon a limit… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 7 pages

  39. arXiv:2201.05013  [pdf, other

    cs.SD cs.LG eess.AS

    Fish sounds: towards the evaluation of marine acoustic biodiversity through data-driven audio source separation

    Authors: Michele Mancusi, Nicola Zonca, Emanuele Rodolà, Silvia Zuffi

    Abstract: The marine ecosystem is changing at an alarming rate, exhibiting biodiversity loss and the migration of tropical species to temperate basins. Monitoring the underwater environments and their inhabitants is of fundamental importance to understand the evolution of these systems and implement safeguard policies. However, assessing and tracking biodiversity is often a complex task, especially in large… ▽ More

    Submitted 14 January, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  40. Graph Kernel Neural Networks

    Authors: Luca Cosmo, Giorgia Minello, Alessandro Bicciato, Michael Bronstein, Emanuele Rodolà, Luca Rossi, Andrea Torsello

    Abstract: The convolution operator at the core of many modern neural architectures can effectively be seen as performing a dot product between an input matrix and a filter. While this is readily applicable to data such as images, which can be represented as regular grids in the Euclidean space, extending the convolution operator to work on graphs proves more challenging, due to their irregular structure. In… ▽ More

    Submitted 19 June, 2024; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (2024)

  41. arXiv:2112.07289  [pdf, other

    cs.CV

    Smoothness and effective regularizations in learned embeddings for shape matching

    Authors: Riccardo Marin, Souhaib Attaiki, Simone Melzi, Emanuele Rodolà, Maks Ovsjanikov

    Abstract: Many innovative applications require establishing correspondences among 3D geometric objects. However, the countless possible deformations of smooth surfaces make shape matching a challenging task. Finding an embedding to represent the different shapes in high-dimensional space where the matching is easier to solve is a well-trodden path that has given many outstanding solutions. Recently, a new t… ▽ More

    Submitted 8 June, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  42. arXiv:2110.13042  [pdf, other

    cs.DC

    Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose

    Authors: Viviana Arrigoni, Filippo Maggioli, Annalisa Massini, Emanuele Rodolà

    Abstract: The multiplication of a matrix by its transpose, $A^T A$, appears as an intermediate operation in the solution of a wide set of problems. In this paper, we propose a new cache-oblivious algorithm (ATA) for computing this product, based upon the classical Strassen algorithm as a sub-routine. In particular, we decrease the computational cost to $\frac{2}{3}$ the time required by Strassen's algorithm… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  43. arXiv:2110.05313  [pdf, other

    cs.LG cs.SD eess.AS

    Unsupervised Source Separation via Bayesian Inference in the Latent Domain

    Authors: Michele Mancusi, Emilian Postolache, Giorgio Mariani, Marco Fumero, Andrea Santilli, Luca Cosmo, Emanuele Rodolà

    Abstract: State of the art audio source separation models rely on supervised data-driven approaches, which can be expensive in terms of labeling resources. On the other hand, approaches for training these models without any direct supervision are typically high-demanding in terms of memory and time requirements, and remain impractical to be used at inference time. We aim to tackle these limitations by propo… ▽ More

    Submitted 30 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 5 pages, 2 figures, submitted to Interspeech 2022

  44. arXiv:2108.02161  [pdf, other

    cs.CV

    Localized Shape Modelling with Global Coherence: An Inverse Spectral Approach

    Authors: Marco Pegoraro, Simone Melzi, Umberto Castellani, Riccardo Marin, Emanuele Rodolà

    Abstract: Many natural shapes have most of their characterizing features concentrated over a few regions in space. For example, humans and animals have distinctive head shapes, while inorganic objects like chairs and airplanes are made of well-localized functional parts with specific geometric features. Often, these features are strongly correlated -- a modification of facial traits in a quadruped should in… ▽ More

    Submitted 24 June, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted at SGP2022

  45. arXiv:2106.13679  [pdf, other

    cs.CV cs.GR cs.LG

    Shape registration in the time of transformers

    Authors: Giovanni Trappolini, Luca Cosmo, Luca Moschella, Riccardo Marin, Simone Melzi, Emanuele Rodolà

    Abstract: In this paper, we propose a transformer-based procedure for the efficient registration of non-rigid 3D point clouds. The proposed approach is data-driven and adopts for the first time the transformer architecture in the registration task. Our method is general and applies to different settings. Given a fixed template with some desired properties (e.g. skinning weights or other animation cues), we… ▽ More

    Submitted 28 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  46. arXiv:2104.14628  [pdf, other

    cs.LG cs.CV

    Cluster-driven Graph Federated Learning over Multiple Domains

    Authors: Debora Caldarola, Massimiliano Mancini, Fabio Galasso, Marco Ciccone, Emanuele Rodolà, Barbara Caputo

    Abstract: Federated Learning (FL) deals with learning a central model (i.e. the server) in privacy-constrained scenarios, where data are stored on multiple devices (i.e. the clients). The central model has no direct access to the data, but only to the updates of the parameters computed locally by each client. This raises a problem, known as statistical heterogeneity, because the clients may have different d… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR21 Workshop Learning from Limited or Imperfect Data (L^2ID)

  47. arXiv:2104.03356  [pdf, other

    cs.LG

    Universal Spectral Adversarial Attacks for Deformable Shapes

    Authors: Arianna Rampini, Franco Pestarini, Luca Cosmo, Simone Melzi, Emanuele Rodolà

    Abstract: Machine learning models are known to be vulnerable to adversarial attacks, namely perturbations of the data that lead to wrong predictions despite being imperceptible. However, the existence of "universal" attacks (i.e., unique perturbations that transfer across different data points) has only been demonstrated for images to date. Part of the reason lies in the lack of a common domain, for geometr… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Published at CVPR 2021

  48. arXiv:2104.00514  [pdf, other

    cs.GR cs.CG cs.LG

    Learning Spectral Unions of Partial Deformable 3D Shapes

    Authors: Luca Moschella, Simone Melzi, Luca Cosmo, Filippo Maggioli, Or Litany, Maks Ovsjanikov, Leonidas Guibas, Emanuele Rodolà

    Abstract: Spectral geometric methods have brought revolutionary changes to the field of geometry processing. Of particular interest is the study of the Laplacian spectrum as a compact, isometry and permutation-invariant representation of a shape. Some recent works show how the intrinsic geometry of a full shape can be recovered from its spectrum, but there are approaches that consider the more challenging p… ▽ More

    Submitted 21 December, 2022; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: 18 pages, 20 figures

  49. arXiv:2103.01638  [pdf, other

    cs.LG

    Learning disentangled representations via product manifold projection

    Authors: Marco Fumero, Luca Cosmo, Simone Melzi, Emanuele Rodolà

    Abstract: We propose a novel approach to disentangle the generative factors of variation underlying a given set of observations. Our method builds upon the idea that the (unknown) low-dimensional manifold underlying the data space can be explicitly modeled as a product of submanifolds. This definition of disentanglement gives rise to a novel weakly-supervised algorithm for recovering the unknown explanatory… ▽ More

    Submitted 3 October, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: 15 pages, 10 figures

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  50. Non-Rigid Puzzles

    Authors: Or Litany, Emanuele Rodolà, Alex Bronstein, Michael Bronstein, Daniel Cremers

    Abstract: Shape correspondence is a fundamental problem in computer graphics and vision, with applications in various problems including animation, texture map**, robotic vision, medical imaging, archaeology and many more. In settings where the shapes are allowed to undergo non-rigid deformations and only partial views are available, the problem becomes very challenging. To this end, we present a non-rigi… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Journal ref: Computer Graphics Forum, Volume 35, Issue 5, August 2016