Skip to main content

Showing 1–15 of 15 results for author: Mariani, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16969  [pdf, other

    cs.SD cs.LG eess.AS

    COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations

    Authors: Ruben Ciranni, Emilian Postolache, Giorgio Mariani, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

    Abstract: We present COCOLA (Coherence-Oriented Contrastive Learning for Audio), a contrastive learning method for musical audio representations that captures the harmonic and rhythmic coherence between samples. Our method operates at the level of stems (or their combinations) composing music tracks and allows the objective evaluation of compositional models for music in the task of accompaniment generation… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Demo page: https://github.com/gladia-research-group/cocola

  2. arXiv:2403.11706  [pdf, other

    cs.SD cs.LG eess.AS

    Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models

    Authors: Emilian Postolache, Giorgio Mariani, Luca Cosmo, Emmanouil Benetos, Emanuele Rodolà

    Abstract: Multi-Source Diffusion Models (MSDM) allow for compositional musical generation tasks: generating a set of coherent sources, creating accompaniments, and performing source separation. Despite their versatility, they require estimating the joint distribution over the sources, necessitating pre-separated musical data, which is rarely available, and fixing the number and type of sources at training t… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at ICASSP 2024

  3. arXiv:2302.02257  [pdf, other

    cs.SD cs.LG eess.AS

    Multi-Source Diffusion Models for Simultaneous Music Generation and Separation

    Authors: Giorgio Mariani, Irene Tallini, Emilian Postolache, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

    Abstract: In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e., generating a mixture, separating the sources), we also introduce and experiment on the partial generation task of source imputation, where we generate… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: ICLR 2024 oral presentation. Demo page: https://gladia-research-group.github.io/multi-source-diffusion-models/

  4. arXiv:2301.08562  [pdf, other

    cs.LG cs.SD eess.AS

    Latent Autoregressive Source Separation

    Authors: Emilian Postolache, Giorgio Mariani, Michele Mancusi, Andrea Santilli, Luca Cosmo, Emanuele Rodolà

    Abstract: Autoregressive models have achieved impressive results over a wide range of domains in terms of generation quality and downstream task performance. In the continuous domain, a key factor behind this success is the usage of quantized latent spaces (e.g., obtained via VQ-VAE autoencoders), which allow for dimensionality reduction and faster inference times. However, using existing pre-trained models… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  5. arXiv:2212.11700  [pdf, ps, other

    cs.CR cs.DC

    Blockchain Scalability and Security: Communications Among Fast-Changing Committees Made Simple

    Authors: Andrea Mariani, Gianluca Mariani, Diego Pennino, Maurizio Pizzonia

    Abstract: For permissionless blockchains, scalability is paramount. While current technologies still fail to address this problem fully, many research works propose sharding or other techniques that extensively adopt parallel processing of transactions. In these approaches, a potentially large number of committees of nodes independently perform consensus and process new transactions. Hence, in addition to r… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  6. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  7. arXiv:2201.10222  [pdf, other

    cs.LG cs.AI cs.CL physics.hist-ph

    Explanatory Learning: Beyond Empiricism in Neural Networks

    Authors: Antonio Norelli, Giorgio Mariani, Luca Moschella, Andrea Santilli, Giambattista Parascandolo, Simone Melzi, Emanuele Rodolà

    Abstract: We introduce Explanatory Learning (EL), a framework to let machines use existing knowledge buried in symbolic sequences -- e.g. explanations written in hieroglyphic -- by autonomously learning to interpret them. In EL, the burden of interpreting symbols is not left to humans or rigid human-coded compilers, as done in Program Synthesis. Rather, EL calls for a learned interpreter, built upon a limit… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 7 pages

  8. arXiv:2110.05313  [pdf, other

    cs.LG cs.SD eess.AS

    Unsupervised Source Separation via Bayesian Inference in the Latent Domain

    Authors: Michele Mancusi, Emilian Postolache, Giorgio Mariani, Marco Fumero, Andrea Santilli, Luca Cosmo, Emanuele Rodolà

    Abstract: State of the art audio source separation models rely on supervised data-driven approaches, which can be expensive in terms of labeling resources. On the other hand, approaches for training these models without any direct supervision are typically high-demanding in terms of memory and time requirements, and remain impractical to be used at inference time. We aim to tackle these limitations by propo… ▽ More

    Submitted 30 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 5 pages, 2 figures, submitted to Interspeech 2022

  9. arXiv:2012.08859  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces

    Authors: Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort

    Abstract: Current state-of-the-art Neural Architecture Search (NAS) methods neither efficiently scale to multiple hardware platforms, nor handle diverse architectural search-spaces. To remedy this, we present DONNA (Distilling Optimal Neural Network Architectures), a novel pipeline for rapid, scalable and diverse NAS, that scales to many user scenarios. DONNA consists of three phases. First, an accuracy pre… ▽ More

    Submitted 27 August, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted at ICCV2021. Main text 9 pages, Full text 21 pages, 18 figures

  10. Mixed-precision deep learning based on computational memory

    Authors: S. R. Nandakumar, Manuel Le Gallo, Christophe Piveteau, Vinay Joshi, Giovanni Mariani, Irem Boybat, Geethan Karunaratne, Riduan Khaddam-Aljameh, Urs Egger, Anastasios Petropoulos, Theodore Antonakopoulos, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally intensive and this has motivated the search for novel computing architectures targeting this application. A computational memory unit with nanoscale resistive memory… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

    Journal ref: Frontiers in Neuroscience 14:406 (2020)

  11. arXiv:1901.06261  [pdf, other

    cs.LG cs.SE stat.ML

    NeuNetS: An Automated Synthesis Engine for Neural Network Design

    Authors: Atin Sood, Benjamin Elder, Benjamin Herta, Chao Xue, Costas Bekas, A. Cristiano I. Malossi, Debashish Saha, Florian Scheidegger, Ganesh Venkataraman, Gegi Thomas, Giovanni Mariani, Hendrik Strobelt, Horst Samulowitz, Martin Wistuba, Matteo Manica, Mihir Choudhury, Rong Yan, Roxana Istrate, Ruchir Puri, Tejaswini Pedapati

    Abstract: Application of neural networks to a vast variety of practical applications is transforming the way AI is applied in practice. Pre-trained neural network models available through APIs or capability to custom train pre-built neural network architectures with customer data has made the consumption of AI by developers much simpler and resulted in broad adoption of these complex AI models. While prebui… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: 14 pages, 12 figures. arXiv admin note: text overlap with arXiv:1806.00250

  12. arXiv:1806.00250  [pdf, other

    cs.LG stat.ML

    TAPAS: Train-less Accuracy Predictor for Architecture Search

    Authors: R. Istrate, F. Scheidegger, G. Mariani, D. Nikolopoulos, C. Bekas, A. C. I. Malossi

    Abstract: In recent years an increasing number of researchers and practitioners have been suggesting algorithms for large-scale neural network architecture search: genetic algorithms, reinforcement learning, learning curve extrapolation, and accuracy predictors. None of them, however, demonstrated high-performance without training new experiments in the presence of unseen datasets. We propose a new deep neu… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  13. arXiv:1803.09655  [pdf, other

    cs.CV cs.LG stat.ML

    BAGAN: Data Augmentation with Balancing GAN

    Authors: Giovanni Mariani, Florian Scheidegger, Roxana Istrate, Costas Bekas, Cristiano Malossi

    Abstract: Image classification datasets are often imbalanced, characteristic that negatively affects the accuracy of deep-learning classifiers. In this work we propose balancing GAN (BAGAN) as an augmentation tool to restore balance in imbalanced datasets. This is challenging because the few minority-class images may not be enough to train a GAN. We overcome this issue by including during the adversarial tr… ▽ More

    Submitted 5 June, 2018; v1 submitted 26 March, 2018; originally announced March 2018.

  14. arXiv:1803.09588  [pdf, other

    cs.CV

    Efficient Image Dataset Classification Difficulty Estimation for Predicting Deep-Learning Accuracy

    Authors: Florian Scheidegger, Roxana Istrate, Giovanni Mariani, Luca Benini, Costas Bekas, Cristiano Malossi

    Abstract: In the deep-learning community new algorithms are published at an incredible pace. Therefore, solving an image classification problem for new datasets becomes a challenging task, as it requires to re-evaluate published algorithms and their different configurations in order to find a close to optimal classifier. To facilitate this process, before biasing our decision towards a class of neural netwo… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  15. arXiv:1612.00456  [pdf, ps, other

    astro-ph.IM cs.PF

    Characterising radio telescope software with the Workload Characterisation Framework

    Authors: Y. G. Grange, R. Lakhoo, M. Petschow, C. Wu, B. Veenboer, I. Emsley, T. J. Dijkema, A. P. Mechev, G. Mariani

    Abstract: We present a modular framework, the Workload Characterisation Framework (WCF), that is developed to reproducibly obtain, store and compare key characteristics of radio astronomy processing software. As a demonstration, we discuss the experiences using the framework to characterise a LOFAR calibration and imaging pipeline.

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: 4 pages, 4 figures; to be published in ADASS XXVI (held October 16-20, 2016) proceedings. See http://www.adass2016.inaf.it/images/posters/grange.pdf for the poster

    ACM Class: D.4.8; K.6.2

    Journal ref: 2019, ADASS XXVI, ASP Conf. Ser., Vol 521, Eds. M. Molinaro, K. Shortridge, & F. Pasian, 683