Skip to main content

Showing 1–13 of 13 results for author: Motta, G

.
  1. arXiv:2310.00141  [pdf, other

    cs.CL eess.AS

    The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning

    Authors: Lillian Zhou, Yuxin Ding, Mingqing Chen, Harry Zhang, Rohit Prabhavalkar, Dhruv Guliani, Giovanni Motta, Rajiv Mathews

    Abstract: Automatic speech recognition (ASR) models are typically trained on large datasets of transcribed speech. As language evolves and new terms come into use, these models can become outdated and stale. In the context of models trained on the server but deployed on edge devices, errors may result from the mismatch between server training data and actual on-device usage. In this work, we seek to continu… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted to IEEE ASRU 2023

  2. arXiv:2209.06359  [pdf, other

    cs.LG cs.AI

    Federated Pruning: Improving Neural Network Efficiency with Federated Learning

    Authors: Rongmei Lin, Yonghui Xiao, Tien-Ju Yang, Ding Zhao, Li Xiong, Giovanni Motta, Françoise Beaufays

    Abstract: Automatic Speech Recognition models require large amount of speech data for training, and the collection of such data often leads to privacy concerns. Federated learning has been widely used and is considered to be an effective decentralized technique by collaboratively learning a shared prediction model while kee** the data local on different clients devices. However, the limited computation an… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: To appear in INTERSPEECH 2022

  3. arXiv:2205.03494  [pdf, other

    cs.LG

    Online Model Compression for Federated Learning with Large Models

    Authors: Tien-Ju Yang, Yonghui Xiao, Giovanni Motta, Françoise Beaufays, Rajiv Mathews, Mingqing Chen

    Abstract: This paper addresses the challenges of training large neural network models under federated learning settings: high on-device memory usage and communication cost. The proposed Online Model Compression (OMC) provides a framework that stores model parameters in a compressed format and decompresses them only when needed. We use quantization as the compression method in this paper and propose three me… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Submitted to INTERSPEECH 2022

  4. arXiv:2111.10264  [pdf, other

    stat.ME astro-ph.SR

    Periodic Variable Stars Modulated by Time-Varying Parameters

    Authors: Giovanni Motta, Darlin Soto, Márcio Catelan

    Abstract: Many astrophysical phenomena are time-varying, in the sense that their brightness change over time. In the case of periodic stars, previous approaches assumed that changes in period, amplitude, and phase are well described by either parametric or piecewise-constant functions. With this paper, we introduce a new mathematical model for the description of the so-called modulated light curves, as foun… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 26 pages, 6 figures, to be published in The Astrophysical Journal

  5. arXiv:2110.05607  [pdf, other

    cs.LG cs.SD eess.AS

    Partial Variable Training for Efficient On-Device Federated Learning

    Authors: Tien-Ju Yang, Dhruv Guliani, Françoise Beaufays, Giovanni Motta

    Abstract: This paper aims to address the major challenges of Federated Learning (FL) on edge devices: limited memory and expensive communication. We propose a novel method, called Partial Variable Training (PVT), that only trains a small subset of variables on edge devices to reduce memory usage and communication cost. With PVT, we show that network accuracy can be maintained by utilizing more local trainin… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  6. arXiv:2110.04267  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training

    Authors: Lillian Zhou, Dhruv Guliani, Andreas Kabel, Giovanni Motta, Françoise Beaufays

    Abstract: Transformer-based architectures have been the subject of research aimed at understanding their overparameterization and the non-uniform importance of their layers. Applying these approaches to Automatic Speech Recognition, we demonstrate that the state-of-the-art Conformer models generally have multiple ambient layers. We study the stability of these layers across runs and model sizes, propose tha… ▽ More

    Submitted 4 February, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: \c{opyright} 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    MSC Class: 68T10 ACM Class: I.2.7

  7. arXiv:2110.03634  [pdf, other

    cs.LG cs.DC

    Enabling On-Device Training of Speech Recognition Models with Federated Dropout

    Authors: Dhruv Guliani, Lillian Zhou, Changwan Ryu, Tien-Ju Yang, Harry Zhang, Yonghui Xiao, Francoise Beaufays, Giovanni Motta

    Abstract: Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech reco… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: \c{opyright} 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

    MSC Class: 68T10 ACM Class: I.2.7

  8. arXiv:2104.11358  [pdf, other

    stat.ME math.ST

    Joint Mean-Vector and Var-Matrix estimation for Locally Stationary VAR(1) processes

    Authors: Giovanni Motta

    Abstract: During the last two decades, locally stationary processes have been widely studied in the time series literature. In this paper we consider the locally-stationary vector-auto-regression model of order one, or LS-VAR(1), and estimate its parameters by weighted least squares. The LS-VAR(1) we consider allows for a smoothly time-varying non-diagonal VAR matrix, as well as for a smoothly time-varying… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  9. Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework

    Authors: Dhruv Guliani, Francoise Beaufays, Giovanni Motta

    Abstract: We propose using federated learning, a decentralized on-device learning paradigm, to train speech recognition models. By performing epochs of training on a per-user basis, federated learning must incur the cost of dealing with non-IID data distributions, which are expected to negatively affect the quality of the trained model. We propose a framework by which the degree of non-IID-ness can be varie… ▽ More

    Submitted 14 May, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: Paper published at ICASSP 2021

    MSC Class: 68T10 ACM Class: I.2.7

    Journal ref: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 3080-3084

  10. arXiv:2001.08885  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Low-rank Gradient Approximation For Memory-Efficient On-device Training of Deep Neural Network

    Authors: Mary Gooneratne, Khe Chai Sim, Petr Zadrazil, Andreas Kabel, Françoise Beaufays, Giovanni Motta

    Abstract: Training machine learning models on mobile devices has the potential of improving both privacy and accuracy of the models. However, one of the major obstacles to achieving this goal is the memory limitation of mobile devices. Reducing training memory enables models with high-dimensional weight matrices, like automatic speech recognition (ASR) models, to be trained on-device. In this paper, we prop… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  11. arXiv:1912.09251  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Personalization of End-to-end Speech Recognition On Mobile Devices For Named Entities

    Authors: Khe Chai Sim, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang, Leif Johnson, Giovanni Motta, Lillian Zhou

    Abstract: We study the effectiveness of several techniques to personalize end-to-end speech models and improve the recognition of proper names relevant to the user. These techniques differ in the amounts of user effort required to provide supervision, and are evaluated on how they impact speech recognition performance. We propose using keyword-dependent precision and recall metrics to measure vocabulary acq… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

  12. Poincaré surfaces of section around a 3-D irregular body: The case of asteroid 4179 Toutatis

    Authors: Gabriel Borderes Motta, Othon Cabo Winter

    Abstract: In general, small bodies of the solar system, e.g., asteroids and comets, have a very irregular shape. This feature affects significantly the gravitational potential around these irregular bodies, which hinders dynamical studies. The Poincaré surface of sec- tion technique is often used to look for stable and chaotic regions in two-dimensional dynamic cases. In this work, we show that this tool ca… ▽ More

    Submitted 17 November, 2017; originally announced November 2017.

  13. arXiv:1608.08342  [pdf, ps, other

    cs.SE

    A New Paradigm of Software Service Engineering in the Era of Big Data and Big Service

    Authors: Xiaofei Xu, Gianmario Motta, Xianzhi Wang, Zhiying Tu, Hanchuan Xu

    Abstract: Servitization is one of the most significant trends that reshapes the information world and society in recent years. The requirement of collecting,storing, processing, and sharing of the Big Data has led to massive software resources being developed and made accessible as web-based services to facilitate such process. These services that handle the Big Data come from various domains and heterogene… ▽ More

    Submitted 30 August, 2016; originally announced August 2016.

    Comments: 23 pages+ 1 page references. Submitted to Springer Computing