Skip to main content

Showing 1–18 of 18 results for author: Fernandez-Marques, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14791  [pdf, other

    cs.LG cs.CV cs.DC

    Recurrent Early Exits for Federated Learning with Heterogeneous Clients

    Authors: Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

    Abstract: Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hardware capacities; clients have differing compute and memory requirements. To tackle this challenge, recent state-of-the-art approaches leverage the use of early exits. Nonetheless, these approaches fal… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted at the 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2311.18451  [pdf, other

    cs.LG

    How Much Is Hidden in the NAS Benchmarks? Few-Shot Adaptation of a NAS Predictor

    Authors: Hrushikesh Loya, Łukasz Dudziak, Abhinav Mehrotra, Royson Lee, Javier Fernandez-Marques, Nicholas D. Lane, Hongkai Wen

    Abstract: Neural architecture search has proven to be a powerful approach to designing and refining neural networks, often boosting their performance and efficiency over manually-designed variations, but comes with computational overhead. While there has been a considerable amount of research focused on lowering the cost of NAS for mainstream tasks, such as image classification, a lot of those improvements… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  3. arXiv:2307.13412  [pdf, other

    cs.LG cs.AR cs.CV

    Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation

    Authors: Stylianos I. Venieris, Javier Fernandez-Marques, Nicholas D. Lane

    Abstract: The unprecedented accuracy of convolutional neural networks (CNNs) across a broad range of AI tasks has led to their widespread deployment in mobile and embedded settings. In a pursuit for high-performance and energy-efficient inference, significant research effort has been invested in the design of FPGA-based CNN accelerators. In this context, single computation engines constitute a popular appro… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at ACM TODAES, 2023. arXiv admin note: substantial text overlap with arXiv:2103.05600

  4. arXiv:2306.17453  [pdf, other

    cs.DC

    Pollen: High-throughput Federated Learning Simulation via Resource-Aware Client Placement

    Authors: Lorenzo Sani, Pedro Porto Buarque de Gusmão, Alex Iacob, Wanru Zhao, Xinchi Qiu, Yan Gao, Javier Fernandez-Marques, Nicholas Donald Lane

    Abstract: Federated Learning (FL) is a privacy-focused machine learning paradigm that collaboratively trains models directly on edge devices. Simulation plays an essential role in FL adoption, hel** develop novel aggregation and client sampling strategies. However, current simulators cannot emulate large-scale systems in a time-efficient manner, which limits their utility and casts doubts on generalizabil… ▽ More

    Submitted 20 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 22 pages, 22 figures, 9 tables, under review

  5. arXiv:2305.18334  [pdf, other

    cs.AR cs.LG

    PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration

    Authors: Ahmed F. AbouElhamayed, Angela Cui, Javier Fernandez-Marques, Nicholas D. Lane, Mohamed S. Abdelfattah

    Abstract: Conventional multiply-accumulate (MAC) operations have long dominated computation time for deep neural networks (DNNs), espcially convolutional neural networks (CNNs). Recently, product quantization (PQ) has been applied to these workloads, replacing MACs with memory lookups to pre-computed dot products. To better understand the efficiency tradeoffs of product-quantized DNNs (PQ-DNNs), we create a… ▽ More

    Submitted 28 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: ACM Transactions on Reconfigurable Technology and Systems (TRETS) - FCCM 2024 Journal Track

  6. arXiv:2212.04084  [pdf, other

    cs.LG cs.DC

    Federated Learning for Inference at Anytime and Anywhere

    Authors: Zicheng Liu, Da Li, Javier Fernandez-Marques, Stefanos Laskaridis, Yan Gao, Łukasz Dudziak, Stan Z. Li, Shell Xu Hu, Timothy Hospedales

    Abstract: Federated learning has been predominantly concerned with collaborative training of deep networks from scratch, and especially the many challenges that arise, such as communication cost, robustness to heterogeneous data, and support for diverse device capabilities. However, there is no unified framework that addresses all these problems together. This paper studies the challenges and opportunities… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 14 pages, 3 figures

  7. arXiv:2209.15575  [pdf, other

    cs.SD cs.LG eess.AS

    Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

    Authors: Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Pedro P. B. de Gusmao, Nicholas D. Lane

    Abstract: Self-supervised learning (SSL) has proven vital in speech and audio-related applications. The paradigm trains a general model on unlabeled data that can later be used to solve specific downstream tasks. This type of model is costly to train as it requires manipulating long input sequences that can only be handled by powerful centralised servers. Surprisingly, despite many attempts to increase trai… ▽ More

    Submitted 22 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

  8. arXiv:2208.02507  [pdf, other

    cs.LG cs.DC

    ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity

    Authors: Xinchi Qiu, Javier Fernandez-Marques, Pedro PB Gusmao, Yan Gao, Titouan Parcollet, Nicholas Donald Lane

    Abstract: When the available hardware cannot meet the memory and compute requirements to efficiently train high performing machine learning models, a compromise in either the training quality or the model complexity is needed. In Federated Learning (FL), nodes are orders of magnitude more constrained than traditional server-grade hardware and are often battery powered, severely limiting the sophistication o… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Published as a conference paper at ICLR 2022

    Journal ref: International Conference on Learning Representations, 2022

  9. arXiv:2207.01053  [pdf, other

    cs.LG cs.AI cs.DC cs.PF

    Protea: Client Profiling within Federated Systems using Flower

    Authors: Wanru Zhao, Xinchi Qiu, Javier Fernandez-Marques, Pedro P. B. de Gusmão, Nicholas D. Lane

    Abstract: Federated Learning (FL) has emerged as a prospective solution that facilitates the training of a high-performing centralised model without compromising the privacy of users. While successful, research is currently limited by the possibility of establishing a realistic large-scale FL system at the early stages of experimentation. Simulation can help accelerate this process. To facilitate efficient… ▽ More

    Submitted 31 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: 6 pages, 5 figures, Accepted at ACM MobiCom FedEdge Workshop, 2022

    Journal ref: 1st ACM Workshop on Data Privacy and Federated Learning Technologies for Mobile Edge Network (FedEdge'22), October 17,2022,Sydney, NSW, Australia

  10. arXiv:2206.11239  [pdf, other

    cs.LG

    FedorAS: Federated Architecture Search under system heterogeneity

    Authors: Lukasz Dudziak, Stefanos Laskaridis, Javier Fernandez-Marques

    Abstract: Federated learning (FL) has recently gained considerable attention due to its ability to learn on decentralised data while preserving client privacy. However, it also poses additional challenges related to the heterogeneity of the participating devices, both in terms of their computational capabilities and contributed data. Meanwhile, Neural Architecture Search (NAS) has been successfully used wit… ▽ More

    Submitted 29 September, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  11. arXiv:2204.02804  [pdf, other

    cs.SD cs.LG eess.AS

    Federated Self-supervised Speech Representations: Are We There Yet?

    Authors: Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Abhinav Mehrotra, Nicholas D. Lane

    Abstract: The ubiquity of microphone-enabled devices has lead to large amounts of unlabelled audio data being produced at the edge. The integration of self-supervised learning (SSL) and federated learning (FL) into one coherent system can potentially offer data privacy guarantees while also advancing the quality and robustness of speech representations. In this paper, we provide a first-of-its-kind systemat… ▽ More

    Submitted 19 July, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

  12. arXiv:2104.14297  [pdf, other

    cs.SD cs.LG eess.AS

    End-to-End Speech Recognition from Federated Acoustic Models

    Authors: Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro P. B. de Gusmao, Daniel J. Beutel, Nicholas D. Lane

    Abstract: Training Automatic Speech Recognition (ASR) models under federated learning (FL) settings has attracted a lot of attention recently. However, the FL scenarios often presented in the literature are artificial and fail to capture the complexity of real FL systems. In this paper, we construct a challenging and realistic ASR federated experimental setup consisting of clients with heterogeneous data di… ▽ More

    Submitted 9 July, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

  13. arXiv:2104.03042  [pdf, other

    cs.LG cs.AI cs.DC

    On-device Federated Learning with Flower

    Authors: Akhil Mathur, Daniel J. Beutel, Pedro Porto Buarque de Gusmão, Javier Fernandez-Marques, Taner Topal, Xinchi Qiu, Titouan Parcollet, Yan Gao, Nicholas D. Lane

    Abstract: Federated Learning (FL) allows edge devices to collaboratively learn a shared prediction model while kee** their training data on the device, thereby decoupling the ability to do machine learning from the need to store data in the cloud. Despite the algorithmic advancements in FL, the support for on-device training of FL algorithms on edge devices remains poor. In this paper, we present an explo… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Accepted at the 2nd On-device Intelligence Workshop @ MLSys 2021. arXiv admin note: substantial text overlap with arXiv:2007.14390

    ACM Class: I.0

    Journal ref: On-device Intelligence Workshop at the Fourth Conference on Machine Learning and Systems (MLSys), April 9, 2021

  14. arXiv:2103.05600  [pdf, other

    cs.CV cs.AR cs.LG

    unzipFPGA: Enhancing FPGA-based CNN Engines with On-the-Fly Weights Generation

    Authors: Stylianos I. Venieris, Javier Fernandez-Marques, Nicholas D. Lane

    Abstract: Single computation engines have become a popular design choice for FPGA-based convolutional neural networks (CNNs) enabling the deployment of diverse models without fabric reconfiguration. This flexibility, however, often comes with significantly reduced performance on memory-bound layers and resource underutilisation due to suboptimal map** of certain layers on the engine's fixed configuration.… ▽ More

    Submitted 3 April, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: Accepted at the 29th IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM) 2021

  15. arXiv:2102.07627  [pdf, other

    cs.LG cs.DC

    A first look into the carbon footprint of federated learning

    Authors: Xinchi Qiu, Titouan Parcollet, Javier Fernandez-Marques, Pedro Porto Buarque de Gusmao, Yan Gao, Daniel J. Beutel, Taner Topal, Akhil Mathur, Nicholas D. Lane

    Abstract: Despite impressive results, deep learning-based technologies also raise severe privacy and environmental concerns induced by the training procedure often conducted in data centers. In response, alternatives to centralized training such as Federated Learning (FL) have emerged. Perhaps unexpectedly, FL is starting to be deployed at a global scale by companies that must adhere to new legal demands an… ▽ More

    Submitted 22 May, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2010.06537

  16. arXiv:2008.05000  [pdf, other

    cs.LG stat.ML

    Degree-Quant: Quantization-Aware Training for Graph Neural Networks

    Authors: Shyam A. Tailor, Javier Fernandez-Marques, Nicholas D. Lane

    Abstract: Graph neural networks (GNNs) have demonstrated strong performance on a wide variety of tasks due to their ability to model non-uniform structured data. Despite their promise, there exists little research exploring methods to make them more efficient at inference time. In this work, we explore the viability of training quantized GNNs, enabling the usage of low precision integer arithmetic during in… ▽ More

    Submitted 15 March, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: Published as a conference paper at ICLR 2021

  17. arXiv:2007.14390  [pdf, other

    cs.LG cs.CV stat.ML

    Flower: A Friendly Federated Learning Research Framework

    Authors: Daniel J. Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Javier Fernandez-Marques, Yan Gao, Lorenzo Sani, Kwing Hei Li, Titouan Parcollet, Pedro Porto Buarque de Gusmão, Nicholas D. Lane

    Abstract: Federated Learning (FL) has emerged as a promising technique for edge devices to collaboratively learn a shared prediction model, while kee** their training data on the device, thereby decoupling the ability to do machine learning from the need to store the data in the cloud. However, FL is difficult to implement realistically, both in terms of scale and systems heterogeneity. Although there are… ▽ More

    Submitted 5 March, 2022; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Open-Source, mobile-friendly Federated Learning framework

  18. arXiv:2002.10711  [pdf, other

    cs.LG cs.CV stat.ML

    Searching for Winograd-aware Quantized Networks

    Authors: Javier Fernandez-Marques, Paul N. Whatmough, Andrew Mundy, Matthew Mattina

    Abstract: Lightweight architectural designs of Convolutional Neural Networks (CNNs) together with quantization have paved the way for the deployment of demanding computer vision applications on mobile devices. Parallel to this, alternative formulations to the convolution operation such as FFT, Strassen and Winograd, have been adapted for use in CNNs offering further speedups. Winograd convolutions are the f… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at MLSys 2020

    Journal ref: Proceedings of Machine Learning and Systems (2020), 14-29