Skip to main content

Showing 1–6 of 6 results for author: Lavagno, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.17815  [pdf, other

    cs.AR cs.AI

    A Survey on Design Methodologies for Accelerating Deep Learning on Heterogeneous Architectures

    Authors: Fabrizio Ferrandi, Serena Curzel, Leandro Fiorin, Daniele Ielmini, Cristina Silvano, Francesco Conti, Alessio Burrello, Francesco Barchi, Luca Benini, Luciano Lavagno, Teodoro Urso, Enrico Calore, Sebastiano Fabio Schifano, Cristian Zambelli, Maurizio Palesi, Giuseppe Ascia, Enrico Russo, Nicola Petra, Davide De Caro, Gennaro Di Meo, Valeria Cardellini, Salvatore Filippone, Francesco Lo Presti, Francesco Silvestri, Paolo Palazzari , et al. (1 additional authors not shown)

    Abstract: In recent years, the field of Deep Learning has seen many disruptive and impactful advancements. Given the increasing complexity of deep neural networks, the need for efficient hardware accelerators has become more and more pressing to design heterogeneous HPC platforms. The design of Deep Learning accelerators requires a multidisciplinary approach, combining expertise from several areas, spanning… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  2. A DSP shared is a DSP earned: HLS Task-Level Multi-Pum** for High-Performance Low-Resource Designs

    Authors: Giovanni Brignone, Mihai T. Lazarescu, Luciano Lavagno

    Abstract: High-level synthesis (HLS) enhances digital hardware design productivity through a high abstraction level. Even if the HLS abstraction prevents fine-grained manual register-transfer level (RTL) optimizations, it also enables automatable optimizations that would be unfeasible or hard to automate at RTL. Specifically, we propose a task-level multi-pum** methodology to reduce resource utilization,… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  3. arXiv:2309.15631  [pdf, other

    cs.AR eess.SP

    Design and Optimization of Residual Neural Network Accelerators for Low-Power FPGAs Using High-Level Synthesis

    Authors: Filippo Minnella, Teodoro Urso, Mihai T. Lazarescu, Luciano Lavagno

    Abstract: Residual neural networks are widely used in computer vision tasks. They enable the construction of deeper and more accurate models by mitigating the vanishing gradient problem. Their main innovation is the residual block which allows the output of one layer to bypass one or more intermediate layers and be added to the output of a later layer. Their complex structure and the buffering required by t… ▽ More

    Submitted 2 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  4. arXiv:2306.15749  [pdf, other

    cs.NE cs.AI cs.AR cs.LG

    To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration

    Authors: Fabrizio Ottati, Chang Gao, Qinyu Chen, Giovanni Brignone, Mario R. Casu, Jason K. Eshraghian, Luciano Lavagno

    Abstract: As deep learning models scale, they become increasingly competitive from domains spanning from computer vision to natural language processing; however, this happens at the expense of efficiency since they require increasingly more memory and computing power. The power efficiency of the biological brain outperforms any large-scale deep learning ( DL ) model; thus, neuromorphic computing tries to mi… ▽ More

    Submitted 28 January, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Fixed error in bio

  5. arXiv:2306.15552  [pdf, other

    cs.AR cs.ET cs.LG

    A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms

    Authors: Cristina Silvano, Daniele Ielmini, Fabrizio Ferrandi, Leandro Fiorin, Serena Curzel, Luca Benini, Francesco Conti, Angelo Garofalo, Cristian Zambelli, Enrico Calore, Sebastiano Fabio Schifano, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Stefania Perri, Nicola Petra, Davide De Caro, Luciano Lavagno, Teodoro Urso, Valeria Cardellini, Gian Carlo Cardarilli, Robert Birke

    Abstract: Recent trends in deep learning (DL) imposed hardware accelerators as the most viable solution for several classes of high-performance computing (HPC) applications such as image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent advances in designing DL accelerators suitable to reach the performance requirements of HPC applications. In par… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Preprint version of our manuscript submitted to the journal @ ACM CSUR (35 pages plus Appendix) on June 22nd, 2023

  6. Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs

    Authors: Yifan Yang, Qi**g Huang, Bichen Wu, Tianjun Zhang, Liang Ma, Giulio Gambardella, Michaela Blott, Luciano Lavagno, Kees Vissers, John Wawrzynek, Kurt Keutzer

    Abstract: Using FPGAs to accelerate ConvNets has attracted significant attention in recent years. However, FPGA accelerator design has not leveraged the latest progress of ConvNets. As a result, the key application characteristics such as frames-per-second (FPS) are ignored in favor of simply counting GOPs, and results on accuracy, which is critical to application success, are often not even reported. In th… ▽ More

    Submitted 10 May, 2020; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: Update to the latest results