Skip to main content

Showing 1–9 of 9 results for author: Calore, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.17815  [pdf, other

    cs.AR cs.AI

    A Survey on Design Methodologies for Accelerating Deep Learning on Heterogeneous Architectures

    Authors: Fabrizio Ferrandi, Serena Curzel, Leandro Fiorin, Daniele Ielmini, Cristina Silvano, Francesco Conti, Alessio Burrello, Francesco Barchi, Luca Benini, Luciano Lavagno, Teodoro Urso, Enrico Calore, Sebastiano Fabio Schifano, Cristian Zambelli, Maurizio Palesi, Giuseppe Ascia, Enrico Russo, Nicola Petra, Davide De Caro, Gennaro Di Meo, Valeria Cardellini, Salvatore Filippone, Francesco Lo Presti, Francesco Silvestri, Paolo Palazzari , et al. (1 additional authors not shown)

    Abstract: In recent years, the field of Deep Learning has seen many disruptive and impactful advancements. Given the increasing complexity of deep neural networks, the need for efficient hardware accelerators has become more and more pressing to design heterogeneous HPC platforms. The design of Deep Learning accelerators requires a multidisciplinary approach, combining expertise from several areas, spanning… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  2. arXiv:2306.15552  [pdf, other

    cs.AR cs.ET cs.LG

    A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms

    Authors: Cristina Silvano, Daniele Ielmini, Fabrizio Ferrandi, Leandro Fiorin, Serena Curzel, Luca Benini, Francesco Conti, Angelo Garofalo, Cristian Zambelli, Enrico Calore, Sebastiano Fabio Schifano, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Nicola Petra, Davide De Caro, Luciano Lavagno, Teodoro Urso, Valeria Cardellini, Gian Carlo Cardarilli, Robert Birke, Stefania Perri

    Abstract: Recent trends in deep learning (DL) imposed hardware accelerators as the most viable solution for several classes of high-performance computing (HPC) applications such as image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent advances in designing DL accelerators suitable to reach the performance requirements of HPC applications. In par… ▽ More

    Submitted 12 July, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Preprint version of our manuscript submitted to the journal @ ACM CSUR (58 pages including Appendix) on June 22nd, 2023. Major revision submitted on July 12th, 2024

  3. Early Experience on Using Knights Landing Processors for Lattice Boltzmann Applications

    Authors: Enrico Calore, Alessandro Gabbana, Sebastiano Fabio Schifano, Raffaele Tripiccione

    Abstract: The Knights Landing (KNL) is the codename for the latest generation of Intel processors based on Intel Many Integrated Core (MIC) architecture. It relies on massive thread and data parallelism, and fast on-chip memory. This processor operates in standalone mode, booting an off-the-shelf Linux operating system. The KNL peak performance is very high - approximately 3 Tflops in double precision and 6… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

  4. Energy-efficiency evaluation of Intel KNL for HPC workloads

    Authors: E. Calore, A. Gabbana, S. F. Schifano, R. Tripiccione

    Abstract: Energy consumption is increasingly becoming a limiting factor to the design of faster large-scale parallel systems, and development of energy-efficient and energy-aware applications is today a relevant issue for HPC code-developer communities. In this work we focus on energy performance of the Knights Landing (KNL) Xeon Phi, the latest many-core architecture processor introduced by Intel into the… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

  5. Optimization of Lattice Boltzmann Simulations on Heterogeneous Computers

    Authors: E. Calore, A. Gabbana, S. F. Schifano, R. Tripiccione

    Abstract: High-performance computing systems are more and more often based on accelerators. Computing applications targeting those systems often follow a host-driven approach in which hosts offload almost all compute-intensive sections of the code onto accelerators; this approach only marginally exploits the computational resources available on the host CPUs, limiting performance and energy efficiency. The… ▽ More

    Submitted 14 March, 2017; originally announced March 2017.

  6. arXiv:1703.02788  [pdf, ps, other

    cs.DC cs.PF

    Evaluation of DVFS techniques on modern HPC processors and accelerators for energy-aware applications

    Authors: Enrico Calore, Alessandro Gabbana, Sebastiano Fabio Schifano, Raffaele Tripiccione

    Abstract: Energy efficiency is becoming increasingly important for computing systems, in particular for large scale HPC facilities. In this work we evaluate, from an user perspective, the use of Dynamic Voltage and Frequency Scaling (DVFS) techniques, assisted by the power and energy monitoring capabilities of modern processors in order to tune applications for energy efficiency. We run selected kernels and… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

  7. Performance and Portability of Accelerated Lattice Boltzmann Applications with OpenACC

    Authors: E. Calore, A. Gabbana, J. Kraus, S. F. Schifano, R. Tripiccione

    Abstract: An increasingly large number of HPC systems rely on heterogeneous architectures combining traditional multi-core CPUs with power efficient accelerators. Designing efficient applications for these systems has been troublesome in the past as accelerators could usually be programmed using specific programming languages threatening maintainability, portability and correctness. Several new programming… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

  8. Massively parallel lattice-Boltzmann codes on large GPU clusters

    Authors: E. Calore, A. Gabbana, J. Kraus, E. Pellegrini, S. F. Schifano, R. Tripiccione

    Abstract: This paper describes a massively parallel code for a state-of-the art thermal lattice- Boltzmann method. Our code has been carefully optimized for performance on one GPU and to have a good scaling behavior extending to a large number of GPUs. Versions of this code have been already used for large-scale studies of convective turbulence. GPUs are becoming increasingly popular in HPC applications, as… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

  9. arXiv:1611.04833  [pdf, other

    cs.HC

    Steady State Visually Evoked Potentials detection using a single electrode consumer-grade EEG device for BCI applications

    Authors: Enrico Calore

    Abstract: Brain-Computer Interfaces (BCIs) implement a direct communication pathway between the brain of an user and an external device, as a computer or a machine in general. One of the most used brain responses to implement non-invasive BCIs is the so called steady-state visually evoked potential (SSVEP). This periodic response is generated when an user gazes to a light flickering at a constant frequency.… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

    Comments: Work conducted between 2013 and 2014