Showing 1–8 of 8 results for author: Ansaloni, G

  1. arXiv:2402.12834  [pdf, other]

    cs.AR

    SAT-based Exact Modulo Scheduling Mapping for Resource-Constrained CGRAs

    Authors: Cristian Tirelli, Juan Sapriza, Rubén Rodríguez Álvarez, Lorenzo Ferretti, Benoît Denkinger, Giovanni Ansaloni, José Miranda Calero, David Atienza, Laura Pozzi

    Abstract: Coarse-Grain Reconfigurable Arrays (CGRAs) represent emerging low-power architectures designed to accelerate Compute-Intensive Loops (CILs). The effectiveness of CGRAs in providing acceleration relies on the quality of mapping: how efficiently the CIL is compiled onto the platform. State of the Art (SoA) compilation techniques utilize modulo scheduling to minimize the Iteration Interval (II) and u…

    Submitted 29 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  2. arXiv:2401.09420  [pdf, other]

    cs.ET

    LionHeart: A Layer-based Mapping Framework for Heterogeneous Systems with Analog In-Memory Computing Tiles

    Authors: Corey Lammie, Flavio Ponzina, Yuxuan Wang, Joshua Klein, Marina Zapater, Irem Boybat, Abu Sebastian, Giovanni Ansaloni, David Atienza

    Abstract: When arranged in a crossbar configuration, resistive memory devices can be used to execute MVM, the most dominant operation of many ML algorithms, in constant time complexity. Nonetheless, when performing computations in the analog domain, novel challenges are introduced in terms of arithmetic precision and stochasticity, due to non-ideal circuit and device behaviour. Moreover, these non-idealitie…

    Submitted 17 January, 2024; originally announced January 2024.

  3. arXiv:2312.13000  [pdf, other]

    cs.AR cs.AI

    Accelerator-driven Data Arrangement to Minimize Transformers Run-time on Multi-core Architectures

    Authors: Alireza Amirshahi, Giovanni Ansaloni, David Atienza

    Abstract: The increasing complexity of transformer models in artificial intelligence expands their computational costs, memory usage, and energy consumption. Hardware acceleration tackles the ensuing challenges by designing processors and accelerators tailored for transformer models, supporting their computation hotspots with high efficiency. However, memory bandwidth can hinder improvements in hardware acc…

    Submitted 20 December, 2023; originally announced December 2023.

  4. arXiv:2212.09358  [pdf, other]

    cs.AR

    A Soft SIMD Based Energy Efficient Computing Microarchitecture

    Authors: Pengbo Yu, Alexandre Levisse, Mohit Gupta, Evenblij Timon, Giovanni Ansaloni, Francky Catthoor, David Atienza

    Abstract: The ever-increasing size and computational complexity of today's machine-learning algorithms pose an increasing strain on the underlying hardware. In this light, novel and dedicated architectural solutions are required to optimize energy efficiency by leveraging opportunities (such as intrinsic parallelism and robustness to quantization errors) exposed by algorithms. We herein address this challen…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 6 pages, 10 figures

  5. arXiv:2209.06108  [pdf, other]

    cs.AR eess.IV

    Bit-Line Computing for CNN Accelerators Co-Design in Edge AI Inference

    Authors: Marco Rios, Flavio Ponzina, Alexandre Levisse, Giovanni Ansaloni, David Atienza

    Abstract: By supporting the access of multiple memory words at the same time, Bit-line Computing (BC) architectures allow the parallel execution of bit-wise operations in-memory. At the array periphery, arithmetic operations are then derived with little additional overhead. Such a paradigm opens novel opportunities for Artificial Intelligence (AI) at the edge, thanks to the massive parallelism inherent in m…

    Submitted 12 September, 2022; originally announced September 2022.

  6. arXiv:2207.01856  [pdf, other]

    eess.SP

    Event-based sampled ECG morphology reconstruction through self-similarity

    Authors: Silvio Zanoli, Tomas Teijeiro, Giovanni Ansaloni, David Atienza

    Abstract: Background and Objective: Event-based analog-to-digital converters allow for sparse bio-signal acquisition, enabling local sub-Nyquist sampling frequency. However, aggressive event selection can cause the loss of important bio-markers, not recoverable with standard interpolation techniques. In this work, we leverage the self-similarity of the electrocardiogram (ECG) signal to recover missing featu…

    Submitted 5 July, 2022; originally announced July 2022.

  7. ALPINE: Analog In-Memory Acceleration with Tight Processor Integration for Deep Learning

    Authors: Joshua Klein, Irem Boybat, Yasir Qureshi, Martino Dazzi, Alexandre Levisse, Giovanni Ansaloni, Marina Zapater, Abu Sebastian, David Atienza

    Abstract: Analog in-memory computing (AIMC) cores offer significant performance and energy benefits for neural network inference with respect to digital logic (e.g., CPUs). AIMCs accelerate matrix-vector multiplications, which dominate these applications' run-time. However, AIMC-centric platforms lack the flexibility of general-purpose systems, as they often have hard-coded data flows and can only support…

    Submitted 13 December, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE Transactions on Computers, December 2022

    ACM Class: C.4; I.6.0

  8. arXiv:2101.00587  [pdf, other]

    cs.AR

    DB4HLS: A Database of High-Level Synthesis Design Space Explorations

    Authors: Lorenzo Ferretti, Jihye Kwon, Giovanni Ansaloni, Giuseppe Di Guglielmo, Luca Carloni, Laura Pozzi

    Abstract: High-Level Synthesis (HLS) frameworks make it easy to specify a large number of variants of the same hardware design by acting only on optimization directives. Nonetheless, the hardware synthesis of implementations for all possible combinations of directive values is impractical even for simple designs. Addressing this shortcoming, many HLS Design Space Exploration (DSE) strategies have been propo…

    Submitted 3 January, 2021; originally announced January 2021.