Skip to main content

Showing 1–3 of 3 results for author: Karunaratne, G

Searching in archive eess. Search in all archives.
.
  1. arXiv:2206.04796  [pdf, other

    cs.AR eess.SY

    Scale up your In-Memory Accelerator: Leveraging Wireless-on-Chip Communication for AIMC-based CNN Inference

    Authors: Nazareno Bruschi, Giuseppe Tagliavini, Francesco Conti, Sergi Abadal, Alberto Cabellos-Aparicio, Eduard Alarcón, Geethan Karunaratne, Irem Boybat, Luca Benini, Davide Rossi

    Abstract: Analog In-Memory Computing (AIMC) is emerging as a disruptive paradigm for heterogeneous computing, potentially delivering orders of magnitude better peak performance and efficiency over traditional digital signal processing architectures on Matrix-Vector multiplication. However, to sustain this throughput in real-world applications, AIMC tiles must be supplied with data at very high bandwidth and… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  2. arXiv:2005.07137  [pdf, other

    eess.SP cs.AR

    ChewBaccaNN: A Flexible 223 TOPS/W BNN Accelerator

    Authors: Renzo Andri, Geethan Karunaratne, Lukas Cavigelli, Luca Benini

    Abstract: Binary Neural Networks enable smart IoT devices, as they significantly reduce the required memory footprint and computational complexity while retaining a high network performance and flexibility. This paper presents ChewBaccaNN, a 0.7 mm$^2$ sized binary convolutional neural network (CNN) accelerator designed in GlobalFoundries 22 nm technology. By exploiting efficient data re-use, data buffering… ▽ More

    Submitted 26 February, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted at IEEE ISCAS 2021, Daegu, South Korea, 23-26 May 2021

  3. arXiv:1803.05849  [pdf, other

    cs.CV cs.AI cs.AR cs.NE eess.IV

    XNORBIN: A 95 TOp/s/W Hardware Accelerator for Binary Convolutional Neural Networks

    Authors: Andrawes Al Bahou, Geethan Karunaratne, Renzo Andri, Lukas Cavigelli, Luca Benini

    Abstract: Deploying state-of-the-art CNNs requires power-hungry processors and off-chip memory. This precludes the implementation of CNNs in low-power embedded systems. Recent research shows CNNs sustain extreme quantization, binarizing their weights and intermediate feature maps, thereby saving 8-32\x memory and collapsing energy-intensive sum-of-products into XNOR-and-popcount operations. We present XNO… ▽ More

    Submitted 5 March, 2018; originally announced March 2018.