Showing 1–11 of 11 results for author: Sifalakis, M

Searching in archive cs.
  1. arXiv:2406.17483

    cs.CV eess.IV

    TRIP: Trainable Region-of-Interest Prediction for Hardware-Efficient Neuromorphic Processing on Event-based Vision

    Authors: Cina Arjmand, Yingfu Xu, Kevin Shidqi, Alexandra F. Dobrita, Kanishkan Vadivel, Paul Detterer, Manolis Sifalakis, Amirreza Yousefzadeh, Guangzhi Tang

    Abstract: Neuromorphic processors are well-suited for efficiently handling sparse events from event-based cameras. However, they face significant challenges in the growth of computing demand and hardware costs as the input resolution increases. This paper proposes the Trainable Region-of-Interest Prediction (TRIP), the first hardware-efficient hard attention framework for event-based vision processing on a…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted in ICONS 2024

  2. arXiv:2406.17285

    cs.NE cs.AI cs.ET cs.LG

    EON-1: A Brain-Inspired Processor for Near-Sensor Extreme Edge Online Feature Extraction

    Authors: Alexandra Dobrita, Amirreza Yousefzadeh, Simon Thorpe, Kanishkan Vadivel, Paul Detterer, Guangzhi Tang, Gert-Jan van Schaik, Mario Konijnenburg, Anteneh Gebregiorgis, Said Hamdioui, Manolis Sifalakis

    Abstract: For Edge AI applications, deploying online learning and adaptation on resource-constrained embedded devices can deal with fast sensor-generated streams of data in changing environments. However, since maintaining low-latency and power-efficient inference is paramount at the Edge, online learning and adaptation on the device should impose minimal additional overhead for inference. With this goal in…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2404.10597

    cs.NE cs.AI cs.ET

    Hardware-aware training of models with synaptic delays for digital event-driven neuromorphic processors

    Authors: Alberto Patino-Saucedo, Roy Meijer, Amirreza Yousefzadeh, Manil-Dev Gomony, Federico Corradi, Paul Detterer, Laura Garrido-Regife, Bernabe Linares-Barranco, Manolis Sifalakis

    Abstract: Configurable synaptic delays are a basic feature in many neuromorphic neural network hardware accelerators. However, they have been rarely used in model implementations, despite their promising impact on performance and efficiency in tasks that exhibit complex (temporal) dynamics, as it has been unclear how to optimize them. In this work, we propose a framework to train and deploy, in digital neur…

    Submitted 16 April, 2024; originally announced April 2024.

  4. Empirical study on the efficiency of Spiking Neural Networks with axonal delays, and algorithm-hardware benchmarking

    Authors: Alberto Patiño-Saucedo, Amirreza Yousefzadeh, Guangzhi Tang, Federico Corradi, Bernabé Linares-Barranco, Manolis Sifalakis

    Abstract: The role of axonal synaptic delays in the efficacy and performance of artificial neural networks has been largely unexplored. In step-based analog-valued neural network models (ANNs), the concept is almost absent. In their spiking neuroscience-inspired counterparts, there is hardly a systematic account of their effects on model performance in terms of accuracy and number of synaptic operations. Thi…

    Submitted 11 September, 2023; originally announced September 2023.

  5. arXiv:2304.04640

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Denis Kleyko, Noah Pacik-Nelson, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, Brian Anderson, Andreas G. Andreou, Chiara Bartolozzi, Arindam Basu, et al. (73 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu…

    Submitted 17 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Updated from whitepaper to full perspective article preprint

  6. arXiv:2303.15224

    cs.NE eess.SY

    Open the box of digital neuromorphic processor: Towards effective algorithm-hardware co-design

    Authors: Guangzhi Tang, Ali Safa, Kevin Shidqi, Paul Detterer, Stefano Traferro, Mario Konijnenburg, Manolis Sifalakis, Gert-Jan van Schaik, Amirreza Yousefzadeh

    Abstract: Sparse and event-driven spiking neural network (SNN) algorithms are the ideal candidate solution for energy-efficient edge computing. Yet, with the growing complexity of SNN algorithms, it isn't easy to properly benchmark and optimize their computational cost without hardware in the loop. Although digital neuromorphic processors have been widely adopted to benchmark SNN algorithms, their black-box…

    Submitted 27 March, 2023; originally announced March 2023.

  7. arXiv:2107.07305

    cs.CV cs.LG eess.IV

    Training for temporal sparsity in deep neural networks, application in video processing

    Authors: Amirreza Yousefzadeh, Manolis Sifalakis

    Abstract: Activation sparsity improves compute efficiency and resource utilization in sparsity-aware neural network accelerators. As the predominant operation in DNNs is multiply-accumulate (MAC) of activations with weights to compute inner products, skipping operations where (at least) one of the two operands is zero can make inference more efficient in terms of latency and power. Spatial sparsification of…

    Submitted 15 July, 2021; originally announced July 2021.

  8. arXiv:1711.07227

    cs.IR cs.DC cs.DS

    Linear-Complexity Relaxed Word Mover's Distance with GPU Acceleration

    Authors: Kubilay Atasu, Thomas Parnell, Celestine Dünner, Manolis Sifalakis, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos, Cesar Berrospi, Abdel Labbi

    Abstract: The amount of unstructured text-based data is growing every day. Querying, clustering, and classifying this big data requires similarity computations across large sets of documents. Whereas low-complexity similarity metrics are available, attention has been shifting towards more complex methods that achieve a higher accuracy. In particular, the Word Mover's Distance (WMD) method proposed by Kusner…

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: To appear in the 2017 IEEE International Conference on Big Data (Big Data 2017) http://cci.drexel.edu/bigdata/bigdata2017/ December 11-14, 2017, Boston, MA, USA

  9. arXiv:1702.07005

    cs.LG cs.DC

    Large-Scale Stochastic Learning using GPUs

    Authors: Thomas Parnell, Celestine Dünner, Kubilay Atasu, Manolis Sifalakis, Haris Pozidis

    Abstract: In this work we propose an accelerated stochastic learning system for very large-scale applications. Acceleration is achieved by mapping the training algorithm onto massively parallel processors: we demonstrate a parallel, asynchronous GPU implementation of the widely used stochastic coordinate descent/ascent algorithm that can provide up to 35x speed-up over a sequential CPU implementation. In or…

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: Accepted for publication in ParLearning 2017: The 6th International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics, Orlando, Florida, May 2017

  10. Understanding and Optimizing the Performance of Distributed Machine Learning Applications on Apache Spark

    Authors: Celestine Dünner, Thomas Parnell, Kubilay Atasu, Manolis Sifalakis, Haralampos Pozidis

    Abstract: In this paper we explore the performance limits of Apache Spark for machine learning applications. We begin by analyzing the characteristics of a state-of-the-art distributed machine learning algorithm implemented in Spark and compare it to an equivalent reference implementation using the high performance computing framework MPI. We identify critical bottlenecks of the Spark framework and carefull…

    Submitted 12 December, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: To appear in the 2017 IEEE International Conference on Big Data (Big Data 2017), December 11-14, 2017, Boston, MA, USA

  11. arXiv:1601.05356

    cs.ET eess.SY

    Towards Programmable Network Dynamics: A Chemistry-Inspired Abstraction for Hardware Design

    Authors: Massimo Monti, Manolis Sifalakis, Christian F. Tschudin, Marco Luise

    Abstract: Chemical algorithms are statistical algorithms described and represented as chemical reaction networks. They are particularly attractive for traffic shaping and general control of network dynamics; they are analytically tractable, they reinforce a strict state-to-dynamics relationship, they have configurable stability properties, and they are directly implemented in state-space using a high-level…

    Submitted 20 January, 2016; originally announced January 2016.

    Comments: 14 pages, non-accepted version submitted to IEEE/ACM Transactions on Networking in May 2015 (after first submission in May 2014)