Showing 1–16 of 16 results for author: Dimitrakopoulos, G

  1. arXiv:2402.10850  [pdf, other]

    cs.AR

    Error Checking for Sparse Systolic Tensor Arrays

    Authors: Christodoulos Peltekis, Dionysios Filippas, Giorgos Dimitrakopoulos

    Abstract: Structured sparsity is an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. In such cases, the acceleration of structured-sparse ML models is handled by sparse systolic tensor arrays. The increasing prevalence of ML in safety-critical systems requires enhancing the sparse tensor arrays with online error detec…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: AICAS 2024

  2. arXiv:2402.10118  [pdf, other]

    cs.AR cs.LG

    Reusing Softmax Hardware Unit for GELU Computation in Transformers

    Authors: Christodoulos Peltekis, Kosmas Alexandridis, Giorgos Dimitrakopoulos

    Abstract: Transformers have drastically improved the performance of natural language processing (NLP) and computer vision applications. The computation of transformers involves matrix multiplications and non-linear activation functions such as softmax and GELU (Gaussian Error Linear Unit) that are accelerated directly in hardware. Currently, function evaluation is done separately for each function and rarel…

    Submitted 16 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: AICAS 2024

  3. DeMM: A Decoupled Matrix Multiplication Engine Supporting Relaxed Structured Sparsity

    Authors: Christodoulos Peltekis, Vasileios Titopoulos, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos

    Abstract: Deep Learning (DL) has achieved unprecedented success in various application domains. Meanwhile, model pruning has emerged as a viable solution to reduce the footprint of DL models in mobile applications, without compromising their accuracy. To enable the matrix engines built for dense DL models to also handle their pruned counterparts, pruned DL models follow a fine-grained structured sparsity pa…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in IEEE Computer Architecture Letters

  4. arXiv:2311.07241  [pdf, other]

    cs.AR

    IndexMAC: A Custom RISC-V Vector Instruction to Accelerate Structured-Sparse Matrix Multiplications

    Authors: V. Titopoulos, K. Alexandridis, C. Peltekis, C. Nicopoulos, G. Dimitrakopoulos

    Abstract: Structured sparsity has been proposed as an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. The acceleration of ML models - for both training and inference - relies primarily on equivalent matrix multiplications that can be executed efficiently on vector processors or custom matrix engines. The goal of this…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: DATE 2024

  5. arXiv:2309.02969  [pdf, other]

    cs.AR

    The Case for Asymmetric Systolic Array Floorplanning

    Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos

    Abstract: The widespread proliferation of deep learning applications has triggered the need to accelerate them directly in hardware. General Matrix Multiplication (GEMM) kernels are elemental deep-learning constructs and they inherently map onto Systolic Arrays (SAs). SAs are regular structures that are well-suited for accelerating matrix multiplications. Typical SAs use a pipelined array of Processing Elem…

    Submitted 13 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: CNNA 2023

  6. Low-Power Data Streaming in Systolic Arrays with Bus-Invert Coding and Zero-Value Clock Gating

    Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos

    Abstract: Systolic Array (SA) architectures are well suited for accelerating matrix multiplications through the use of a pipelined array of Processing Elements (PEs) communicating with local connections and pre-orchestrated data movements. Even though most of the dynamic power consumption in SAs is due to multiplications and additions, pipelined data movement within the SA constitutes an additional importan…

    Submitted 8 September, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: International Conference on Modern Circuits and Systems Technologies (MOCAST)

  7. Reduced-Precision Floating-Point Arithmetic in Systolic Arrays with Skewed Pipelines

    Authors: D. Filippas, C. Peltekis, G. Dimitrakopoulos, C. Nicopoulos

    Abstract: The acceleration of deep-learning kernels in hardware relies on matrix multiplications that are executed efficiently on Systolic Arrays (SA). To effectively trade off deep-learning training/inference quality with hardware cost, SA accelerators employ reduced-precision Floating-Point (FP) arithmetic. In this work, we demonstrate the need for new pipeline organizations to reduce latency and improve…

    Submitted 8 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS) 2023

  8. ArrayFlex: A Systolic Array Architecture with Configurable Transparent Pipelining

    Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos, D. Pnevmatikatos

    Abstract: Convolutional Neural Networks (CNNs) are the state-of-the-art solution for many deep learning applications. For maximum scalability, their computation should combine high performance and energy efficiency. In practice, the convolutions of each CNN layer are mapped to a matrix multiplication that includes all input features and kernels of each layer and is computed using a systolic array. In this w…

    Submitted 6 June, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: DATE 2023

  9. arXiv:2204.00704  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    THz emission from Fe/Pt spintronic emitters with L1$_{0}$-FePt alloyed interface

    Authors: Laura Scheuer, Moritz Ruhwedel, Dimitris Karfaridis, Isaak G. Vasileiadis, Dominik Sokoluk, Garik Torosyan, George Vourlias, George P. Dimitrakopoulos, Marco Rahm, Burkard Hillebrands, Thomas Kehagias, René Beigang, Evangelos Th. Papaioannou

    Abstract: Recent developments in nanomagnetism and spintronics have enabled the use of ultrafast spin physics for terahertz (THz) emission. Spintronic THz emitters, consisting of ferromagnetic (FM) / non-magnetic (NM) thin film heterostructures, have demonstrated impressive properties for use in THz spectroscopy and have great potential in scientific and industrial applications. In this work, we focus on…

    Submitted 1 April, 2022; originally announced April 2022.

    Journal ref: iScience 25, 104319 (2022)

  10. Blockchain-based Recommender Systems: Applications, Challenges and Future Opportunities

    Authors: Yassine Himeur, Aya Sayed, Abdullah Alsalemi, Faycal Bensaali, Abbes Amira, Iraklis Varlamis, Magdalini Eirinaki, Christos Sardianos, George Dimitrakopoulos

    Abstract: Recommender systems have been widely used in different application domains including energy-preservation, e-commerce, healthcare, social media, etc. Such applications require the analysis and mining of massive amounts of various types of user data, including demographics, preferences, social interactions, etc. in order to develop accurate and precise recommender systems. Such datasets often includ…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 25 pages, 6 figures, 3 tables

    Journal ref: Computer Science Review, Volume 43, February 2022, 100439

  11. arXiv:2105.10460  [pdf, other]

    physics.soc-ph

    Marketability of building energy efficiency systems based on behavioral change: A case study of a novel micro-moments based solution

    Authors: Yassine Himeur, Abdullah Alsalemi, Faycal Bensaali, Abbes Amira, Iraklis Varlamis, George Bravos, Christos Sardianos, George Dimitrakopoulos

    Abstract: In spite of the substantial advance in developing energy-efficient buildings, power demand in the building sector is still remarkably growing due to teleworking and e-learning triggered by the COVID-19 movement restrictions. This is highlighted by the inefficiency of energy saving measures that have recently been set owing to the marketability failure and unsuccessful design integration of exi…

    Submitted 30 November, 2020; originally announced May 2021.

    Comments: 05 Figures and 04 Tables

  12. arXiv:2102.07654  [pdf, other]

    cs.IR

    A survey of recommender systems for energy efficiency in buildings: Principles, challenges and prospects

    Authors: Yassine Himeur, Abdullah Alsalemi, Ayman Al-Kababji, Faycal Bensaali, Abbes Amira, Christos Sardianos, George Dimitrakopoulos, Iraklis Varlamis

    Abstract: Recommender systems have significantly developed in recent years in parallel with the witnessed advancements in both internet of things (IoT) and artificial intelligence (AI) technologies. Accordingly, as a consequence of IoT and AI, multiple forms of data are incorporated in these systems, e.g. social, implicit, local and personal information, which can help in improving recommender systems' perf…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 35 pages, 11 figures, 1 table

    Journal ref: Information Fusion 2021

  13. The emergence of Explainability of Intelligent Systems: Delivering Explainable and Personalised Recommendations for Energy Efficiency

    Authors: Christos Sardianos, Iraklis Varlamis, Christos Chronis, George Dimitrakopoulos, Abdullah Alsalemi, Yassine Himeur, Faycal Bensaali, Abbes Amira

    Abstract: The recent advances in artificial intelligence, namely in machine learning and deep learning, have boosted the performance of intelligent systems in several ways. This gave rise to human expectations, but also created the need for a deeper understanding of how intelligent systems think and decide. The concept of explainability appeared, in the extent of explaining the internal system mechanics in h…

    Submitted 26 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: 19 pages, 8 figures, 1 table

    Journal ref: International Journal of Intelligent Systems, 2020

  14. arXiv:2010.04693  [pdf, other]

    cs.CY

    Reshaping consumption habits by exploiting energy-related micro-moment recommendations: A case study

    Authors: Christos Sardianos, Iraklis Varlamis, Christos Chronis, George Dimitrakopoulos, Abdullah Alsalemi, Yassine Himeur, Faycal Bensaali, Abbes Amira

    Abstract: The environmental change and its effects, caused by human influences and natural ecological processes over the last decade, prove that it is now more prudent than ever to transition to more sustainable models of energy consumption behaviors. User energy consumption is inductively derived from the time-to-time standards of living that shape the user's everyday consumption habits. This work builds o…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: This paper will appear in Communications in Computer and Information Science (CCIS) - Springer Book - [Smartgreens extension]

  15. Memristive Learning Cellular Automata: Theory and Applications

    Authors: Rafailia-Eleni Karamani, Iosif-Angelos Fyrigos, Vasileios Ntinas, Orestis Liolis, Giorgos Dimitrakopoulos, Mustafa Altun, Andrew Adamatzky, Mircea R. Stan, Georgios Ch. Sirakoulis

    Abstract: Memristors are novel non-volatile devices that manage to combine storing and processing capabilities in the same physical place. Their nanoscale dimensions and low power consumption enable the further design of various nanoelectronic processing circuits and corresponding computing architectures, like neuromorphic, in-memory, unconventional, etc. One of the possible ways to exploit the memristor's ad…

    Submitted 15 March, 2020; originally announced March 2020.

  16. Memristive oscillatory circuits for resolution of NP-complete logic puzzles: Sudoku case

    Authors: Theodoros Panagiotis Chatzinikolaou, Iosif-Angelos Fyrigos, Rafailia-Eleni Karamani, Vasileios Ntinas, Giorgos Dimitrakopoulos, Sorin Cotofana, Georgios Ch. Sirakoulis

    Abstract: Memristor networks are capable of low-power and massively parallel processing and information storage. Moreover, they have been shown to be applicable to a vast number of intelligent data analysis applications targeting mobile edge devices and low power computing. Beyond the memory and conventional computing architectures, memristors are widely studied in circuits aiming for increased intelligen…

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: To be presented/published in the IEEE International Symposium on Circuits and Systems (ISCAS) 2020