Search | arXiv e-print repository

Error Checking for Sparse Systolic Tensor Arrays

Authors: Christodoulos Peltekis, Dionysios Filippas, Giorgos Dimitrakopoulos

Abstract: Structured sparsity is an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. In such cases, the acceleration of structured-sparse ML models is handled by sparse systolic tensor arrays. The increasing prevalence of ML in safety-critical systems requires enhancing the sparse tensor arrays with online error detec… ▽ More Structured sparsity is an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. In such cases, the acceleration of structured-sparse ML models is handled by sparse systolic tensor arrays. The increasing prevalence of ML in safety-critical systems requires enhancing the sparse tensor arrays with online error detection for managing random hardware failures. Algorithm-based fault tolerance has been proposed as a low-cost mechanism to check online the result of computations against random hardware failures. In this work, we address a key architectural challenge with structured-sparse tensor arrays: how to provide online error checking for a range of structured sparsity levels while maintaining high utilization of the hardware. Experimental results highlight the minimum hardware overhead incurred by the proposed checking logic and its error detection properties after injecting random hardware faults on sparse tensor arrays that execute layers of ResNet50 CNN. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: AICAS 2024

arXiv:2402.10118 [pdf, other]

Reusing Softmax Hardware Unit for GELU Computation in Transformers

Authors: Christodoulos Peltekis, Kosmas Alexandridis, Giorgos Dimitrakopoulos

Abstract: Transformers have improved drastically the performance of natural language processing (NLP) and computer vision applications. The computation of transformers involves matrix multiplications and non-linear activation functions such as softmax and GELU (Gaussion Error Linear Unit) that are accelerated directly in hardware. Currently, function evaluation is done separately for each function and rarel… ▽ More Transformers have improved drastically the performance of natural language processing (NLP) and computer vision applications. The computation of transformers involves matrix multiplications and non-linear activation functions such as softmax and GELU (Gaussion Error Linear Unit) that are accelerated directly in hardware. Currently, function evaluation is done separately for each function and rarely allows for hardware reuse. To mitigate this problem, in this work, we map the computation of GELU to a softmax operator. In this way, the efficient hardware units designed already for softmax can be reused for computing GELU as well. Computation of GELU can enjoy the inherent vectorized nature of softmax and produce in parallel multiple GELU outcomes. Experimental results show that computing GELU via a pre-existing and incrementally modified softmax hardware unit (a) does not reduce the accuracy of representative NLP applications and (b) allows the reduction of the overall hardware area and power by 6.1% and 11.9%, respectively, on average. △ Less

Submitted 16 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: AICAS 2024

arXiv:2401.08179 [pdf, ps, other]

doi 10.1109/LCA.2024.3355178

DeMM: A Decoupled Matrix Multiplication Engine Supporting Relaxed Structured Sparsity

Authors: Christodoulos Peltekis, Vasileios Titopoulos, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos

Abstract: Deep Learning (DL) has achieved unprecedented success in various application domains. Meanwhile, model pruning has emerged as a viable solution to reduce the footprint of DL models in mobile applications, without compromising their accuracy. To enable the matrix engines built for dense DL models to also handle their pruned counterparts, pruned DL models follow a fine-grained structured sparsity pa… ▽ More Deep Learning (DL) has achieved unprecedented success in various application domains. Meanwhile, model pruning has emerged as a viable solution to reduce the footprint of DL models in mobile applications, without compromising their accuracy. To enable the matrix engines built for dense DL models to also handle their pruned counterparts, pruned DL models follow a fine-grained structured sparsity pattern of 1:4, or 2:4, whereby in each group of four contiguous values, at least one, or two, respectively, must be non-zero. Structured sparsity has recently also moved to coarser (relaxed) cases of N:128, or N:256, for small values of N, targeting a wider range of sparsity (10%-90%) for the DL models. In this work, we design an accelerator that operates, by construction, on wide blocks with relaxed structured sparsity. In contrast to the conventional systolic array archetype, the new engine decouples the memory part of the systolic array from the multiply-add units. The memory block comprises 1 write and N read ports, with the number of read ports being equal to the number of non-zero elements per row. The multiply-add units connect directly to each read port and complete the multiplication in a row-wise product-first order. More importantly, simple reconfiguration facilitates more dense patterns. The experimental evaluation demonstrates substantial latency improvements over current state-of-the-art systolic array engines built for fine-grained and relaxed structured sparsity. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted on the IEEE Computer Architecture Letters

arXiv:2311.07241 [pdf, other]

IndexMAC: A Custom RISC-V Vector Instruction to Accelerate Structured-Sparse Matrix Multiplications

Authors: V. Titopoulos, K. Alexandridis, C. Peltekis, C. Nicopoulos, G. Dimitrakopoulos

Abstract: Structured sparsity has been proposed as an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. The acceleration of ML models - for both training and inference - relies primarily on equivalent matrix multiplications that can be executed efficiently on vector processors or custom matrix engines. The goal of this… ▽ More Structured sparsity has been proposed as an efficient way to prune the complexity of modern Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. The acceleration of ML models - for both training and inference - relies primarily on equivalent matrix multiplications that can be executed efficiently on vector processors or custom matrix engines. The goal of this work is to incorporate the simplicity of structured sparsity into vector execution, thereby accelerating the corresponding matrix multiplications. Toward this objective, a new vector index-multiply-accumulate instruction is proposed, which enables the implementation of lowcost indirect reads from the vector register file. This reduces unnecessary memory traffic and increases data locality. The proposed new instruction was integrated in a decoupled RISCV vector processor with negligible hardware cost. Extensive evaluation demonstrates significant speedups of 1.80x-2.14x, as compared to state-of-the-art vectorized kernels, when executing layers of varying sparsity from state-of-the-art Convolutional Neural Networks (CNNs). △ Less

Submitted 13 November, 2023; originally announced November 2023.

Comments: DATE 2024

arXiv:2309.02969 [pdf, other]

The Case for Asymmetric Systolic Array Floorplanning

Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos

Abstract: The widespread proliferation of deep learning applications has triggered the need to accelerate them directly in hardware. General Matrix Multiplication (GEMM) kernels are elemental deep-learning constructs and they inherently map onto Systolic Arrays (SAs). SAs are regular structures that are well-suited for accelerating matrix multiplications. Typical SAs use a pipelined array of Processing Elem… ▽ More The widespread proliferation of deep learning applications has triggered the need to accelerate them directly in hardware. General Matrix Multiplication (GEMM) kernels are elemental deep-learning constructs and they inherently map onto Systolic Arrays (SAs). SAs are regular structures that are well-suited for accelerating matrix multiplications. Typical SAs use a pipelined array of Processing Elements (PEs), which communicate with local connections and pre-orchestrated data movements. In this work, we show that the physical layout of SAs should be asymmetric to minimize wirelength and improve energy efficiency. The floorplan of the SA adjusts better to the asymmetric widths of the horizontal and vertical data buses and their switching activity profiles. It is demonstrated that such physically asymmetric SAs reduce interconnect power by 9.1% when executing state-of-the-art Convolutional Neural Network (CNN) layers, as compared to SAs of the same size but with a square (i.e., symmetric) layout. The savings in interconnect power translate, in turn, to 2.1% overall power savings. △ Less

Submitted 13 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

Comments: CNNA 2023

arXiv:2304.12691 [pdf, other]

doi 10.1109/MOCAST57943.2023.10176467

Low-Power Data Streaming in Systolic Arrays with Bus-Invert Coding and Zero-Value Clock Gating

Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos

Abstract: Systolic Array (SA) architectures are well suited for accelerating matrix multiplications through the use of a pipelined array of Processing Elements (PEs) communicating with local connections and pre-orchestrated data movements. Even though most of the dynamic power consumption in SAs is due to multiplications and additions, pipelined data movement within the SA constitutes an additional importan… ▽ More Systolic Array (SA) architectures are well suited for accelerating matrix multiplications through the use of a pipelined array of Processing Elements (PEs) communicating with local connections and pre-orchestrated data movements. Even though most of the dynamic power consumption in SAs is due to multiplications and additions, pipelined data movement within the SA constitutes an additional important contributor. The goal of this work is to reduce the dynamic power consumption associated with the feeding of data to the SA, by synergistically applying bus-invert coding and zero-value clock gating. By exploiting salient attributes of state-of-the-art CNNs, such as the value distribution of the weights, the proposed SA applies appropriate encoding only to the data that exhibits high switching activity. Similarly, when one of the inputs is zero, unnecessary operations are entirely skipped. This selectively targeted, application-aware encoding approach is demonstrated to reduce the dynamic power consumption of data streaming in CNN applications using Bfloat16 arithmetic by 1%-19%. This translates to an overall dynamic power reduction of 6.2%-9.4%. △ Less

Submitted 8 September, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: International Conference on Modern Circuits and Systems Technologies (MOCAST)

arXiv:2304.01668 [pdf, other]

doi 10.1109/AICAS57966.2023.10168556

Reduced-Precision Floating-Point Arithmetic in Systolic Arrays with Skewed Pipelines

Authors: D. Filippas, C. Peltekis, G. Dimitrakopoulos, C. Nicopoulos

Abstract: The acceleration of deep-learning kernels in hardware relies on matrix multiplications that are executed efficiently on Systolic Arrays (SA). To effectively trade off deep-learning training/inference quality with hardware cost, SA accelerators employ reduced-precision Floating-Point (FP) arithmetic. In this work, we demonstrate the need for new pipeline organizations to reduce latency and improve… ▽ More The acceleration of deep-learning kernels in hardware relies on matrix multiplications that are executed efficiently on Systolic Arrays (SA). To effectively trade off deep-learning training/inference quality with hardware cost, SA accelerators employ reduced-precision Floating-Point (FP) arithmetic. In this work, we demonstrate the need for new pipeline organizations to reduce latency and improve energy efficiency of reduced-precision FP operators for the chained multiply-add operation imposed by the structure of the SA. The proposed skewed pipeline design reorganizes the pipelined operation of the FP multiply-add units to enable new forwarding paths for the exponent logic, which allow for parallel execution of the pipeline stages of consecutive PEs. As a result, the latency of the matrix multiplication operation within the SA is significantly reduced with minimal hardware cost, thereby yielding an energy reduction of 8% and 11% for the examined state-of-the-art CNNs. △ Less

Submitted 8 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

Comments: IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS) 2023

arXiv:2211.12600 [pdf, other]

doi 10.23919/DATE56975.2023.10136913

ArrayFlex: A Systolic Array Architecture with Configurable Transparent Pipelining

Authors: C. Peltekis, D. Filippas, G. Dimitrakopoulos, C. Nicopoulos, D. Pnevmatikatos

Abstract: Convolutional Neural Networks (CNNs) are the state-of-the-art solution for many deep learning applications. For maximum scalability, their computation should combine high performance and energy efficiency. In practice, the convolutions of each CNN layer are mapped to a matrix multiplication that includes all input features and kernels of each layer and is computed using a systolic array. In this w… ▽ More Convolutional Neural Networks (CNNs) are the state-of-the-art solution for many deep learning applications. For maximum scalability, their computation should combine high performance and energy efficiency. In practice, the convolutions of each CNN layer are mapped to a matrix multiplication that includes all input features and kernels of each layer and is computed using a systolic array. In this work, we focus on the design of a systolic array with configurable pipeline with the goal to select an optimal pipeline configuration for each CNN layer. The proposed systolic array, called ArrayFlex, can operate in normal, or in shallow pipeline mode, thus balancing the execution time in cycles and the operating clock frequency. By selecting the appropriate pipeline configuration per CNN layer, ArrayFlex reduces the inference latency of state-of-the-art CNNs by 11%, on average, as compared to a traditional fixed-pipeline systolic array. Most importantly, this result is achieved while using 13%-23% less power, for the same applications, thus offering a combined energy-delay-product efficiency between 1.4x and 1.8x. △ Less

Submitted 6 June, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

Comments: DATE 2023

arXiv:2204.00704 [pdf, other]

doi 10.1016/

doi j.isci.2022.104319

THz emission from Fe/Pt spintronic emitters with L1$_{0}$-FePt alloyed interface

Authors: Laura Scheuer, Moritz Ruhwedel, Dimitris Karfaridis, Isaak G. Vasileiadis, Dominik Sokoluk, Garik Torosyan, George Vourlias, George P. Dimitrakopoulos, Marco Rahm, Burkard Hillebrands, Thomas Kehagias, René Beigang, Evangelos Th. Papaioannou

Abstract: Recent developments in nanomagnetism and spintronics have enabled the use of ultrafast spin physics for terahertz (THz) emission. Spintronic THz emitters, consisting of ferromagnetic FM / non-magnetic (NM) thin film heterostructures, have demonstrated impressive properties for the use in THz spectroscopy and have great potential in scientific and industrial applications. In this work, we focus on… ▽ More Recent developments in nanomagnetism and spintronics have enabled the use of ultrafast spin physics for terahertz (THz) emission. Spintronic THz emitters, consisting of ferromagnetic FM / non-magnetic (NM) thin film heterostructures, have demonstrated impressive properties for the use in THz spectroscopy and have great potential in scientific and industrial applications. In this work, we focus on the impact of the FM/NM interface on the THz emission by investigating Fe/Pt bilayers with engineered interfaces. In particular, we intentionally modify the Fe/Pt interface by inserting an ordered L1$_{0}$-FePt alloy interlayer. Subsequently, we establish that a Fe/L1$_{0}$-FePt (2\,nm)/Pt configuration is significantly superior to a Fe/Pt bilayer structure, regarding THz emission amplitude. The latter depends on the extent of alloying on either side of the interface. The unique trilayer structure opens new perspectives in terms of material choices for the next generation of spintronic THz emitters. △ Less

Submitted 1 April, 2022; originally announced April 2022.

Journal ref: iScience 25, 104319 (2022)

arXiv:2111.11509 [pdf]

doi 10.1016/j.cosrev.2021.100439

Blockchain-based Recommender Systems: Applications, Challenges and Future Opportunities

Authors: Yassine Himeur, Aya Sayed, Abdullah Alsalemi, Faycal Bensaali, Abbes Amira, Iraklis Varlamis, Magdalini Eirinaki, Christos Sardianos, George Dimitrakopoulos

Abstract: Recommender systems have been widely used in different application domains including energy-preservation, e-commerce, healthcare, social media, etc. Such applications require the analysis and mining of massive amounts of various types of user data, including demographics, preferences, social interactions, etc. in order to develop accurate and precise recommender systems. Such datasets often includ… ▽ More Recommender systems have been widely used in different application domains including energy-preservation, e-commerce, healthcare, social media, etc. Such applications require the analysis and mining of massive amounts of various types of user data, including demographics, preferences, social interactions, etc. in order to develop accurate and precise recommender systems. Such datasets often include sensitive information, yet most recommender systems are focusing on the models' accuracy and ignore issues related to security and the users' privacy. Despite the efforts to overcome these problems using different risk reduction techniques, none of them has been completely successful in ensuring cryptographic security and protection of the users' private information. To bridge this gap, the blockchain technology is presented as a promising strategy to promote security and privacy preservation in recommender systems, not only because of its security and privacy salient features, but also due to its resilience, adaptability, fault tolerance and trust characteristics. This paper presents a holistic review of blockchain-based recommender systems covering challenges, open issues and solutions. Accordingly, a well-designed taxonomy is introduced to describe the security and privacy challenges, overview existing frameworks and discuss their applications and benefits when using blockchain before indicating opportunities for future research. △ Less

Submitted 22 November, 2021; originally announced November 2021.

Comments: 25 pages, 6 figures, 3 tables

Journal ref: Computer Science Review, Volume 43, February 2022, 100439

arXiv:2105.10460 [pdf, other]

Marketability of building energy efficiency systems based on behavioral change: A case study of a novel micro-moments based solution

Authors: Yassine Himeur, Abdullah Alsalemi, Faycal Bensaali, Abbes Amira, Iraklis Varlamis, George Bravos, Christos Sardianos, George Dimitrakopoulos

Abstract: In spite of the substantial advance in develo** energy-efficient buildings, power demand in the building sector is still remarkably growing due to teleworking and e-learning triggered by the COVID-19 movement restrictions. This is highlighted by the inefficiency of energy saving measures that have recently been set owing to the the marketability failure and unsuccessful design integration of exi… ▽ More In spite of the substantial advance in develo** energy-efficient buildings, power demand in the building sector is still remarkably growing due to teleworking and e-learning triggered by the COVID-19 movement restrictions. This is highlighted by the inefficiency of energy saving measures that have recently been set owing to the the marketability failure and unsuccessful design integration of existing ICT based solutions. Specifically, the factors that affect energy efficiency comprise natural and socio-economic restrictions, technological advances, and last but not least the human behavior. Energy efficiency based on behavioral change has attracted an increasing interest in the recent years, unfortunately, solutions in this area suffer from the lack of marketability because of the absence of both prospective studies and consumer awareness. This work, focuses on a key cost-effective technology for monitoring power consumption and for contributing to the behavioral change through smart, personalized, and on the moment, action recommendations. In order to examine the marketability of the proposed solution, we begin with a market and research analysis of the domain of energy efficiency systems in the building sector that target behavioral change. Moving forward, various factors that affect the commercialization of the final product are considered before proceeding with recommended actions to ensure its successful marketability via conducting a Go/No-Go analysis. Finally, based on the comprehensive commercialization study, a GO decision is concluded for the subject technology. △ Less

Submitted 30 November, 2020; originally announced May 2021.

Comments: 05 Figures and 04 Tables

arXiv:2102.07654 [pdf, other]

A survey of recommender systems for energy efficiency in buildings: Principles, challenges and prospects

Authors: Yassine Himeur, Abdullah Alsalemi, Ayman Al-Kababji, Faycal Bensaali, Abbes Amira, Christos Sardianos, George Dimitrakopoulos, Iraklis Varlamis

Abstract: Recommender systems have significantly developed in recent years in parallel with the witnessed advancements in both internet of things (IoT) and artificial intelligence (AI) technologies. Accordingly, as a consequence of IoT and AI, multiple forms of data are incorporated in these systems, e.g. social, implicit, local and personal information, which can help in improving recommender systems' perf… ▽ More Recommender systems have significantly developed in recent years in parallel with the witnessed advancements in both internet of things (IoT) and artificial intelligence (AI) technologies. Accordingly, as a consequence of IoT and AI, multiple forms of data are incorporated in these systems, e.g. social, implicit, local and personal information, which can help in improving recommender systems' performance and widen their applicability to traverse different disciplines. On the other side, energy efficiency in the building sector is becoming a hot research topic, in which recommender systems play a major role by promoting energy saving behavior and reducing carbon emissions. However, the deployment of the recommendation frameworks in buildings still needs more investigations to identify the current challenges and issues, where their solutions are the keys to enable the pervasiveness of research findings, and therefore, ensure a large-scale adoption of this technology. Accordingly, this paper presents, to the best of the authors' knowledge, the first timely and comprehensive reference for energy-efficiency recommendation systems through (i) surveying existing recommender systems for energy saving in buildings; (ii) discussing their evolution; (iii) providing an original taxonomy of these systems based on specified criteria, including the nature of the recommender engine, its objective, computing platforms, evaluation metrics and incentive measures; and (iv) conducting an in-depth, critical analysis to identify their limitations and unsolved issues. The derived challenges and areas of future implementation could effectively guide the energy research community to improve the energy-efficiency in buildings and reduce the cost of developed recommender systems-based solutions. △ Less

Submitted 9 February, 2021; originally announced February 2021.

Comments: 35 pages, 11 figures, 1 table

Journal ref: Information Fusion 2021

arXiv:2010.04990 [pdf, other]

doi 10.1002/int.22314

The emergence of Explainability of Intelligent Systems: Delivering Explainable and Personalised Recommendations for Energy Efficiency

Authors: Christos Sardianos, Iraklis Varlamis, Christos Chronis, George Dimitrakopoulos, Abdullah Alsalemi, Yassine Himeur, Faycal Bensaali, Abbes Amira

Abstract: The recent advances in artificial intelligence namely in machine learning and deep learning, have boosted the performance of intelligent systems in several ways. This gave rise to human expectations, but also created the need for a deeper understanding of how intelligent systems think and decide. The concept of explainability appeared, in the extent of explaining the internal system mechanics in h… ▽ More The recent advances in artificial intelligence namely in machine learning and deep learning, have boosted the performance of intelligent systems in several ways. This gave rise to human expectations, but also created the need for a deeper understanding of how intelligent systems think and decide. The concept of explainability appeared, in the extent of explaining the internal system mechanics in human terms. Recommendation systems are intelligent systems that support human decision making, and as such, they have to be explainable in order to increase user trust and improve the acceptance of recommendations. In this work, we focus on a context-aware recommendation system for energy efficiency and develop a mechanism for explainable and persuasive recommendations, which are personalized to user preferences and habits. The persuasive facts either emphasize on the economical saving prospects (Econ) or on a positive ecological impact (Eco) and explanations provide the reason for recommending an energy saving action. Based on a study conducted using a Telegram bot, different scenarios have been validated with actual data and human feedback. Current results show a total increase of 19\% on the recommendation acceptance ratio when both economical and ecological persuasive facts are employed. This revolutionary approach on recommendation systems, demonstrates how intelligent recommendations can effectively encourage energy saving behavior. △ Less

Submitted 26 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

Comments: 19 pages, 8 figures, 1 table

Journal ref: International Journal of Intelligent Systems, 2020

arXiv:2010.04693 [pdf, other]

Resha** consumption habits by exploiting energy-related micro-moment recommendations: A case study

Authors: Christos Sardianos, Iraklis Varlamis, Christos Chronis, George Dimitrakopoulos, Abdullah Alsalemi, Yassine Himeur, Faycal Bensaali, Abbes Amira

Abstract: The environmental change and its effects, caused by human influences and natural ecological processes over the last decade, prove that it is now more prudent than ever to transition to more sustainable models of energy consumption behaviors. User energy consumption is inductively derived from the time-to-time standards of living that shape the user's everyday consumption habits. This work builds o… ▽ More The environmental change and its effects, caused by human influences and natural ecological processes over the last decade, prove that it is now more prudent than ever to transition to more sustainable models of energy consumption behaviors. User energy consumption is inductively derived from the time-to-time standards of living that shape the user's everyday consumption habits. This work builds on the detection of repeated usage consumption patterns from consumption logs. It presents the structure and operation of an energy consumption reduction system, which employs a set of sensors, smart-meters and actuators in an office environment and targets specific user habits. Using our previous research findings on the value of energy-related micro-moment recommendations, the implemented system is an integrated solution that avoids unnecessary energy consumption. With the use of a messaging API, the system recommends to the user the proper energy saving action at the right moment and gradually shapes user's habits. The solution has been implemented on the Home Assistant open source platform, which allows the definition of automations for controlling the office equipment. Experimental evaluation with several scenarios shows that the system manages first to reduce energy consumption, and second, to trigger users' actions that could potentially urge them to more sustainable energy consumption habits. △ Less

Submitted 9 October, 2020; originally announced October 2020.

Comments: This paper will appear in Communications in Computer and Information Science( CCIS) - Springer Book - [Smartgreens extension]

arXiv:2003.06983 [pdf, other]

doi 10.1016/j.chaos.2021.110700

Memristive Learning Cellular Automata: Theory and Applications

Authors: Rafailia-Eleni Karamani, Iosif-Angelos Fyrigos, Vasileios Ntinas, Orestis Liolis, Giorgos Dimitrakopoulos, Mustafa Altun, Andrew Adamatzky, Mircea R. Stan, Georgios Ch. Sirakoulis

Abstract: Memristors are novel non volatile devices that manage to combine storing and processing capabilities in the same physical place.Their nanoscale dimensions and low power consumption enable the further design of various nanoelectronic processing circuits and corresponding computing architectures, like neuromorhpic, in memory, unconventional, etc.One of the possible ways to exploit the memristor's ad… ▽ More Memristors are novel non volatile devices that manage to combine storing and processing capabilities in the same physical place.Their nanoscale dimensions and low power consumption enable the further design of various nanoelectronic processing circuits and corresponding computing architectures, like neuromorhpic, in memory, unconventional, etc.One of the possible ways to exploit the memristor's advantages is by combining them with Cellular Automata (CA).CA constitute a well known non von Neumann computing architecture that is based on the local interconnection of simple identical cells forming N-dimensional grids.These local interconnections allow the emergence of global and complex phenomena.In this paper, we propose a hybridization of the CA original definition coupled with memristor based implementation, and, more specifically, we focus on Memristive Learning Cellular Automata (MLCA), which have the ability of learning using also simple identical interconnected cells and taking advantage of the memristor devices inherent variability.The proposed MLCA circuit level implementation is applied on optimal detection of edges in image processing through a series of SPICE simulations, proving its robustness and efficacy. △ Less

Submitted 15 March, 2020; originally announced March 2020.

arXiv:2002.06339 [pdf, other]

doi 10.1109/ISCAS45731.2020.9181110

Memristive oscillatory circuits for resolution of NP-complete logic puzzles: Sudoku case

Authors: Theodoros Panagiotis Chatzinikolaou, Iosif-Angelos Fyrigos, Rafailia-Eleni Karamani, Vasileios Ntinas, Giorgos Dimitrakopoulos, Sorin Cotofana, Georgios Ch. Sirakoulis

Abstract: Memristor networks are capable of low-power and massive parallel processing and information storage. Moreover, they have presented the ability to apply for a vast number of intelligent data analysis applications targeting mobile edge devices and low power computing. Beyond the memory and conventional computing architectures, memristors are widely studied in circuits aiming for increased intelligen… ▽ More Memristor networks are capable of low-power and massive parallel processing and information storage. Moreover, they have presented the ability to apply for a vast number of intelligent data analysis applications targeting mobile edge devices and low power computing. Beyond the memory and conventional computing architectures, memristors are widely studied in circuits aiming for increased intelligence that are suitable to tackle complex problems in a power and area efficient manner, offering viable solutions oftenly arriving also from the biological principles of living organisms. In this paper, a memristive circuit exploiting the dynamics of oscillating networks is utilized for the resolution of very popular and NP-complete logic puzzles, like the well-known "Sudoku". More specifically, the proposed circuit design methodology allows for appropriate usage of interconnections' advantages in a oscillation network and of memristor's switching dynamics resulting to logic-solvable puzzle-instances. The reduced complexity of the proposed circuit and its increased scalability constitute its main advantage against previous approaches and the broadly presented SPICE based simulations provide a clear proof of concept of the aforementioned appealing characteristics. △ Less

Submitted 15 February, 2020; originally announced February 2020.

Comments: To be presented/published in the IEEE International Symposium on Circuits and Systems (ISCAS) 2020

Showing 1–16 of 16 results for author: Dimitrakopoulos, G