Skip to main content

Showing 1–11 of 11 results for author: Haensch, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18261  [pdf

    cs.ET cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Error-Free and Current-Driven Synthetic Antiferromagnetic Domain Wall Memory Enabled by Channel Meandering

    Authors: Pengxiang Zhang, Wilfried Haensch, Charudatta M. Phatak, Supratik Guha

    Abstract: We propose a new type of multi-bit and energy-efficient magnetic memory based on current-driven, field-free, and highly controlled domain wall motion. A meandering domain wall channel with precisely interspersed pinning regions provides the multi-bit capability of a magnetic tunnel junction. The magnetic free layer of the memory device has perpendicular magnetic anisotropy and interfacial Dzyalosh… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 24 pages

  2. arXiv:2401.13754  [pdf, other

    math.NA cs.ET

    Multi-Function Multi-Way Analog Technology for Sustainable Machine Intelligence Computation

    Authors: Vassilis Kalantzis, Mark S. Squillante, Shashanka Ubaru, Tayfun Gokmen, Chai Wah Wu, Anshul Gupta, Haim Avron, Tomasz Nowicki, Malte Rasch, Murat Onen, Vanessa Lopez Marrero, Effendi Leobandung, Yasuteru Kohda, Wilfried Haensch, Lior Horesh

    Abstract: Numerical computation is essential to many areas of artificial intelligence (AI), whose computing demands continue to grow dramatically, yet their continued scaling is jeopardized by the slowdown in Moore's law. Multi-function multi-way analog (MFMWA) technology, a computing architecture comprising arrays of memristors supporting in-memory computation of matrix operations, can offer tremendous imp… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    MSC Class: 65F10; C3; G1 ACM Class: G.1.3

  3. arXiv:2312.03146  [pdf, other

    cs.AR

    LRMP: Layer Replication with Mixed Precision for Spatial In-memory DNN Accelerators

    Authors: Abinand Nallathambi, Christin David Bose, Wilfried Haensch, Anand Raghunathan

    Abstract: In-memory computing (IMC) with non-volatile memories (NVMs) has emerged as a promising approach to address the rapidly growing computational demands of Deep Neural Networks (DNNs). Map** DNN layers spatially onto NVM-based IMC accelerators achieves high degrees of parallelism. However, two challenges that arise in this approach are the highly non-uniform distribution of layer processing times an… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  4. arXiv:2206.08735  [pdf

    cs.ET cs.NE

    A Co-design view of Compute in-Memory with Non-Volatile Elements for Neural Networks

    Authors: Wilfried Haensch, Anand Raghunathan, Kaushik Roy, Bhaswar Chakrabarti, Charudatta M. Phatak, Cheng Wang, Supratik Guha

    Abstract: Deep Learning neural networks are pervasive, but traditional computer architectures are reaching the limits of being able to efficiently execute them for the large workloads of today. They are limited by the von Neumann bottleneck: the high cost in energy and latency incurred in moving data between memory and the compute engine. Today, special CMOS designs address this bottleneck. The next generat… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 56 pages, 15 figures

  5. arXiv:2201.13377  [pdf

    cs.LG cs.ET eess.SY

    Neural Network Training with Asymmetric Crosspoint Elements

    Authors: Murat Onen, Tayfun Gokmen, Teodor K. Todorov, Tomasz Nowicki, Jesus A. del Alamo, John Rozen, Wilfried Haensch, Seyoung Kim

    Abstract: Analog crossbar arrays comprising programmable nonvolatile resistors are under intense investigation for acceleration of deep neural network training. However, the ubiquitous asymmetric conductance modulation of practical resistive devices critically degrades the classification performance of networks trained with conventional algorithms. Here, we describe and experimentally demonstrate an alterna… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

  6. arXiv:1909.07908  [pdf

    cs.LG cs.ET cs.NE stat.ML

    Algorithm for Training Neural Networks on Resistive Device Arrays

    Authors: Tayfun Gokmen, Wilfried Haensch

    Abstract: Hardware architectures composed of resistive cross-point device arrays can provide significant power and speed benefits for deep neural network training workloads using stochastic gradient descent (SGD) and backpropagation (BP) algorithm. The training accuracy on this imminent analog hardware however strongly depends on the switching characteristics of the cross-point elements. One of the key requ… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 26 pages, 7 fiures

  7. arXiv:1906.02698  [pdf, ps, other

    cs.NE cs.ET cs.LG

    Training large-scale ANNs on simulated resistive crossbar arrays

    Authors: Malte J. Rasch, Tayfun Gokmen, Wilfried Haensch

    Abstract: Accelerating training of artificial neural networks (ANN) with analog resistive crossbar arrays is a promising idea. While the concept has been verified on very small ANNs and toy data sets (such as MNIST), more realistically sized ANNs and datasets have not yet been tackled. However, it is to be expected that device materials and hardware design constraints, such as noisy computations, finite num… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  8. arXiv:1807.01356  [pdf, ps, other

    cs.ET cs.LG stat.ML

    Efficient ConvNets for Analog Arrays

    Authors: Malte J. Rasch, Tayfun Gokmen, Mattia Rigotti, Wilfried Haensch

    Abstract: Analog arrays are a promising upcoming hardware technology with the potential to drastically speed up deep learning. Their main advantage is that they compute matrix-vector products in constant time, irrespective of the size of the matrix. However, early convolution layers in ConvNets map very unfavorably onto analog arrays, because kernel matrices are typically small and the constant time operati… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

  9. arXiv:1806.00166  [pdf

    cs.LG cs.ET stat.ML

    Training LSTM Networks with Resistive Cross-Point Devices

    Authors: Tayfun Gokmen, Malte Rasch, Wilfried Haensch

    Abstract: In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional neural networks. In this work, we further extend the RPU concept for training recurrent neural networks (RNNs) namely LSTMs. We show that the map** of recurre… ▽ More

    Submitted 31 May, 2018; originally announced June 2018.

    Comments: 17 pages, 5 figures

  10. Analog CMOS-based Resistive Processing Unit for Deep Neural Network Training

    Authors: Seyoung Kim, Tayfun Gokmen, Hyung-Min Lee, Wilfried E. Haensch

    Abstract: Recently we have shown that an architecture based on resistive processing unit (RPU) devices has potential to achieve significant acceleration in deep neural network (DNN) training compared to today's software-based DNN implementations running on CPU/GPU. However, currently available device candidates based on non-volatile memory technologies do not satisfy all the requirements to realize the RPU… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

  11. arXiv:1705.08014  [pdf

    cs.LG cs.NE stat.ML

    Training Deep Convolutional Neural Networks with Resistive Cross-Point Devices

    Authors: Tayfun Gokmen, O. Murat Onen, Wilfried Haensch

    Abstract: In a previous work we have detailed the requirements to obtain a maximal performance benefit by implementing fully connected deep neural networks (DNN) in form of arrays of resistive devices for deep learning. This concept of Resistive Processing Unit (RPU) devices we extend here towards convolutional neural networks (CNNs). We show how to map the convolutional layers to RPU arrays such that the p… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: 22 pages, 6 figures, 2 tables