Showing 1–18 of 18 results for author: Ginosar, R

  1. arXiv:2206.01978

    cs.HC

    Inbetween: Visual Selection in Parametric Design

    Authors: Rony Ginosar, Amit Zoran

    Abstract: The act of selection plays a leading role in the design process and in the definition of personal style. This work introduces visual selection catalogs into parametric design environments. A two-fold contribution is presented: (i) guidelines for construction of a minimal-bias visual selection catalog from a parametric space, and (ii) Inbetween, a catalog for a parametric typeface that adheres to t…

    Submitted 4 June, 2022; originally announced June 2022.

    Comments: tool can be found at https://ronyginosar.github.io/parametricSpecimen

  2. arXiv:2010.02825

    cs.AR cs.ET

    WoLFRaM: Enhancing Wear-Leveling and Fault Tolerance in Resistive Memories using Programmable Address Decoders

    Authors: Leonid Yavits, Lois Orosa, Suyash Mahar, João Dinis Ferreira, Mattan Erez, Ran Ginosar, Onur Mutlu

    Abstract: Resistive memories have limited lifetime caused by limited write endurance and highly non-uniform write access patterns. Two main techniques to mitigate endurance-related memory failures are 1) wear-leveling, to evenly distribute the writes across the entire memory, and 2) fault tolerance, to correct memory cell failures. However, one of the main open challenges in extending the lifetime of existi…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: To appear in ICCD 2020

  3. arXiv:1901.05959

    cs.ET cs.AR

    BioSEAL: In-Memory Biological Sequence Alignment Accelerator for Large-Scale Genomic Data

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar

    Abstract: Genome sequences contain hundreds of millions of DNA base pairs. Finding the degree of similarity between two genomes requires executing a compute-intensive dynamic programming algorithm, such as Smith-Waterman. Traditional von Neumann architectures have limited parallelism and cannot provide an efficient solution for large-scale genomic data. Approximate heuristic methods (e.g. BLAST) are commonl…

    Submitted 17 January, 2019; originally announced January 2019.

  4. arXiv:1901.04976

    cs.DC

    AIDA: Associative DNN Inference Accelerator

    Authors: Leonid Yavits, Roman Kaplan, Ran Ginosar

    Abstract: We propose AIDA, an inference engine for accelerating fully-connected (FC) layers of Deep Neural Network (DNN). AIDA is an associative in-memory processor, where the bulk of data never leaves the confines of the memory arrays, and processing is performed in-situ. AIDA area and energy efficiency strongly benefit from sparsity and lower arithmetic precision. We show that AIDA outperforms the state o…

    Submitted 20 December, 2018; originally announced January 2019.

  5. arXiv:1809.01127

    q-bio.GN cs.ET

    RASSA: Resistive Pre-Alignment Accelerator for Approximate DNA Long Read Mapping

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar

    Abstract: DNA read mapping is a computationally expensive bioinformatics task, required for genome assembly and consensus polishing. It requires to find the best-fitting location for each DNA read on a long reference sequence. A novel resistive approximate similarity search accelerator, RASSA, exploits charge distribution and parallel in-memory processing to reflect a mismatch count between DNA sequences. R…

    Submitted 28 January, 2019; v1 submitted 2 September, 2018; originally announced September 2018.

  6. arXiv:1805.09612

    cs.AR

    PRINS: Resistive CAM Processing in Storage

    Authors: Leonid Yavits, Roman Kaplan, Ran Ginosar

    Abstract: Near-data in-storage processing research has been gaining momentum in recent years. Typical processing-in-storage architecture places a single or several processing cores inside the storage and allows data processing without transferring it to the host CPU. Since this approach replicates von Neumann architecture inside storage, it is exposed to the problems faced by von Neumann architecture, espec…

    Submitted 17 March, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

  7. arXiv:1705.09937

    cs.AR

    Sparse Matrix Multiplication on CAM Based Accelerator

    Authors: Leonid Yavits, Ran Ginosar

    Abstract: Sparse matrix multiplication is an important component of linear algebra computations. In this paper, an architecture based on Content Addressable Memory (CAM) and Resistive Content Addressable Memory (ReCAM) is proposed for accelerating sparse matrix by sparse vector and matrix multiplication in CSR format. Using functional simulation, we show that the proposed ReCAM-based accelerator exhibits tw…

    Submitted 28 May, 2017; originally announced May 2017.

  8. arXiv:1705.07282

    cs.MS

    Sparse Matrix Multiplication On An Associative Processor

    Authors: L. Yavits, A. Morad, R. Ginosar

    Abstract: Sparse matrix multiplication is an important component of linear algebra computations. Implementing sparse matrix multiplication on an associative processor (AP) enables high level of parallelism, where a row of one matrix is multiplied in parallel with the entire second matrix, and where the execution time of vector dot product does not depend on the vector size. Four sparse matrix multiplication…

    Submitted 20 May, 2017; originally announced May 2017.

  9. arXiv:1705.07281

    cs.AR

    Cache Hierarchy Optimization

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: Power consumption, off-chip memory bandwidth, chip area and Network on Chip (NoC) capacity are among main chip resources limiting the scalability of Chip Multiprocessors (CMP). A closed form analytical solution for optimizing the CMP cache hierarchy and optimally allocating area among hierarchy levels under such constrained resources is developed. The optimization framework is extended by incorpor…

    Submitted 20 May, 2017; originally announced May 2017.

  10. arXiv:1705.07280

    cs.AR

    The Effect of Temperature on Amdahl Law in 3D Multicore Era

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: This work studies the influence of temperature on performance and scalability of 3D Chip Multiprocessors (CMP) from Amdahl law perspective. We find that 3D CMP may reach its thermal limit before reaching its maximum power. We show that a high level of parallelism may lead to high peak temperatures even in small scale 3D CMPs, thus limiting 3D CMP scalability and calling for different, in-memory co…

    Submitted 20 May, 2017; originally announced May 2017.

  11. arXiv:1705.06923

    cs.AR

    MultiAmdahl: Optimal Resource Allocation in Heterogeneous Architectures

    Authors: Leonid Yavits, Amir Morad, Uri Weiser, Ran Ginosar

    Abstract: Future multiprocessor chips will integrate many different units, each tailored to a specific computation. When designing such a system, the chip architect must decide how to distribute limited system resources such as area, power, and energy among the computational units. We extend MultiAmdahl, an analytical optimization technique for resource allocation in heterogeneous architectures, for energy…

    Submitted 19 May, 2017; originally announced May 2017.

  12. A Resistive CAM Processing-in-Storage Architecture for DNA Sequence Alignment

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar, Uri Weiser

    Abstract: A novel processing-in-storage (PRinS) architecture based on Resistive CAM (ReCAM) is described and proposed for Smith-Waterman (S-W) sequence alignment. The ReCAM massively-parallel compare operation finds matching base-pairs in a fixed number of cycles, regardless of sequence length. The ReCAM PRinS S-W algorithm is simulated and compared to FPGA, Xeon Phi and GPU-based implementations, showing a…

    Submitted 11 June, 2017; v1 submitted 17 January, 2017; originally announced January 2017.

    Journal ref: IEEE Micro, vol. 37, no. 4, pp. 20-28, 2017

  13. arXiv:1602.01329

    cs.AR

    Effect of Data Sharing on Private Cache Design in Chip Multiprocessors

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: In multithreaded applications with high degree of data sharing, the miss rate of private cache is shown to exhibit a compulsory miss component. It manifests because at least some of the shared data originates from other cores and can only be accessed in a shared cache. The compulsory component does not change with the private cache size, causing its miss rate to diminish slower as the cache size g…

    Submitted 3 February, 2016; originally announced February 2016.

    Comments: 4 pages, 4 figures

  14. arXiv:1601.07815

    cs.DC cs.PF

    Convex Optimization of Real Time SoC

    Authors: L. Yavits, A. Morad, R. Ginosar, U. Weiser

    Abstract: Convex optimization methods are employed to optimize a real-time (RT) system-on-chip (SoC) under a variety of physical resource-driven constraints, demonstrated on an industry MPEG2 encoder SoC. The power optimization is compared to conventional performance-optimization framework, showing a factor of two and a half saving in power. Convex optimization is shown to be very efficient in a high-level…

    Submitted 19 May, 2017; v1 submitted 28 January, 2016; originally announced January 2016.

    Comments: 6 pages, 3 figures

  15. arXiv:1311.1667

    cs.AR

    3D Cache Hierarchy Optimization

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: 3D integration has the potential to improve the scalability and performance of Chip Multiprocessors (CMP). A closed form analytical solution for optimizing 3D CMP cache hierarchy is developed. It allows optimal partitioning of the cache hierarchy levels into 3D silicon layers and optimal allocation of area among cache hierarchy levels under constrained area and power budgets. The optimization fram…

    Submitted 7 November, 2013; originally announced November 2013.

  16. arXiv:1307.3853

    cs.AR

    Thermal analysis of 3D associative processor

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: Thermal density and hot spots limit three-dimensional (3D) implementation of massively-parallel SIMD processors and prohibit stacking DRAM dies above them. This study proposes replacing SIMD by an Associative Processor (AP). AP exhibits close to uniform thermal distribution with reduced hot spots. Additionally, AP may outperform SIMD processor when the data set size is sufficiently large, while di…

    Submitted 15 July, 2013; originally announced July 2013.

    Comments: arXiv admin note: text overlap with arXiv:1306.3109

  17. arXiv:1306.3302

    cs.AR cs.PF

    The Effect of Communication and Synchronization on Amdahl Law in Multicore Systems

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: This work analyses the effects of sequential-to-parallel synchronization and inter-core communication on multicore performance, speedup and scaling. A modification of Amdahl law is formulated, to reflect the finding that parallel speedup is lower than originally predicted, due to these effects. In applications with high inter-core communication requirements, the workload should be executed on a sm…

    Submitted 14 June, 2013; originally announced June 2013.

  18. arXiv:1306.3109   

    cs.AR

    Computer Architecture with Associative Processor Replacing Last Level Cache and SIMD Accelerator

    Authors: Leonid Yavits, Amir Morad, Ran Ginosar

    Abstract: This study presents a novel computer architecture where a last level cache and a SIMD accelerator are replaced by an Associative Processor. Associative Processor combines data storage and data processing and provides parallel computational capabilities and data memory at the same time. An analytic performance model of the new computer architecture is introduced. Comparative analysis supported by s…

    Submitted 8 November, 2013; v1 submitted 13 June, 2013; originally announced June 2013.

    Comments: This paper has been withdrawn by the author due to a crucial error in equation 10