Showing 1–9 of 9 results for author: Ünsal, O S

  1. Adaptable Register File Organization for Vector Processors

    Authors: Cristóbal Ramírez Lazo, Enrico Reggiani, Carlos Rojas Morales, Roger Figueras Bagué, Luis Alfonso Villa Vargas, Marco Antonio Ramírez Salinas, Mateo Valero Cortés, Osman Sabri Unsal, Adrián Cristal

    Abstract: Modern scientific applications are getting more diverse, and the vector lengths in those applications vary widely. Contemporary Vector Processors (VPs) are designed either for short vector lengths, e.g., Fujitsu A64FX with 512-bit ARM SVE vector support, or long vectors, e.g., NEC Aurora Tsubasa with 16Kbits Maximum Vector Length (MVL). Unfortunately, both approaches have drawbacks. On the one han…

    Submitted 29 May, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022)

  2. A RISC-V Simulator and Benchmark Suite for Designing and Evaluating Vector Architectures

    Authors: Cristóbal Ramírez Lazo, César Alejandro Hernández, Oscar Palomar, Osman Sabri Unsal, Marco Antonio Ramírez, Adrián Cristal

    Abstract: Vector architectures lack tools for research. Consider the gem5 simulator, which is possibly the leading platform for computer-system architecture research. Unfortunately, gem5 does not have an available distribution that includes a flexible and customizable vector architecture model. In consequence, researchers have to develop their own simulation platform to test their ideas, which consume much…

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: ACM Transactions on Architecture and Code Optimization, Volume 17, Issue 4, December 2020, Article No.38

  3. arXiv:2110.05855  [pdf, other]

    cs.AR

    MoRS: An Approximate Fault Modelling Framework for Reduced-Voltage SRAMs

    Authors: İsmail Emir Yüksel, Behzad Salami, Oğuz Ergin, Osman Sabri Ünsal, Adrian Cristal Kestelman

    Abstract: On-chip memory (usually based on Static RAMs-SRAMs) are crucial components for various computing devices including heterogeneous devices, e.g., GPUs, FPGAs, ASICs to achieve high performance. Modern workloads such as Deep Neural Networks (DNNs) running on these heterogeneous fabrics are highly dependent on the on-chip memory architecture for efficient acceleration. Hence, improving the energy-effi…

    Submitted 19 July, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 13 pages, 10 figures. This work appears at the Transactions on Computer-Aided Design of Integrated Circuits and Systems: SI on Compiler Frameworks and Co-design Methodologies

  4. arXiv:2101.00969  [pdf, other]

    cs.AR

    Understanding Power Consumption and Reliability of High-Bandwidth Memory with Voltage Underscaling

    Authors: Seyed Saber Nabavi Larimi, Behzad Salami, Osman S. Unsal, Adrian Cristal Kestelman, Hamid Sarbazi-Azad, Onur Mutlu

    Abstract: Modern computing devices employ High-Bandwidth Memory (HBM) to meet their memory bandwidth requirements. An HBM-enabled device consists of multiple DRAM layers stacked on top of one another next to a compute chip (e.g. CPU, GPU, and FPGA) in the same package. Although such HBM structures provide high bandwidth at a small form factor, the stacked memory layers consume a substantial portion of the p…

    Submitted 30 December, 2020; originally announced January 2021.

    Comments: To appear at DATE 2021 conference

  5. Exceeding Conservative Limits: A Consolidated Analysis on Modern Hardware Margins

    Authors: George Papadimitriou, Athanasios Chatzidimitriou, Dimitris Gizopoulos, Vijay Janapa Reddi, Jingwen Leng, Behzad Salami, Osman S. Unsal, Adrian Cristal Kestelman

    Abstract: Modern large-scale computing systems (data centers, supercomputers, cloud and edge setups and high-end cyber-physical systems) employ heterogeneous architectures that consist of multicore CPUs, general-purpose many-core GPUs, and programmable FPGAs. The effective utilization of these architectures poses several challenges, among which a primary one is power consumption. Voltage reduction is one of…

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in IEEE Transactions on Device and Materials Reliability

  6. arXiv:2005.03451  [pdf, other]

    cs.LG

    An Experimental Study of Reduced-Voltage Operation in Modern FPGAs for Neural Network Acceleration

    Authors: Behzad Salami, Erhan Baturay Onural, Ismail Emir Yuksel, Fahrettin Koc, Oguz Ergin, Adrian Cristal Kestelman, Osman S. Unsal, Hamid Sarbazi-Azad, Onur Mutlu

    Abstract: We empirically evaluate an undervolting technique, i.e., underscaling the circuit supply voltage below the nominal level, to improve the power-efficiency of Convolutional Neural Network (CNN) accelerators mapped to Field Programmable Gate Arrays (FPGAs). Undervolting below a safe voltage level can lead to timing faults due to excessive circuit latency increase. We evaluate the reliability-power tr…

    Submitted 30 December, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: To appear at the DSN 2020 conference

  7. arXiv:2001.00053  [pdf, other]

    cs.LG cs.NE

    On the Resilience of Deep Learning for Reduced-voltage FPGAs

    Authors: Kamyar Givaki, Behzad Salami, Reza Hojabr, S. M. Reza Tayaranian, Ahmad Khonsari, Dara Rahmati, Saeid Gorgin, Adrian Cristal, Osman S. Unsal

    Abstract: Deep Neural Networks (DNNs) are inherently computation-intensive and also power-hungry. Hardware accelerators such as Field Programmable Gate Arrays (FPGAs) are a promising solution that can satisfy these requirements for both embedded and High-Performance Computing (HPC) systems. In FPGAs, as well as CPUs and GPUs, aggressive voltage scaling below the nominal level is an effective technique for p…

    Submitted 26 December, 2019; originally announced January 2020.

  8. arXiv:1905.05567  [pdf, other]

    cs.LG cs.AI cs.NE

    TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture

    Authors: Gorker Alp Malazgirt, Osman S. Unsal, Adrian Cristal Kestelman

    Abstract: In this paper, we propose TauRieL and target Traveling Salesman Problem (TSP) since it has broad applicability in theoretical and applied sciences. TauRieL utilizes an actor-critic inspired architecture that adopts ordinary feedforward nets to obtain a policy update vector $v$. Then, we use $v$ to improve the state transition matrix from which we generate the policy. Also, the state transition mat…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 10 pages, 5 figures, 1 Algorithm, 4 Tables

  9. Evaluating Built-in ECC of FPGA on-chip Memories for the Mitigation of Undervolting Faults

    Authors: Behzad Salami, Osman S. Unsal, Adrian Cristal Kestelman

    Abstract: Voltage underscaling below the nominal level is an effective solution for improving energy efficiency in digital circuits, e.g., Field Programmable Gate Arrays (FPGAs). However, further undervolting below a safe voltage level and without accompanying frequency scaling leads to timing related faults, potentially undermining the energy savings. Through experimental voltage underscaling studies on co…

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: 6 pages, 2 figures