Skip to main content

Showing 1–3 of 3 results for author: Qararyah, F

.
  1. arXiv:2404.19331  [pdf, other

    cs.PF cs.AR cs.DC

    Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs

    Authors: Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso

    Abstract: Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including convolutional neural networks (CNNs) and vision transformers (ViTs). However, they have a lower compute-to-memory-access ratio than standard convolutions, making their memory accesses often the perform… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  2. arXiv:2305.05388  [pdf, other

    cs.AR cs.AI cs.CR

    VEDLIoT -- Next generation accelerated AIoT systems and applications

    Authors: Kevin Mika, René Griessl, Nils Kucza, Florian Porrmann, Martin Kaiser, Lennart Tigges, Jens Hagemeyer, Pedro Trancoso, Muhammad Waqar Azhar, Fareed Qararyah, Stavroula Zouzoula, Jämes Ménétrey, Marcelo Pasin, Pascal Felber, Carina Marcus, Oliver Brunnegard, Olof Eriksson, Hans Salomonsson, Daniel Ödman, Andreas Ask, Antonio Casimiro, Alysson Bessani, Tiago Carvalho, Karol Gugala, Piotr Zierhoffer , et al. (7 additional authors not shown)

    Abstract: The VEDLIoT project aims to develop energy-efficient Deep Learning methodologies for distributed Artificial Intelligence of Things (AIoT) applications. During our project, we propose a holistic approach that focuses on optimizing algorithms while addressing safety and security challenges inherent to AIoT systems. The foundation of this approach lies in a modular and scalable cognitive IoT hardware… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: This publication incorporates results from the VEDLIoT project, which received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 957197. CF'23: 20th ACM International Conference on Computing Frontiers, May 2023, Bologna, Italy

    Journal ref: CF'23: 20th ACM International Conference on Computing Frontiers, May 2023, Bologna, Italy

  3. A Computational-Graph Partitioning Method for Training Memory-Constrained DNNs

    Authors: Fareed Qararyah, Mohamed Wahib, Doğa Dikbayır, Mehmet Esat Belviranli, Didem Unat

    Abstract: Many state-of-the-art Deep Neural Networks (DNNs) have substantial memory requirements. Limited device memory becomes a bottleneck when training those models. We propose ParDNN, an automatic, generic, and non-intrusive partitioning strategy for DNNs that are represented as computational graphs. ParDNN decides a placement of DNN's underlying computational graph operations across multiple devices so… ▽ More

    Submitted 5 May, 2021; v1 submitted 19 August, 2020; originally announced August 2020.