Skip to main content

Showing 1–6 of 6 results for author: Tavakoli, E B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.02535   

    cs.CV

    TokenMotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection Via Learnable Token Selection

    Authors: Zifan Yu, Erfan Bank Tavakoli, Meida Chen, Suya You, Raghuveer Rao, Sanjeev Agarwal, Fengbo Ren

    Abstract: The area of Video Camouflaged Object Detection (VCOD) presents unique challenges in the field of computer vision due to texture similarities between target objects and their surroundings, as well as irregular motion patterns caused by both objects and camera movement. In this paper, we introduce TokenMotion (TMNet), which employs a transformer-based model to enhance VCOD by extracting motion-guide… ▽ More

    Submitted 1 February, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: Revising Needed

  2. arXiv:2112.10037  [pdf, other

    cs.PF cs.AR

    FSpGEMM: An OpenCL-based HPC Framework for Accelerating General Sparse Matrix-Matrix Multiplication on FPGAs

    Authors: Erfan Bank Tavakoli, Michael Riera, Masudul Hassan Quraishi, Fengbo Ren

    Abstract: General sparse matrix-matrix multiplication (SpGEMM) is an integral part of many scientific computing, high-performance computing (HPC), and graph analytic applications. This paper presents a new compressed sparse vector (CSV) format for representing sparse matrices and FSpGEMM, an OpenCL-based HPC framework for accelerating general sparse matrix-matrix multiplication on FPGAs. The proposed FSpGEM… ▽ More

    Submitted 18 December, 2021; originally announced December 2021.

    Comments: 12 pages

  3. arXiv:2106.13645  [pdf, other

    cs.DC

    FLASH 1.0: A Software Framework for Rapid Parallel Deployment and Enhancing Host Code Portability in Heterogeneous Computing

    Authors: Michael Riera, Masudul Hassan Quraishi, Erfan Bank Tavakoli, Fengbo Ren

    Abstract: This paper presents FLASH 1.0, a C++-based software framework for rapid parallel deployment and enhancing host code portability in heterogeneous computing. FLASH takes a novel approach in describing kernels and dynamically dispatching them in a hardware-agnostic manner. FLASH features truly hardware-agnostic frontend interfaces, which unify the compile-time control flow and enforce a portability-o… ▽ More

    Submitted 5 July, 2023; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: 10 pages

  4. arXiv:2101.02667  [pdf

    cs.AR cs.LG

    BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification

    Authors: Seyed Abolfazl Ghasemzadeh, Erfan Bank Tavakoli, Mehdi Kamal, Ali Afzali-Kusha, Massoud Pedram

    Abstract: In this paper, first, a hardware-friendly pruning algorithm for reducing energy consumption and improving the speed of Long Short-Term Memory (LSTM) neural network accelerators is presented. Next, an FPGA-based platform for efficient execution of the pruned networks based on the proposed algorithm is introduced. By considering the sensitivity of two weight matrices of the LSTM models in pruning, d… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 8 pages, 9 figures, 2 tables

  5. arXiv:2011.10896  [pdf, other

    cs.DC cs.CL cs.PF

    HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC

    Authors: Michael Riera, Erfan Bank Tavakoli, Masudul Hassan Quraishi, Fengbo Ren

    Abstract: This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification for enabling the performance portable execution of a hardware-agnostic host application across heterogeneous accelerators. The e… ▽ More

    Submitted 6 July, 2022; v1 submitted 21 November, 2020; originally announced November 2020.

    Comments: 37 pages

  6. arXiv:2011.09073  [pdf, other

    cs.AR cs.DC

    A Survey of System Architectures and Techniques for FPGA Virtualization

    Authors: Masudul Hassan Quraishi, Erfan Bank Tavakoli, Fengbo Ren

    Abstract: FPGA accelerators are gaining increasing attention in both cloud and edge computing because of their hardware flexibility, high computational throughput, and low power consumption. However, the design flow of FPGAs often requires specific knowledge of the underlying hardware, which hinders the wide adoption of FPGAs by application developers. Therefore, the virtualization of FPGAs becomes extremel… ▽ More

    Submitted 18 February, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: 15 pages