Skip to main content

Showing 1–12 of 12 results for author: Shokouhi, S B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01328  [pdf

    cs.CV

    CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes

    Authors: Danial Qashqai, Emad Mousavian, Shahriar Baradaran Shokouhi, Sattar Mirzakuchaki

    Abstract: Semantic segmentation, as a crucial component of complex visual interpretation, plays a fundamental role in autonomous vehicle vision systems. Recent studies have significantly improved the accuracy of semantic segmentation by exploiting complementary information and develo** multimodal methods. Despite the gains in accuracy, multimodal semantic segmentation methods suffer from high computationa… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2311.06651  [pdf, other

    cs.CV

    Traffic Sign Recognition Using Local Vision Transformer

    Authors: Ali Farzipour, Omid Nejati Manzari, Shahriar B. Shokouhi

    Abstract: Recognition of traffic signs is a crucial aspect of self-driving cars and driver assistance systems, and machine vision tasks such as traffic sign recognition have gained significant attention. CNNs have been frequently used in machine vision, but introducing vision transformers has provided an alternative approach to global feature learning. This paper proposes a new novel model that blends the a… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  3. MedViT: A Robust Vision Transformer for Generalized Medical Image Classification

    Authors: Omid Nejati Manzari, Hamid Ahmadabadi, Hossein Kashiani, Shahriar B. Shokouhi, Ahmad Ayatollahi

    Abstract: Convolutional Neural Networks (CNNs) have advanced existing medical systems for automatic disease diagnosis. However, there are still concerns about the reliability of deep medical diagnosis systems against the potential threats of adversarial attacks since inaccurate diagnosis could lead to disastrous consequences in the safety realm. In this study, we propose a highly robust yet efficient CNN-Tr… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Journal ref: Computers in Biology and Medicine 2023

  4. arXiv:2301.11553  [pdf, other

    cs.CV

    Robust Transformer with Locality Inductive Bias and Feature Normalization

    Authors: Omid Nejati Manzari, Hossein Kashiani, Hojat Asgarian Dehkordi, Shahriar Baradaran Shokouhi

    Abstract: Vision transformers have been demonstrated to yield state-of-the-art results on a variety of computer vision tasks using attention-based networks. However, research works in transformers mostly do not investigate robustness/accuracy trade-off, and they still struggle to handle adversarial perturbations. In this paper, we explore the robustness of vision transformers against adversarial perturbatio… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 9 pages, 3 Figures, 6 Tables

    Journal ref: Engineering Science and Technology, an International Journal, 2023

  5. arXiv:2207.06067  [pdf, other

    cs.CV

    Pyramid Transformer for Traffic Sign Detection

    Authors: Omid Nejati Manzari, Amin Boudesh, Shahriar B. Shokouhi

    Abstract: Traffic sign detection is a vital task in the visual system of self-driving cars and the automated driving system. Recently, novel Transformer-based models have achieved encouraging results for various computer vision tasks. We still observed that vanilla ViT could not yield satisfactory results in traffic sign detection because the overall size of the datasets is very small and the class distribu… ▽ More

    Submitted 22 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  6. arXiv:2112.07015  [pdf, other

    cs.CV cs.HC

    Multi-Expert Human Action Recognition with Hierarchical Super-Class Learning

    Authors: Hojat Asgarian Dehkordi, Ali Soltani Nezhad, Hossein Kashiani, Shahriar Baradaran Shokouhi, Ahmad Ayatollahi

    Abstract: In still image human action recognition, existing studies have mainly leveraged extra bounding box information along with class labels to mitigate the lack of temporal information in still images; however, preparing extra data with manual annotation is time-consuming and also prone to human errors. Moreover, the existing studies have not addressed action recognition with long-tailed distribution.… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 47 pages

  7. arXiv:2102.03932  [pdf

    eess.IV cs.CV cs.LG

    Automatic Breast Lesion Detection in Ultrafast DCE-MRI Using Deep Learning

    Authors: Fazael Ayatollahi, Shahriar B. Shokouhi, Ritse M. Mann, Jonas Teuwen

    Abstract: Purpose: We propose a deep learning-based computer-aided detection (CADe) method to detect breast lesions in ultrafast DCE-MRI sequences. This method uses both the three-dimensional spatial information and temporal information obtained from the early-phase of the dynamic acquisition. Methods: The proposed CADe method, based on a modified 3D RetinaNet model, operates on ultrafast T1 weighted sequen… ▽ More

    Submitted 15 August, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Journal ref: Medical physics vol. 48,10 (2021): 5897-5907

  8. arXiv:2008.09891  [pdf, other

    cs.CV

    Online Visual Tracking with One-Shot Context-Aware Domain Adaptation

    Authors: Hossein Kashiani, Amir Abbas Hamidi Imani, Shahriar Baradaran Shokouhi, Ahmad Ayatollahi

    Abstract: Online learning policy makes visual trackers more robust against different distortions through learning domain-specific cues. However, the trackers adopting this policy fail to fully leverage the discriminative context of the background areas. Moreover, owing to the lack of sufficient data at each time step, the online learning approach can also make the trackers prone to over-fitting to the backg… ▽ More

    Submitted 17 April, 2021; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: 36 pages, 1 algorithm, 8 figures, 1 table

  9. Ensembles of Deep Neural Networks for Action Recognition in Still Images

    Authors: Sina Mohammadi, Sina Ghofrani Majelan, Shahriar B. Shokouhi

    Abstract: Despite the fact that notable improvements have been made recently in the field of feature extraction and classification, human action recognition is still challenging, especially in images, in which, unlike videos, there is no motion. Thus, the methods proposed for recognizing human actions in videos cannot be applied to still images. A big challenge in action recognition in still images is the l… ▽ More

    Submitted 22 March, 2020; originally announced March 2020.

    Comments: 5 pages, 2 figures, 3 tables, Accepted by ICCKE 2019

    Journal ref: 2019 9th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran, 2019, pp. 315-318

  10. arXiv:1810.00119  [pdf, other

    cs.CV

    Visual Object Tracking based on Adaptive Siamese and Motion Estimation Network

    Authors: Hossein Kashiani, Shahriar B. Shokouhi

    Abstract: Recently, convolutional neural network (CNN) has attracted much attention in different areas of computer vision, due to its powerful abstract feature representation. Visual object tracking is one of the interesting and important areas in computer vision that achieves remarkable improvements in recent years. In this work, we aim to improve both the motion and observation models in visual object tra… ▽ More

    Submitted 28 September, 2018; originally announced October 2018.

    Comments: 28 pages, 1 algorithm, 7 figures, 2 table, Submitted to Elsevier, Image and Vision Computing

  11. Patchwise object tracking via structural local sparse appearance model

    Authors: Hossein Kashiyani, Shahriar B. Shokouhi

    Abstract: In this paper, we propose a robust visual tracking method which exploits the relationships of targets in adjacent frames using patchwise joint sparse representation. Two sets of overlap** patches with different sizes are extracted from target candidates to construct two dictionaries with consideration of joint sparse representation. By applying this representation into structural sparse appearan… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: 6 pages, 3 figures, Accepted by ICCKE 2017

  12. arXiv:1209.1949  [pdf

    cs.CR cs.MM cs.SE

    Improved Robust DWT-Watermarking in YCbCr Color Space

    Authors: Atefeh Elahian, Mehdi Khalili, Shahriar Baradaran Shokouhi

    Abstract: Digital watermarking is an effective way to protect copyright. In this paper, a robust watermarking algorithm based on wavelet transformation is proposed which can confirm the copyright without original image. The wavelet transformation technique is effective in image analyzing and processing. Thus the color-image watermark algorithm based on discrete wavelet transformation (DWT) begins to draw an… ▽ More

    Submitted 10 September, 2012; originally announced September 2012.

    Comments: 5 Pages, 4 Figures, 3 Tables

    MSC Class: 68U10; 68U20; 65C20; 94A08; 94A24; 94A60; 11T71; 14G50; 68P25; 81P94 ACM Class: D.4.6; K.6.5; K.4.2

    Journal ref: Global journal of Computer Application and Technology (GJCAT), Vol.1, No.3, 2011, Pages 300-304