Skip to main content

Showing 1–22 of 22 results for author: Khan, F S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08486  [pdf, other

    eess.IV cs.CV

    On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models

    Authors: Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan

    Abstract: Volumetric medical segmentation models have achieved significant success on organ and tumor-based segmentation tasks in recent years. However, their vulnerability to adversarial attacks remains largely unexplored, raising serious concerns regarding the real-world deployment of tools employing such models in the healthcare sector. This underscores the importance of investigating the robustness of e… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.00449  [pdf, other

    eess.IV cs.CV

    Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging

    Authors: Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan

    Abstract: Deep unfolding methods have made impressive progress in restoring 3D hyperspectral images (HSIs) from 2D measurements through convolution neural networks or Transformers in spectral compressive imaging. However, they cannot efficiently capture long-range dependencies using global receptive fields, which significantly limits their performance in HSI reconstruction. Moreover, these methods may suffe… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

  3. arXiv:2307.07269  [pdf, other

    eess.IV cs.CV cs.LG

    Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

    Authors: Asif Hanif, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: It is imperative to ensure the robustness of deep learning models in critical applications such as, healthcare. While recent advances in deep learning have improved the performance of volumetric medical image segmentation models, these models cannot be deployed for real-world applications immediately due to their vulnerability to adversarial attacks. We present a 3D frequency domain adversarial at… ▽ More

    Submitted 20 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted in MICCAI 2023 conference

  4. arXiv:2306.14255  [pdf, other

    eess.IV cs.CV

    AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net

    Authors: Akib Mohammed Khan, Alif Ashrafee, Fahim Shahriar Khan, Md. Bakhtiar Hasan, Md. Hasanul Kabir

    Abstract: Manually inspecting polyps from a colonoscopy for colorectal cancer or performing a biopsy on skin lesions for skin cancer are time-consuming, laborious, and complex procedures. Automatic medical image segmentation aims to expedite this diagnosis process. However, numerous challenges exist due to significant variations in the appearance and sizes of objects with no distinct boundaries. This paper… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted in 2023 International Joint Conference on Neural Networks (IJCNN 2023)

  5. arXiv:2306.09320  [pdf, other

    eess.IV cs.CV

    Learnable Weight Initialization for Volumetric Medical Image Segmentation

    Authors: Shahina Kunhimon, Abdelrahman Shaker, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

    Abstract: Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nat… ▽ More

    Submitted 3 April, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at Elsevier AI in Medicine Journal

  6. arXiv:2305.16789  [pdf, other

    cs.LG cs.CV eess.SP

    Modulate Your Spectrum in Self-Supervised Learning

    Authors: Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan, Lei Huang

    Abstract: Whitening loss offers a theoretical guarantee against feature collapse in self-supervised learning (SSL) with joint embedding architectures. Typically, it involves a hard whitening approach, transforming the embedding and applying loss to the whitened output. In this work, we introduce Spectral Transformation (ST), a framework to modulate the spectrum of embedding and to seek for functions beyond… ▽ More

    Submitted 21 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. The code is available at https://github.com/winci-ai/intl

  7. arXiv:2304.03307  [pdf, other

    cs.CV eess.IV

    Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting

    Authors: Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Adopting contrastive image-text pretrained models like CLIP towards video classification has gained attention due to its cost-effectiveness and competitive performance. However, recent works in this area face a trade-off. Finetuning the pretrained model to achieve strong supervised performance results in low zero-shot generalization. Similarly, freezing the backbone to retain zero-shot capability… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR-2023. Codes/models available at https://github.com/TalalWasim/Vita-CLIP

  8. arXiv:2304.01992  [pdf, other

    eess.IV cs.CV

    Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification

    Authors: Amandeep Kumar, Ankan kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Fahad Shahbaz Khan

    Abstract: In this work, we propose a few-shot colorectal tissue image generation method for addressing the scarcity of histopathological training data for rare cancer tissues. Our few-shot generation method, named XM-GAN, takes one base and a pair of reference tissue images as input and generates high-quality yet diverse images. Within our XM-GAN, a novel controllable fusion block densely aggregates local r… ▽ More

    Submitted 4 July, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Early Accept in MICCAI 2023

  9. arXiv:2303.12073  [pdf, other

    eess.IV cs.CV

    3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers

    Authors: Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: Accurate 3D mitochondria instance segmentation in electron microscopy (EM) is a challenging problem and serves as a prerequisite to empirically analyze their distributions and morphology. Most existing approaches employ 3D convolutions to obtain representative features. However, these convolution-based approaches struggle to effectively capture long-range dependencies in the volume mitochondria da… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 5 Tables, 2 page references

  10. arXiv:2205.05675  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, **gyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, **shan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang , et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  11. arXiv:2205.01649  [pdf, other

    eess.IV cs.CV

    Learning Enriched Features for Fast Image Restoration and Enhancement

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: Given a degraded input image, image restoration aims to recover the missing high-quality image content. Numerous applications demand effective image restoration, e.g., computational photography, surveillance, autonomous vehicles, and remote sensing. Significant advances in image restoration have been made in recent years, dominated by convolutional neural networks (CNNs). The widely-used CNN-based… ▽ More

    Submitted 19 April, 2022; originally announced May 2022.

    Comments: This article supersedes arXiv:2003.06792. Accepted for publication in TPAMI

  12. arXiv:2204.10846  [pdf, other

    cs.CV eess.IV

    Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging

    Authors: Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah

    Abstract: We propose a novel self-supervised Video Object Segmentation (VOS) approach that strives to achieve better object-background discriminability for accurate object segmentation. Distinct from previous self-supervised VOS methods, our approach is based on a discriminative learning loss formulation that takes into account both object and background information to ensure object-background discriminabil… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  13. arXiv:2204.07756  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Visual Attention Methods in Deep Learning: An In-Depth Survey

    Authors: Mohammed Hassanin, Saeed Anwar, Ibrahim Radwan, Fahad S Khan, Ajmal Mian

    Abstract: Inspired by the human cognitive system, attention is a mechanism that imitates the human cognitive awareness about specific information, amplifying critical details to focus more on the essential aspects of data. Deep learning has employed attention to boost performance for many applications. Interestingly, the same attention design can suit processing different data modalities and can easily be i… ▽ More

    Submitted 5 May, 2024; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted in Information Fusion

  14. arXiv:2204.04218  [pdf, other

    eess.IV cs.CV cs.LG

    Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan

    Abstract: Super-resolving medical images can help physicians in providing more accurate diagnostics. In many situations, computed tomography (CT) or magnetic resonance imaging (MRI) techniques capture several scans (modes) during a single investigation, which can jointly be used (in a multimodal fashion) to further boost the quality of super-resolution results. To this end, we propose a novel multimodal mul… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at WACV 2023 (main paper + supplementary)

  15. arXiv:2201.09873  [pdf, other

    eess.IV cs.CV

    Transformers in Medical Imaging: A Survey

    Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu

    Abstract: Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators. Capitalizing on these advances in computer vision, the medical imaging field has also witnessed growin… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 41 pages, \url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}

  16. CyTran: A Cycle-Consistent Transformer with Multi-Level Consistency for Non-Contrast to Contrast CT Translation

    Authors: Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, Radu Tudor Ionescu

    Abstract: We propose a novel approach to translate unpaired contrast computed tomography (CT) scans to non-contrast CT scans and the other way around. Solving this task has two important applications: (i) to automatically generate contrast CT scans for patients for whom injecting contrast substance is not an option, and (ii) to enhance the alignment between contrast and non-contrast CT by reducing the diffe… ▽ More

    Submitted 5 April, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Neurocomputing

  17. arXiv:2012.02776  [pdf, other

    cs.CV cs.LG eess.IV

    Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

    Authors: Wencheng Han, ** Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

    Abstract: Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, ha… ▽ More

    Submitted 30 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted by CVPR2021

  18. arXiv:2011.07491  [pdf, other

    cs.CV cs.LG eess.IV

    Anomaly Detection in Video via Self-Supervised and Multi-Task Learning

    Authors: Mariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Anomaly detection in video is a challenging computer vision problem. Due to the lack of anomalous events at training time, anomaly detection requires the design of learning methods without full supervision. In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level. We first utilize a pre-trained detector to detect objects. The… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted at CVPR 2021. Main paper and supplementary are both included

  19. A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years. The complexity of the task arises from the commonly-adopted definition of an abnormal event, that is, a rarely occurring event that typically depends on the surrounding context. Following the standard formulation of abnormal event detection as outlier detection, we propo… ▽ More

    Submitted 6 April, 2023; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  20. arXiv:2008.10774  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Image Colorization: A Survey and Dataset

    Authors: Saeed Anwar, Muhammad Tahir, Chongyi Li, Ajmal Mian, Fahad Shahbaz Khan, Abdul Wahab Muzaffar

    Abstract: Image colorization is the process of estimating RGB colors for grayscale images or video frames to improve their aesthetic and perceptual quality. Deep learning techniques for image colorization have progressed notably over the last decade, calling the need for a systematic survey and benchmarking of these techniques. This article presents a comprehensive survey of recent state-of-the-art deep lea… ▽ More

    Submitted 26 January, 2022; v1 submitted 24 August, 2020; originally announced August 2020.

  21. arXiv:2003.08798  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Incremental Object Detection via Meta-Learning

    Authors: K J Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting. We note that alt… ▽ More

    Submitted 15 December, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis & Machine Intelligence, Nov 2021. Code is available in https://github.com/JosephKJ/iOD

    Journal ref: TPAMI, Nov 2021

  22. arXiv:2003.07761  [pdf, other

    eess.IV cs.CV

    CycleISP: Real Image Restoration via Improved Data Synthesis

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: The availability of large-scale datasets has helped unleash the true potential of deep convolutional neural networks (CNNs). However, for the single-image denoising problem, capturing a real dataset is an unacceptably expensive and cumbersome procedure. Consequently, image denoising algorithms are mostly developed and evaluated on synthetic data that is usually generated with a widespread assumpti… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: CVPR 2020 (Oral)