Showing 1–24 of 24 results for author: Kazerouni, A

Searching in archive cs.
  1. arXiv:2406.17815

    cs.CV cs.AI

    SUM: Saliency Unification through Mamba for Visual Attention Modeling

    Authors: Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan, Michael Brudno, Babak Taati

    Abstract: Visual attention modeling, important for interpreting and prioritizing visual stimuli, plays a significant role in applications such as marketing, multimedia, and robotics. Traditional saliency prediction models, especially those based on Convolutional Neural Networks (CNNs) or Transformers, achieve notable success by leveraging large-scale annotated datasets. However, the current state-of-the-art…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2403.19882

    eess.IV cs.CV cs.LG

    Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights

    Authors: Moein Heidari, Reza Azad, Sina Ghorbani Kolahi, René Arimond, Leon Niggemeier, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Intrigued by the inherent ability of the human visual system to identify salient regions in complex scenes, attention mechanisms have been seamlessly integrated into various Computer Vision (CV) tasks. Building upon this paradigm, Vision Transformer (ViT) networks exploit attention mechanisms for improved efficiency. This review navigates the landscape of redesigned attention mechanisms within ViT…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Submitted to Computational Visual Media Journal

  3. arXiv:2311.13069

    cs.CV

    FuseNet: Self-Supervised Dual-Path Network for Medical Image Segmentation

    Authors: Amirhossein Kazerouni, Sanaz Karimijafarbigloo, Reza Azad, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Semantic segmentation, a crucial task in computer vision, often relies on labor-intensive and costly annotated datasets for training. In response to this challenge, we introduce FuseNet, a dual-stream framework for self-supervised semantic segmentation that eliminates the need for manual annotation. FuseNet leverages the shared semantic dependencies between the original and augmented images to cre…

    Submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2310.18846

    cs.CV

    INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings

    Authors: Amirhossein Kazerouni, Reza Azad, Alireza Hosseini, Dorit Merhof, Ulas Bagci

    Abstract: Implicit Neural Representations (INRs) have revolutionized signal representation by leveraging neural networks to provide continuous and smooth representations of complex data. However, existing INRs face limitations in capturing fine-grained details, handling noise, and adapting to diverse signal types. To address these challenges, we introduce INCODE, a novel approach that enhances the control o…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at WACV 2024 conference

  5. arXiv:2310.18689

    cs.CV

    Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision

    Authors: Bobby Azad, Reza Azad, Sania Eskandari, Afshin Bozorgpour, Amirhossein Kazerouni, Islem Rekik, Dorit Merhof

    Abstract: Foundation models, large-scale, pre-trained deep-learning models adapted to a wide range of downstream tasks, have gained significant interest lately in various deep-learning problems undergoing a paradigm shift with the rise of these models. Trained on large-scale datasets to bridge the gap between different modalities, foundation models facilitate contextual reasoning, generalization, and prompt c…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: The paper is currently in the process of being prepared for submission to MIA

  6. arXiv:2309.00143

    cs.CV

    Self-supervised Semantic Segmentation: Consistency over Transformation

    Authors: Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Accurate medical image segmentation is of utmost importance for enabling automated clinical decision procedures. However, prevailing supervised deep learning approaches for medical image segmentation encounter significant challenges due to their heavy dependence on extensive labeled training data. To tackle this issue, we propose a novel self-supervised algorithm, S$^3$-Net, which integra…

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted in ICCV 2023 workshop CVAMD

  7. arXiv:2309.00121

    cs.CV

    Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation

    Authors: Reza Azad, Leon Niggemeier, Michael Huttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Medical image segmentation has seen significant improvements with transformer models, which excel in grasping far-reaching contexts and global contextual information. However, the increasing computational demands of these models, proportional to the squared token count, limit their depth and resolution capabilities. Most current methods process 3D volumetric image data slice-by-slice (called pseudo…

    Submitted 31 August, 2023; originally announced September 2023.

  8. arXiv:2309.00108

    cs.CV

    Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection

    Authors: Reza Azad, Amirhossein Kazerouni, Babak Azad, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to the Convolutional Neural Network (CNN) models, it has been observed that the ViT models struggle to capture high-frequency components of images, which can limit their ability to detect local textures and edge information. As abnormalities in human tissue, such as tumors a…

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted in the main conference MICCAI 2023

  9. arXiv:2308.13442

    cs.CV

    Unlocking Fine-Grained Details with Wavelet-based High-Frequency Enhancement in Transformers

    Authors: Reza Azad, Amirhossein Kazerouni, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Abin Jose, Dorit Merhof

    Abstract: Medical image segmentation is a critical task that plays a vital role in diagnosis, treatment planning, and disease monitoring. Accurate segmentation of anatomical structures and abnormalities from medical images can aid in the early detection and treatment of various diseases. In this paper, we address the local feature deficiency of the Transformer model by carefully re-designing the self-attent…

    Submitted 12 September, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted in MICCAI 2023 workshop MLMI

    Journal ref: MICCAI 2023 workshop

  10. arXiv:2308.02959

    eess.IV cs.CV

    DermoSegDiff: A Boundary-aware Segmentation Diffusion Model for Skin Lesion Delineation

    Authors: Afshin Bozorgpour, Yousef Sadegheih, Amirhossein Kazerouni, Reza Azad, Dorit Merhof

    Abstract: Skin lesion segmentation plays a critical role in the early detection and accurate diagnosis of dermatological conditions. Denoising Diffusion Probabilistic Models (DDPMs) have recently gained attention for their exceptional image-generation capabilities. Building on these advancements, we propose DermoSegDiff, a novel framework for skin lesion segmentation that incorporates boundary information d…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: MICCAI workshop PRIME

  11. arXiv:2307.16142

    eess.IV cs.CV

    Implicit Neural Representation in Medical Imaging: A Comparative Survey

    Authors: Amirali Molaei, Amirhossein Aminimehr, Armin Tavakoli, Amirhossein Kazerouni, Bobby Azad, Reza Azad, Dorit Merhof

    Abstract: Implicit neural representations (INRs) have gained prominence as a powerful paradigm in scene reconstruction and computer graphics, demonstrating remarkable results. By utilizing neural networks to parameterize data through implicit continuous functions, INRs offer several benefits. Recognizing the potential of INRs beyond these domains, this survey aims to provide a comprehensive overview of INR…

    Submitted 30 July, 2023; originally announced July 2023.

  12. arXiv:2303.17648

    cs.LG

    Practical Policy Optimization with Personalized Experimentation

    Authors: Mia Garrard, Hanson Wang, Ben Letham, Shaun Singh, Abbas Kazerouni, Sarah Tan, Zehui Wang, Yin Huang, Yichun Hu, Chad Zhou, Norm Zhou, Eytan Bakshy

    Abstract: Many organizations measure treatment effects via an experimentation platform to evaluate the causal effect of product variations prior to full-scale deployment. However, standard experimentation platforms do not perform optimally for end user populations that exhibit heterogeneous treatment effects (HTEs). Here we present a personalized experimentation framework, Personalized Experiments (PEX), wh…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 5 pages, 2 figures

  13. arXiv:2303.16710

    cs.CV cs.RO

    An intelligent modular real-time vision-based system for environment perception

    Authors: Amirhossein Kazerouni, Amirhossein Heydarian, Milad Soltany, Aida Mohammadshahi, Abbas Omidi, Saeed Ebadollahi

    Abstract: A significant portion of driving hazards is caused by human error and disregard for local driving regulations; consequently, an intelligent assistance system can be beneficial. This paper proposes a novel vision-based modular package to ensure drivers' safety by perceiving the environment. Each module is designed based on accuracy and inference time to deliver real-time performance. As a result, t…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted in NeurIPS 2022 Workshop on Machine Learning for Autonomous Driving

  14. arXiv:2301.03505

    cs.CV

    Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review

    Authors: Reza Azad, Amirhossein Kazerouni, Moein Heidari, Ehsan Khodapanah Aghdam, Amirali Molaei, Yiwei Jia, Abin Jose, Rijo Roy, Dorit Merhof

    Abstract: The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision…

    Submitted 5 November, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: https://www.sciencedirect.com/science/article/abs/pii/S1361841523002608

  15. arXiv:2212.13504

    cs.CV

    DAE-Former: Dual Attention-guided Efficient Transformer for Medical Image Segmentation

    Authors: Reza Azad, René Arimond, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Dorit Merhof

    Abstract: Transformers have recently gained attention in the computer vision domain due to their ability to model long-range dependencies. However, the self-attention mechanism, which is the core part of the Transformer model, usually suffers from quadratic computational complexity with respect to the number of tokens. Many architectures attempt to reduce model complexity by limiting the self-attention mech…

    Submitted 26 July, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: MICCAI 2023 PRIME workshop

  16. arXiv:2211.07804

    eess.IV cs.CV

    Diffusion Models for Medical Image Analysis: A Comprehensive Survey

    Authors: Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Moein Heidari, Reza Azad, Mohsen Fayyaz, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Denoising diffusion models, a class of generative models, have garnered immense interest lately in various deep-learning problems. A diffusion probabilistic model defines a forward diffusion stage where the input data is gradually perturbed over several steps by adding Gaussian noise and then learns to reverse the diffusion process to retrieve the desired noise-free data from noisy data samples. D…

    Submitted 3 June, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Third revision: including more papers and further discussions

  17. arXiv:2207.08518

    cs.CV cs.AI

    HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

    Authors: Moein Heidari, Amirhossein Kazerouni, Milad Soltany, Reza Azad, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof

    Abstract: Convolutional neural networks (CNNs) have been the consensus for medical image segmentation tasks. However, they suffer from the limitation in modeling long-range dependencies and spatial correlations due to the nature of convolution operation. Although transformers were first developed to address this issue, they fail to capture low-level features. In contrast, it is demonstrated that both local…

    Submitted 9 January, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: The accepted version of the paper at WACV 2023

    Journal ref: WACV 2023

  18. arXiv:2110.04124

    cs.LG eess.IV eess.SP

    Ensemble Neural Representation Networks

    Authors: Milad Soltany Kadarvish, Hesam Mojtahedi, Hossein Entezari Zarch, Amirhossein Kazerouni, Alireza Morsali, Azra Abtahi, Farokh Marvasti

    Abstract: Implicit Neural Representation (INR) has recently attracted considerable attention for storing various types of signals in continuous forms. The existing INR networks require lengthy training processes and high-performance computational resources. In this paper, we propose a novel sub-optimal ensemble architecture for INR that resolves the aforementioned problems. In this architecture, the represe…

    Submitted 15 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: IEEE Signal Processing Letters submitted, 5 pages, 6 figures, 2 tables

  19. arXiv:2011.01488

    cs.LG cs.AI

    Multi-armed Bandits with Cost Subsidy

    Authors: Deeksha Sinha, Karthik Abinav Sankararama, Abbas Kazerouni, Vashist Avadhanula

    Abstract: In this paper, we consider a novel variant of the multi-armed bandit (MAB) problem, MAB with cost subsidy, which models many real-life applications where the learning agent has to pay to select an arm and is concerned about optimizing cumulative costs and rewards. We present two applications, intelligent SMS routing problem and ad audience optimization problem faced by several businesses (especial…

    Submitted 15 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

  20. arXiv:2005.11442

    cs.LG stat.ML

    Active Learning for Skewed Data Sets

    Authors: Abbas Kazerouni, Qi Zhao, Jing Xie, Sandeep Tata, Marc Najork

    Abstract: Consider a sequential active learning problem where, at each round, an agent selects a batch of unlabeled data points, queries their labels and updates a binary classifier. While there exists a rich body of work on active learning in this general form, in this paper, we focus on problems with two distinguishing characteristics: severe class imbalance (skew) and small amounts of initial training da…

    Submitted 22 May, 2020; originally announced May 2020.

  21. arXiv:1905.08224

    cs.LG stat.ML

    Best Arm Identification in Generalized Linear Bandits

    Authors: Abbas Kazerouni, Lawrence M. Wein

    Abstract: Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls…

    Submitted 20 May, 2019; originally announced May 2019.

  22. arXiv:1708.09020

    cs.GT cs.AI

    Learning to Price with Reference Effects

    Authors: Abbas Kazerouni, Benjamin Van Roy

    Abstract: As a firm varies the price of a product, consumers exhibit reference effects, making purchase decisions based not only on the prevailing price but also the product's price history. We consider the problem of learning such behavioral patterns as a monopolist releases, markets, and prices products. This context calls for pricing decisions that intelligently trade off between maximizing revenue gener…

    Submitted 29 August, 2017; originally announced August 2017.

  23. arXiv:1707.02038

    cs.LG

    A Tutorial on Thompson Sampling

    Authors: Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen

    Abstract: Thompson sampling is an algorithm for online decision problems where actions are taken sequentially in a manner that must balance between exploiting what is known to maximize immediate performance and investing to accumulate new information that may improve future performance. The algorithm addresses a broad range of problems in a computationally efficient manner and is therefore enjoying wide use…

    Submitted 14 July, 2020; v1 submitted 7 July, 2017; originally announced July 2017.

    Journal ref: Foundations and Trends in Machine Learning, Vol. 11, No. 1, pp. 1-96, 2018

  24. arXiv:1611.06426

    stat.ML cs.LG

    Conservative Contextual Linear Bandits

    Authors: Abbas Kazerouni, Mohammad Ghavamzadeh, Yasin Abbasi-Yadkori, Benjamin Van Roy

    Abstract: Safety is a desirable property that can immensely increase the applicability of learning algorithms in real-world decision-making problems. It is much easier for a company to deploy an algorithm that is safe, i.e., guaranteed to perform at least as well as a baseline. In this paper, we study the issue of safety in contextual linear bandits that have application in many different fields including p…

    Submitted 3 March, 2017; v1 submitted 19 November, 2016; originally announced November 2016.