Skip to main content

Showing 1–18 of 18 results for author: Taghanaki, S A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.19050  [pdf, other

    cs.LG cs.AI

    Detecting Generative Parroting through Overfitting Masked Autoencoders

    Authors: Saeid Asgari Taghanaki, Joseph Lambourne

    Abstract: The advent of generative AI models has revolutionized digital content creation, yet it introduces challenges in maintaining copyright integrity due to generative parroting, where models mimic their training data too closely. Our research presents a novel approach to tackle this issue by employing an overfitted Masked Autoencoder (MAE) to detect such parroted samples effectively. We establish a det… ▽ More

    Submitted 19 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024, Responsible Generative AI workshop

  2. arXiv:2309.03179  [pdf, other

    cs.CV cs.LG

    SLiMe: Segment Like Me

    Authors: Aliasghar Khani, Saeid Asgari Taghanaki, Aditya Sanghi, Ali Mahdavi Amiri, Ghassan Hamarneh

    Abstract: Significant strides have been made using large vision-language models, like Stable Diffusion (SD), for a variety of downstream tasks, including image editing, image correspondence, and 3D shape generation. Inspired by these advancements, we explore leveraging these extensive vision-language models for segmenting images at any desired granularity using as few as one annotated sample by proposing SL… ▽ More

    Submitted 14 March, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

  3. arXiv:2309.00733  [pdf, other

    cs.CV cs.LG

    TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models

    Authors: Saeid Asgari Taghanaki, Aliasghar Khani, Ali Saheb Pasand, Amir Khasahmadi, Aditya Sanghi, Karl D. D. Willis, Ali Mahdavi-Amiri

    Abstract: Interpreting the learned features of vision models has posed a longstanding challenge in the field of machine learning. To address this issue, we propose a novel method that leverages the capabilities of language models to interpret the learned features of pre-trained image classifiers. Our method, called TExplain, tackles this task by training a neural network to establish a connection between th… ▽ More

    Submitted 1 May, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted to ICLR 2024, Reliable and Responsible Foundation Models workshop

  4. arXiv:2307.03869  [pdf, other

    cs.CV

    Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation

    Authors: Aditya Sanghi, Pradeep Kumar Jayaraman, Arianna Rampini, Joseph Lambourne, Hooman Shayani, Evan Atherton, Saeid Asgari Taghanaki

    Abstract: Significant progress has recently been made in creative applications of large pre-trained models for downstream tasks in 3D vision, such as text-to-shape generation. This motivates our investigation of how these pre-trained models can be used effectively to generate 3D shapes from sketches, which has largely remained an open challenge due to the limited sketch-shape paired datasets and the varying… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  5. arXiv:2210.00055  [pdf, other

    cs.LG cs.CV

    MaskTune: Mitigating Spurious Correlations by Forcing to Explore

    Authors: Saeid Asgari Taghanaki, Aliasghar Khani, Fereshte Khani, Ali Gholami, Linh Tran, Ali Mahdavi-Amiri, Ghassan Hamarneh

    Abstract: A fundamental challenge of over-parameterized deep learning models is learning meaningful data representations that yield good performance on a downstream task without over-fitting spurious input features. This work proposes MaskTune, a masking strategy that prevents over-reliance on spurious (or a limited number of) features. MaskTune forces the trained model to explore new features during a sing… ▽ More

    Submitted 8 October, 2022; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  6. arXiv:2207.01548  [pdf, other

    cs.LG cs.CV

    Counterbalancing Teacher: Regularizing Batch Normalized Models for Robustness

    Authors: Saeid Asgari Taghanaki, Ali Gholami, Fereshte Khani, Kristy Choi, Linh Tran, Ran Zhang, Aliasghar Khani

    Abstract: Batch normalization (BN) is a ubiquitous technique for training deep neural networks that accelerates their convergence to reach higher accuracy. However, we demonstrate that BN comes with a fundamental drawback: it incentivizes the model to rely on low-variance features that are highly specific to the training (in-domain) data, hurting generalization performance on out-of-domain examples. In this… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  7. arXiv:2106.06620  [pdf, other

    cs.LG cs.CV

    Robust Representation Learning via Perceptual Similarity Metrics

    Authors: Saeid Asgari Taghanaki, Kristy Choi, Amir Khasahmadi, Anirudh Goyal

    Abstract: A fundamental challenge in artificial intelligence is learning useful representations of data that yield good performance on a downstream task, without overfitting to spurious input features. Extracting such task-relevant predictive information is particularly difficult for real-world datasets. In this work, we propose Contrastive Input Morphing (CIM), a representation learning framework that lear… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted to ICML 2021

  8. arXiv:2011.11572  [pdf, other

    cs.CV

    RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud Classifiers

    Authors: Saeid Asgari Taghanaki, Jieliang Luo, Ran Zhang, Ye Wang, Pradeep Kumar Jayaraman, Krishna Murthy Jatavallabhula

    Abstract: The 3D deep learning community has seen significant strides in pointcloud processing over the last few years. However, the datasets on which deep models have been trained have largely remained the same. Most datasets comprise clean, clutter-free pointclouds canonicalized for pose. Models trained on these datasets fail in uninterpretible and unintuitive ways when presented with data that contains t… ▽ More

    Submitted 16 April, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Published at the Robust and Reliable Machine Learning in the Real World Workshop, ICLR 2021

  9. arXiv:2007.04525  [pdf, other

    cs.CV cs.LG

    PointMask: Towards Interpretable and Bias-Resilient Point Cloud Processing

    Authors: Saeid Asgari Taghanaki, Kaveh Hassani, Pradeep Kumar Jayaraman, Amir Hosein Khasahmadi, Tonya Custis

    Abstract: Deep classifiers tend to associate a few discriminative input variables with their objective function, which in turn, may hurt their generalization capabilities. To address this, one can design systematic experiments and/or inspect the models via interpretability methods. In this paper, we investigate both of these strategies on deep models operating on point clouds. We propose PointMask, a model-… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: Accepted to ICML 2020 WHI

  10. arXiv:2005.05496  [pdf, other

    cs.LG cs.CV stat.ML

    Jigsaw-VAE: Towards Balancing Features in Variational Autoencoders

    Authors: Saeid Asgari Taghanaki, Mohammad Havaei, Alex Lamb, Aditya Sanghi, Ara Danielyan, Tonya Custis

    Abstract: The latent variables learned by VAEs have seen considerable interest as an unsupervised way of extracting features, which can then be used for downstream tasks. There is a growing interest in the question of whether features learned on one environment will generalize across different environments. We demonstrate here that VAE latent variables often focus on some factors of variation at the expense… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

  11. arXiv:1911.07086  [pdf, other

    cs.CV cs.LG eess.IV

    Signed Input Regularization

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Ghassan Hamarneh

    Abstract: Over-parameterized deep models usually over-fit to a given training distribution, which makes them sensitive to small changes and out-of-distribution samples at inference time, leading to low generalization performance. To this end, several model-based and randomized data-dependent regularization methods are applied, such as data augmentation, which prevents a model from memorizing the training di… ▽ More

    Submitted 11 December, 2019; v1 submitted 16 November, 2019; originally announced November 2019.

  12. arXiv:1910.07655  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Semantic Segmentation of Natural and Medical Images: A Review

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Joseph Paul Cohen, Julien Cohen-Adad, Ghassan Hamarneh

    Abstract: The semantic image segmentation task consists of classifying each pixel of an image into an instance, where each instance corresponds to a class. This task is a part of the concept of scene understanding or better explaining the global context of an image. In the medical image analysis domain, image segmentation can be used for image-guided interventions, radiotherapy, or improved radiological dia… ▽ More

    Submitted 30 March, 2024; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 45 pages, 16 figures. Accepted for publication in Springer Artificial Intelligence Review

  13. arXiv:1904.02307  [pdf, other

    cs.CV

    Improved Inference via Deep Input Transfer

    Authors: Saied Asgari Taghanaki, Kumar Abhishek, Ghassan Hamarneh

    Abstract: Although numerous improvements have been made in the field of image segmentation using convolutional neural networks, the majority of these improvements rely on training with larger datasets, model architecture modifications, novel loss functions, and better optimizers. In this paper, we propose a new segmentation performance boosting paradigm that relies on optimally modifying the network's input… ▽ More

    Submitted 10 July, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted to MICCAI 2019

    Journal ref: MICCAI 2019

  14. arXiv:1903.11741  [pdf, other

    cs.CV

    InfoMask: Masked Variational Latent Representation to Localize Chest Disease

    Authors: Saeid Asgari Taghanaki, Mohammad Havaei, Tess Berthier, Francis Dutil, Lisa Di Jorio, Ghassan Hamarneh, Yoshua Bengio

    Abstract: The scarcity of richly annotated medical images is limiting supervised deep learning based solutions to medical image analysis tasks, such as localizing discriminatory radiomic disease signatures. Therefore, it is desirable to leverage unsupervised and weakly supervised models. Most recent weakly supervised localization methods apply attention maps or region proposals in a multiple instance learni… ▽ More

    Submitted 6 June, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: Accepted to MICCAI 2019

  15. arXiv:1903.01015  [pdf, other

    cs.CV

    A Kernelized Manifold Map** to Diminish the Effect of Adversarial Perturbations

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Shekoofeh Azizi, Ghassan Hamarneh

    Abstract: The linear and non-flexible nature of deep convolutional models makes them vulnerable to carefully crafted adversarial perturbations. To tackle this problem, we propose a non-linear radial basis convolutional feature map** by learning a Mahalanobis-like distance function. Our method then maps the convolutional features onto a linearly well-separated manifold, which prevents small adversarial per… ▽ More

    Submitted 8 May, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019. 10 pages, 6 figures

  16. arXiv:1807.02905  [pdf, other

    cs.CV

    Vulnerability Analysis of Chest X-Ray Image Classification Against Adversarial Attacks

    Authors: Saeid Asgari Taghanaki, Arkadeep Das, Ghassan Hamarneh

    Abstract: Recently, there have been several successful deep learning approaches for automatically classifying chest X-ray images into different disease categories. However, there is not yet a comprehensive vulnerability analysis of these models against the so-called adversarial perturbations/attacks, which makes deep models more trustful in clinical practices. In this paper, we extensively analyzed the perf… ▽ More

    Submitted 28 July, 2018; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: Accepted in MICCAI, DLF, 2018

  17. arXiv:1805.02798  [pdf, other

    cs.CV

    Combo Loss: Handling Input and Output Imbalance in Multi-Organ Segmentation

    Authors: Saeid Asgari Taghanaki, Yefeng Zheng, S. Kevin Zhou, Bogdan Georgescu, Puneet Sharma, Daguang Xu, Dorin Comaniciu, Ghassan Hamarneh

    Abstract: Simultaneous segmentation of multiple organs from different medical imaging modalities is a crucial task as it can be utilized for computer-aided diagnosis, computer-assisted surgery, and therapy planning. Thanks to the recent advances in deep learning, several deep neural networks for medical image segmentation have been introduced successfully for this purpose. In this paper, we focus on learnin… ▽ More

    Submitted 15 September, 2021; v1 submitted 7 May, 2018; originally announced May 2018.

  18. arXiv:1804.05181  [pdf, other

    cs.CV

    Select, Attend, and Transfer: Light, Learnable Skip Connections

    Authors: Saeid Asgari Taghanaki, Aicha Bentaieb, Anmol Sharma, S. Kevin Zhou, Yefeng Zheng, Bogdan Georgescu, Puneet Sharma, Sasa Grbic, Zhoubing Xu, Dorin Comaniciu, Ghassan Hamarneh

    Abstract: Skip connections in deep networks have improved both segmentation and classification performance by facilitating the training of deeper network architectures, and reducing the risks for vanishing gradients. They equip encoder-decoder-like networks with richer feature representations, but at the cost of higher memory usage, computation, and possibly resulting in transferring non-discriminative feat… ▽ More

    Submitted 2 May, 2018; v1 submitted 14 April, 2018; originally announced April 2018.