Skip to main content

Showing 1–17 of 17 results for author: Koohpayegani, S A

.
  1. arXiv:2406.19301  [pdf, other

    cs.LG

    MCNC: Manifold Constrained Network Compression

    Authors: Chayne Thrash, Ali Abbasi, Parsa Nooralinejad, Soroush Abbasi Koohpayegani, Reed Andreas, Hamed Pirsiavash, Soheil Kolouri

    Abstract: The outstanding performance of large foundational models across diverse tasks-from computer vision to speech and natural language processing-has significantly increased their demand. However, storing and transmitting these models pose significant challenges due to their massive size (e.g., 350GB for GPT-3). Recent literature has focused on compressing the original weights or reducing the number of… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2312.02548  [pdf, other

    cs.CV

    GeNIe: Generative Hard Negative Images Through Diffusion

    Authors: Soroush Abbasi Koohpayegani, Anuj Singh, K L Navaneet, Hadi Jamali-Rad, Hamed Pirsiavash

    Abstract: Data augmentation is crucial in training deep models, preventing them from overfitting to limited data. Recent advances in generative AI, e.g., diffusion models, have enabled more sophisticated augmentation techniques that produce data resembling natural images. We introduce GeNIe a novel augmentation method which leverages a latent diffusion model conditioned on a text prompt to merge contrasting… ▽ More

    Submitted 23 March, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Our code is available https://github.com/UCDvision/GeNIe

  3. arXiv:2311.18159  [pdf, other

    cs.CV

    Compact3D: Smaller and Faster Gaussian Splatting with Vector Quantization

    Authors: KL Navaneet, Kossar Pourahmadi Meibodi, Soroush Abbasi Koohpayegani, Hamed Pirsiavash

    Abstract: 3D Gaussian Splatting (3DGS) is a new method for modeling and rendering 3D radiance fields that achieves much faster learning and rendering time compared to SOTA NeRF methods. However, it comes with the drawback of a much larger storage demand compared to NeRF methods since it needs to store the parameters for millions of 3D Gaussians. We notice that large groups of Gaussians share similar paramet… ▽ More

    Submitted 11 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Code is available at https://github.com/UCDvision/compact3d

  4. arXiv:2310.02556  [pdf, other

    cs.CL cs.CV

    NOLA: Compressing LoRA using Linear Combination of Random Basis

    Authors: Soroush Abbasi Koohpayegani, KL Navaneet, Parsa Nooralinejad, Soheil Kolouri, Hamed Pirsiavash

    Abstract: Fine-tuning Large Language Models (LLMs) and storing them for each downstream task or domain is impractical because of the massive model size (e.g., 350GB in GPT-3). Current literature, such as LoRA, showcases the potential of low-rank modifications to the original weights of an LLM, enabling efficient adaptation and storage for task-specific models. These methods can reduce the number of paramete… ▽ More

    Submitted 29 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Our code is available here: https://github.com/UCDvision/NOLA

  5. arXiv:2310.02544  [pdf, other

    cs.CV

    SlowFormer: Universal Adversarial Patch for Attack on Compute and Energy Efficiency of Inference Efficient Vision Transformers

    Authors: KL Navaneet, Soroush Abbasi Koohpayegani, Essam Sleiman, Hamed Pirsiavash

    Abstract: Recently, there has been a lot of progress in reducing the computation of deep models at inference time. These methods can reduce both the computational needs and power usage of deep models. Some of these approaches adaptively scale the compute based on the input instance. We show that such models can be vulnerable to a universal adversarial patch attack, where the attacker optimizes for a patch t… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Code is available at https://github.com/UCDvision/SlowFormer

  6. arXiv:2206.08898  [pdf, other

    cs.CV

    SimA: Simple Softmax-free Attention for Vision Transformers

    Authors: Soroush Abbasi Koohpayegani, Hamed Pirsiavash

    Abstract: Recently, vision transformers have become very popular. However, deploying them in many applications is computationally expensive partly due to the Softmax layer in the attention block. We introduce a simple but effective, Softmax-free attention block, SimA, which normalizes query and key matrices with simple $\ell_1$-norm instead of using Softmax layer. Then, the attention block in SimA is a simp… ▽ More

    Submitted 23 March, 2024; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Code is available here: https://github.com/UCDvision/sima

  7. arXiv:2206.08477  [pdf, other

    cs.CV cs.CR cs.LG

    Backdoor Attacks on Vision Transformers

    Authors: Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, A**kya Tejankar, Hamed Pirsiavash

    Abstract: Vision Transformers (ViT) have recently demonstrated exemplary performance on a variety of vision tasks and are being used as an alternative to CNNs. Their design is based on a self-attention mechanism that processes images as a sequence of patches, which is quite different compared to CNNs. Hence it is interesting to study if ViTs are vulnerable to backdoor attacks. Backdoor attacks happen when a… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  8. arXiv:2206.08464  [pdf, other

    cs.LG

    PRANC: Pseudo RAndom Networks for Compacting deep models

    Authors: Parsa Nooralinejad, Ali Abbasi, Soroush Abbasi Koohpayegani, Kossar Pourahmadi Meibodi, Rana Muhammad Shahroz Khan, Soheil Kolouri, Hamed Pirsiavash

    Abstract: We demonstrate that a deep model can be reparametrized as a linear combination of several randomly initialized and frozen deep models in the weight space. During training, we seek local minima that reside within the subspace spanned by these random models (i.e., `basis' networks). Our framework, PRANC, enables significant compaction of a deep model. The model can be reconstructed using a single sc… ▽ More

    Submitted 28 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

  9. arXiv:2201.05131  [pdf, other

    cs.CV

    SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation

    Authors: K L Navaneet, Soroush Abbasi Koohpayegani, A**kya Tejankar, Hamed Pirsiavash

    Abstract: Feature regression is a simple way to distill large neural network models to smaller ones. We show that with simple changes to the network architecture, regression can outperform more complex state-of-the-art approaches for knowledge distillation from self-supervised models. Surprisingly, the addition of a multi-layer perceptron head to the CNN backbone is beneficial even if used only during disti… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: In BMVC 2021. Code available at: https://github.com/UCDvision/simreg

  10. arXiv:2112.04607  [pdf, other

    cs.CV

    Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning

    Authors: KL Navaneet, Soroush Abbasi Koohpayegani, A**kya Tejankar, Kossar Pourahmadi, Akshayvarun Subramanya, Hamed Pirsiavash

    Abstract: We are interested in representation learning in self-supervised, supervised, and semi-supervised settings. Some recent self-supervised learning methods like mean-shift (MSF) cluster images by pulling the embedding of a query image to be closer to its nearest neighbors (NNs). Since most NNs are close to the query by design, the averaging may not affect the embedding of the query much. On the other… ▽ More

    Submitted 14 October, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Code is available at https://github.com/UCDvision/CMSF. arXiv admin note: text overlap with arXiv:2110.10309

  11. arXiv:2111.15667  [pdf, other

    cs.CV

    Adaptive Token Sampling For Efficient Vision Transformers

    Authors: Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Juergen Gall

    Abstract: While state-of-the-art vision transformer models achieve promising results in image classification, they are computationally expensive and require many GFLOPs. Although the GFLOPs of a vision transformer can be decreased by reducing the number of tokens in the network, there is no setting that is optimal for all input images. In this work, we therefore introduce a differentiable parameter-free Ada… ▽ More

    Submitted 26 July, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  12. arXiv:2110.10309  [pdf, other

    cs.CV

    Constrained Mean Shift for Representation Learning

    Authors: A**kya Tejankar, Soroush Abbasi Koohpayegani, Hamed Pirsiavash

    Abstract: We are interested in representation learning from labeled or unlabeled data. Inspired by recent success of self-supervised learning (SSL), we develop a non-contrastive representation learning method that can exploit additional knowledge. This additional knowledge may come from annotated labels in the supervised setting or an SSL model from another modality in the SSL setting. Our main idea is to g… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  13. arXiv:2110.00527  [pdf, other

    cs.CV

    Consistent Explanations by Contrastive Learning

    Authors: Vipin Pillai, Soroush Abbasi Koohpayegani, Ashley Ouligian, Dennis Fong, Hamed Pirsiavash

    Abstract: Post-hoc explanation methods, e.g., Grad-CAM, enable humans to inspect the spatial regions responsible for a particular network decision. However, it is shown that such explanations are not always consistent with human priors, such as consistency across image transformations. Given an interpretation algorithm, e.g., Grad-CAM, we introduce a novel training method to train the model to produce more… ▽ More

    Submitted 8 April, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: To be published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  14. arXiv:2105.10123  [pdf, other

    cs.CV

    Backdoor Attacks on Self-Supervised Learning

    Authors: Aniruddha Saha, A**kya Tejankar, Soroush Abbasi Koohpayegani, Hamed Pirsiavash

    Abstract: Large-scale unlabeled data has spurred recent progress in self-supervised learning methods that learn rich visual representations. State-of-the-art self-supervised methods for learning representations from images (e.g., MoCo, BYOL, MSF) use an inductive bias that random augmentations (e.g., random crops) of an image should produce similar embeddings. We show that such methods are vulnerable to bac… ▽ More

    Submitted 8 June, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: CVPR 2022 (Oral)

  15. arXiv:2105.07269  [pdf, other

    cs.CV

    Mean Shift for Self-Supervised Learning

    Authors: Soroush Abbasi Koohpayegani, A**kya Tejankar, Hamed Pirsiavash

    Abstract: Most recent self-supervised learning (SSL) algorithms learn features by contrasting between instances of images or by clustering the images and then contrasting between the image clusters. We introduce a simple mean-shift algorithm that learns representations by grou** images together without contrasting between them or adopting much of prior on the structure of the clusters. We simply "shift" t… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

  16. arXiv:2012.09259  [pdf, other

    cs.CV

    ISD: Self-Supervised Learning by Iterative Similarity Distillation

    Authors: A**kya Tejankar, Soroush Abbasi Koohpayegani, Vipin Pillai, Paolo Favaro, Hamed Pirsiavash

    Abstract: Recently, contrastive learning has achieved great results in self-supervised learning, where the main idea is to push two augmentations of an image (positive pairs) closer compared to other random images (negative pairs). We argue that not all random images are equal. Hence, we introduce a self supervised learning algorithm where we use a soft similarity for the negative images rather than a binar… ▽ More

    Submitted 10 September, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

  17. arXiv:2010.14713  [pdf, other

    cs.CV cs.LG

    CompRess: Self-Supervised Learning by Compressing Representations

    Authors: Soroush Abbasi Koohpayegani, A**kya Tejankar, Hamed Pirsiavash

    Abstract: Self-supervised learning aims to learn good representations with unlabeled data. Recent works have shown that larger models benefit more from self-supervised learning than smaller models. As a result, the gap between supervised and self-supervised learning has been greatly reduced for larger models. In this work, instead of designing a new pseudo task for self-supervised learning, we develop a mod… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.