Skip to main content

Showing 1–8 of 8 results for author: Hakim, G A V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03588  [pdf, other

    cs.CV

    Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization

    Authors: Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Gustavo Adolfo Vargas Hakim, David Osowiechi, Moslem Yazdanpanah, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Standard deep learning architectures such as convolutional neural networks and vision transformers often fail to generalize to previously unseen domains due to the implicit assumption that both source and target data are drawn from independent and identically distributed (i.i.d.) populations. In response, Domain Generalization techniques aim to enhance model robustness by simulating novel data dis… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.13875  [pdf, other

    cs.CV

    WATT: Weight Average Test-Time Adaptation of CLIP

    Authors: David Osowiechi, Mehrdad Noori, Gustavo Adolfo Vargas Hakim, Moslem Yazdanpanah, Ali Bahri, Milad Cheraghalikhani, Sahar Dastani, Farzad Beizaee, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Vision-Language Models (VLMs) such as CLIP have yielded unprecedented performance for zero-shot image classification, yet their generalization capability may still be seriously challenged when confronted to domain shifts. In response, we present Weight Average Test-Time Adaptation (WATT) of CLIP, a pioneering approach facilitating full test-time adaptation (TTA) of this VLM. Our method employs a d… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2405.12419  [pdf, other

    cs.CV cs.LG

    GeoMask3D: Geometrically Informed Mask Selection for Self-Supervised Point Cloud Learning in 3D

    Authors: Ali Bahri, Moslem Yazdanpanah, Mehrdad Noori, Milad Cheraghalikhani, Gustavo Adolfo Vargas Hakim, David Osowiechi, Farzad Beizaee, Ismail Ben Ayed, Christian Desrosiers

    Abstract: We introduce a pioneering approach to self-supervised learning for point clouds, employing a geometrically informed mask selection strategy called GeoMask3D (GM3D) to boost the efficiency of Masked Auto Encoders (MAE). Unlike the conventional method of random masking, our technique utilizes a teacher-student model to focus on intricate areas within the data, guiding the model's focus toward region… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  4. arXiv:2405.00754  [pdf, other

    cs.CV cs.LG

    CLIPArTT: Light-weight Adaptation of CLIP to New Domains at Test Time

    Authors: Gustavo Adolfo Vargas Hakim, David Osowiechi, Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Moslem Yazdanpanah, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Pre-trained vision-language models (VLMs), exemplified by CLIP, demonstrate remarkable adaptability across zero-shot classification tasks without additional training. However, their performance diminishes in the presence of domain shifts. In this study, we introduce CLIP Adaptation duRing Test-Time (CLIPArTT), a fully test-time adaptation (TTA) approach for CLIP, which involves automatic text prom… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  5. arXiv:2404.08392  [pdf, other

    cs.CV cs.LG

    NC-TTT: A Noise Contrastive Approach for Test-Time Training

    Authors: David Osowiechi, Gustavo A. Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Moslem Yazdanpanah, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Despite their exceptional performance in vision tasks, deep learning models often struggle when faced with domain shifts during testing. Test-Time Training (TTT) methods have recently gained popularity by their ability to enhance the robustness of models through the addition of an auxiliary objective that is jointly optimized with the main task. Being strictly unsupervised, this auxiliary objectiv… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  6. arXiv:2310.12345  [pdf, other

    cs.CV cs.AI cs.LG

    ClusT3: Information Invariant Test-Time Training

    Authors: Gustavo A. Vargas Hakim, David Osowiechi, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Deep Learning models have shown remarkable performance in a broad range of vision tasks. However, they are often vulnerable against domain shifts at test-time. Test-time training (TTT) methods have been developed in an attempt to mitigate these vulnerabilities, where a secondary task is solved at training time simultaneously with the main task, to be later used as an self-supervised proxy task at… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  7. arXiv:2303.15698  [pdf, other

    cs.CV

    TFS-ViT: Token-Level Feature Stylization for Domain Generalization

    Authors: Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Gustavo A. Vargas Hakim, David Osowiechi, Ismail Ben Ayed, Christian Desrosiers

    Abstract: Standard deep learning models such as convolutional neural networks (CNNs) lack the ability of generalizing to domains which have not been seen during training. This problem is mainly due to the common but often wrong assumption of such models that the source and target data come from the same i.i.d. distribution. Recently, Vision Transformers (ViTs) have shown outstanding performance for a broad… ▽ More

    Submitted 16 March, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

  8. arXiv:2210.11389  [pdf, other

    cs.CV cs.AI cs.LG

    TTTFlow: Unsupervised Test-Time Training with Normalizing Flow

    Authors: David Osowiechi, Gustavo A. Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers

    Abstract: A major problem of deep neural networks for image classification is their vulnerability to domain changes at test-time. Recent methods have proposed to address this problem with test-time training (TTT), where a two-branch model is trained to learn a main classification task and also a self-supervised task used to perform test-time adaptation. However, these techniques require defining a proxy tas… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.