Skip to main content

Showing 1–4 of 4 results for author: Javadi, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.15293  [pdf, other

    cs.CV cs.AI cs.LG

    SkipViT: Speeding Up Vision Transformers with a Token-Level Skip Connection

    Authors: Foozhan Ataiefard, Walid Ahmed, Habib Hajimolahoseini, Saina Asani, Farnoosh Javadi, Mohammad Hassanpour, Omar Mohamed Awad, Austin Wen, Kangling Liu, Yang Liu

    Abstract: Vision transformers are known to be more computationally and data-intensive than CNN models. These transformer models such as ViT, require all the input image tokens to learn the relationship among them. However, many of these tokens are not informative and may contain irrelevant information such as unrelated background or unimportant scenery. These tokens are overlooked by the multi-head self-att… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  2. arXiv:2311.15134  [pdf, other

    cs.LG cs.AI

    SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

    Authors: Habib Hajimolahoseini, Omar Mohamed Awad, Walid Ahmed, Austin Wen, Saina Asani, Mohammad Hassanpour, Farnoosh Javadi, Mehdi Ahmadi, Foozhan Ataiefard, Kangling Liu, Yang Liu

    Abstract: In this paper, we present SwiftLearn, a data-efficient approach to accelerate training of deep learning models using a subset of data samples selected during the warm-up stages of training. This subset is selected based on an importance criteria measured over the entire dataset during warm-up stages, aiming to preserve the model performance with fewer examples during the rest of training. The impo… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  3. arXiv:2311.03426  [pdf, other

    cs.LG cs.AI cs.CV

    GQKVA: Efficient Pre-training of Transformers by Grou** Queries, Keys, and Values

    Authors: Farnoosh Javadi, Walid Ahmed, Habib Hajimolahoseini, Foozhan Ataiefard, Mohammad Hassanpour, Saina Asani, Austin Wen, Omar Mohamed Awad, Kangling Liu, Yang Liu

    Abstract: Massive transformer-based models face several challenges, including slow and computationally intensive pre-training and over-parametrization. This paper addresses these challenges by proposing a versatile method called GQKVA, which generalizes query, key, and value grou** techniques. GQKVA is designed to speed up transformer pre-training while reducing the model size. Our experiments with variou… ▽ More

    Submitted 13 December, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

  4. arXiv:2310.03148  [pdf, other

    cs.IR cs.LG

    Multi-Task Learning For Reduced Popularity Bias In Multi-Territory Video Recommendations

    Authors: Phanideep Gampa, Farnoosh Javadi, Belhassen Bayar, Ainur Yessenalina

    Abstract: Various data imbalances that naturally arise in a multi-territory personalized recommender system can lead to a significant item bias for globally prevalent items. A locally popular item can be overshadowed by a globally prevalent item. Moreover, users' viewership patterns/statistics can drastically change from one geographic location to another which may suggest to learn specific user embeddings.… ▽ More

    Submitted 24 September, 2023; originally announced October 2023.

    Comments: Recsys CARS 2023 Workshop paper