Skip to main content

Showing 1–2 of 2 results for author: Sekhar, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.07449  [pdf, other

    cs.CV

    Language Grounded QFormer for Efficient Vision Language Understanding

    Authors: Moulik Choraria, Nitesh Sekhar, Yue Wu, Xu Zhang, Prateek Singhal, Lav R. Varshney

    Abstract: Large-scale pretraining and instruction tuning have been successful for training general-purpose language models with broad competencies. However, extending to general-purpose vision-language models is challenging due to the distributional diversity in visual inputs. A recent line of work explores vision-language instruction tuning, taking inspiration from the Query Transformer (QFormer) approach… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Preprint Under Review

  2. arXiv:2212.01433  [pdf, other

    cs.LG cs.CL cs.CV eess.IV stat.ML

    Avoiding spurious correlations via logit correction

    Authors: Sheng Liu, Xu Zhang, Nitesh Sekhar, Yue Wu, Prateek Singhal, Carlos Fernandez-Granda

    Abstract: Empirical studies suggest that machine learning models trained with empirical risk minimization (ERM) often rely on attributes that may be spuriously correlated with the class labels. Such models typically lead to poor performance during inference for data lacking such correlations. In this work, we explicitly consider a situation where potential spurious correlations are present in the majority o… ▽ More

    Submitted 28 February, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: 17 pages, 6 figures