Skip to main content

Showing 1–7 of 7 results for author: Ghosal, S S

.
  1. arXiv:2406.13683  [pdf, other

    cs.CV cs.AI

    IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning

    Authors: Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha

    Abstract: Image-text contrastive models such as CLIP learn transferable and robust representations for zero-shot transfer to a variety of downstream tasks. However, to obtain strong downstream performances, prompts need to be carefully curated, which can be a tedious engineering task. To address the issue of manual prompt engineering, prompt-tuning is used where a set of contextual vectors are learned by le… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2405.20495  [pdf, other

    cs.CL cs.LG

    Transfer Q Star: Principled Decoding for LLM Alignment

    Authors: Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang

    Abstract: Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable frame… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2312.14452  [pdf, other

    cs.LG

    How to Overcome Curse-of-Dimensionality for Out-of-Distribution Detection?

    Authors: Soumya Suvra Ghosal, Yiyou Sun, Yixuan Li

    Abstract: Machine learning models deployed in the wild can be challenged by out-of-distribution (OOD) data from unknown classes. Recent advances in OOD detection rely on distance measures to distinguish samples that are relatively far away from the in-distribution (ID) data. Despite the promise, distance-based methods can suffer from the curse-of-dimensionality problem, which limits the efficacy in high-dim… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  4. arXiv:2310.15264  [pdf, other

    cs.CL cs.AI

    Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey

    Authors: Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Gei**, Furong Huang, Dinesh Manocha, Amrit Singh Bedi

    Abstract: Large Language Models (LLMs) have revolutionized the domain of natural language processing (NLP) with remarkable capabilities of generating human-like text responses. However, despite these advancements, several works in the existing literature have raised serious concerns about the potential misuse of LLMs such as spreading misinformation, generating fake news, plagiarism in academia, and contami… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  5. arXiv:2303.05809  [pdf, other

    cs.LG

    Distributionally Robust Optimization with Probabilistic Group

    Authors: Soumya Suvra Ghosal, Yixuan Li

    Abstract: Modern machine learning models may be susceptible to learning spurious correlations that hold on average but not for the atypical group of samples. To address the problem, previous approaches minimize the empirical worst-group risk. Despite the promise, they often assume that each sample belongs to one and only one group, which does not allow expressing the uncertainty in group labeling. In this p… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Published at AAAI 2023

  6. arXiv:2203.09125  [pdf, other

    cs.CV cs.AI cs.LG

    Are Vision Transformers Robust to Spurious Correlations?

    Authors: Soumya Suvra Ghosal, Yifei Ming, Yixuan Li

    Abstract: Deep neural networks may be susceptible to learning spurious correlations that hold on average but not in atypical test samples. As with the recent emergence of vision transformer (ViT) models, it remains underexplored how spurious correlations are manifested in such architectures. In this paper, we systematically investigate the robustness of vision transformers to spurious correlations on three… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  7. arXiv:2010.10836  [pdf, ps, other

    cs.CL cs.AI cs.LG

    ReSCo-CC: Unsupervised Identification of Key Disinformation Sentences

    Authors: Soumya Suvra Ghosal, Deepak P, Anna Jurek-Loughrey

    Abstract: Disinformation is often presented in long textual articles, especially when it relates to domains such as health, often seen in relation to COVID-19. These articles are typically observed to have a number of trustworthy sentences among which core disinformation sentences are scattered. In this paper, we propose a novel unsupervised task of identifying sentences containing key disinformation within… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: The 22nd International Conference on Information Integration and Web-based Applications & Services (iiWAS '20), Chiang Mai, Thailand