Showing 1–2 of 2 results for author: Gajera, B

  1. arXiv:2402.10601  [pdf, other]

    cs.CL cs.AI

    Jailbreaking Proprietary Large Language Models using Word Substitution Cipher

    Authors: Divij Handa, Advait Chirmule, Bimal Gajera, Chitta Baral

    Abstract: Large Language Models (LLMs) are aligned to moral and ethical guidelines but remain susceptible to creative prompts called Jailbreak that can bypass the alignment process. However, most jailbreaking prompts contain harmful questions in the natural language (mainly English), which can be detected by the LLM themselves. In this paper, we present jailbreaking prompts encoded using cryptographic techn…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 15 pages

  2. arXiv:2305.05050  [pdf, other]

    cs.CL cs.AI

    ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Bimal G. Gajera, Shreeyash Mukul Gowaikar, Chandan Gupta, Aman Chadha, Aishwarya Naresh Reganti, Amit Sheth, Amitava Das

    Abstract: Over the past decade, analogies, in the form of word-level analogies, have played a significant role as an intrinsic measure of evaluating the quality of word embedding methods such as word2vec. Modern large language models (LLMs), however, are primarily evaluated on extrinsic measures based on benchmarks such as GLUE and SuperGLUE, and there are only a few investigations on whether LLMs can draw…

    Submitted 25 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted as a long paper at Findings of ACL 2023