Skip to main content

Showing 1–8 of 8 results for author: Layek, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12274  [pdf, other

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Soham Tripathy, Sayan Layek, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2406.11801  [pdf, other

    cs.CL

    Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: Ensuring the safe alignment of large language models (LLMs) with human values is critical as they become integral to applications like translation and question answering. Current alignment methods struggle with dynamic user intentions and complex objectives, making models vulnerable to generating harmful content. We propose Safety Arithmetic, a training-free framework enhancing LLM safety across d… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under Review. Codes are available at: https://github.com/declare-lab/safety-arithmetic

  3. arXiv:2406.11139  [pdf, other

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2405.01596  [pdf

    physics.soc-ph cs.SI

    Analyzing Player Involvement in the Indian Pro Kabaddi League: A Network Analysis Approach

    Authors: Arjab Sengupta, Subhadip Layek, Krishanu Deyasi

    Abstract: This paper aims to apply network analysis to all players who have participated in the Indian Pro Kabaddi League since its inception. The Kabaddi network has been constructed based on the number of teams and players they have played with. The players have been ranked with the help of the degree and PageRank algorithm. Small-world phenomenon is observed in the Kabaddi network. The significance of th… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  5. arXiv:2402.15302  [pdf, other

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review. {https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA}

  6. arXiv:2401.12671  [pdf, other

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d… ▽ More

    Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  7. arXiv:2401.10647  [pdf, other

    cs.CL

    Sowing the Wind, Rea** the Whirlwind: The Impact of Editing Language Models

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant in terms of assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of such modifications through model editing, uncovering a c… ▽ More

    Submitted 16 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024

  8. arXiv:2312.05626  [pdf, other

    cs.SE cs.AI

    Redefining Developer Assistance: Through Large Language Models in Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra

    Abstract: In this paper, we delve into the advancement of domain-specific Large Language Models (LLMs) with a focus on their application in software development. We introduce DevAssistLlama, a model developed through instruction tuning, to assist developers in processing software-related natural language queries. This model, a variant of instruction tuned LLM, is particularly adept at handling intricate tec… ▽ More

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Under review