Skip to main content

Showing 1–26 of 26 results for author: Hazra, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12274  [pdf, other

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Soham Tripathy, Sayan Layek, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2406.11801  [pdf, other

    cs.CL

    Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: Ensuring the safe alignment of large language models (LLMs) with human values is critical as they become integral to applications like translation and question answering. Current alignment methods struggle with dynamic user intentions and complex objectives, making models vulnerable to generating harmful content. We propose Safety Arithmetic, a training-free framework enhancing LLM safety across d… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under Review. Codes are available at: https://github.com/declare-lab/safety-arithmetic

  3. arXiv:2406.11139  [pdf, other

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2406.01309  [pdf, other

    cs.NE cs.AI

    REvolve: Reward Evolution with Large Language Models for Autonomous Driving

    Authors: Rishi Hazra, Alkis Sygkounas, Andreas Persson, Amy Loutfi, Pedro Zuidberg Dos Martires

    Abstract: Designing effective reward functions is crucial to training reinforcement learning (RL) algorithms. However, this design is non-trivial, even for domain experts, due to the subjective nature of certain tasks that are hard to quantify explicitly. In recent works, large language models (LLMs) have been used for reward generation from natural language task descriptions, leveraging their extensive ins… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: project page: https://rishihazra.github.io/REvolve. arXiv admin note: text overlap with arXiv:2310.12931 by other authors

  5. arXiv:2402.16159  [pdf, other

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  6. arXiv:2402.15302  [pdf, other

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review. {https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA}

  7. arXiv:2401.12671  [pdf, other

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d… ▽ More

    Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  8. arXiv:2401.10647  [pdf, other

    cs.CL

    Sowing the Wind, Rea** the Whirlwind: The Impact of Editing Language Models

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant in terms of assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of such modifications through model editing, uncovering a c… ▽ More

    Submitted 16 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024

  9. arXiv:2312.05626  [pdf, other

    cs.SE cs.AI

    Redefining Developer Assistance: Through Large Language Models in Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra

    Abstract: In this paper, we delve into the advancement of domain-specific Large Language Models (LLMs) with a focus on their application in software development. We introduce DevAssistLlama, a model developed through instruction tuning, to assist developers in processing software-related natural language queries. This model, a variant of instruction tuned LLM, is particularly adept at handling intricate tec… ▽ More

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Under review

  10. arXiv:2309.10811  [pdf, other

    cs.DL cs.CL

    Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

    Authors: Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

    Abstract: Interdisciplinarity has over the recent years have gained tremendous importance and has become one of the key ways of doing cutting edge research. In this paper we attempt to model the citation flow across three different fields -- Physics (PHY), Mathematics (MA) and Computer Science (CS). For instance, is there a specific pattern in which these fields cite one another? We carry out experiments on… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted at Journal of Physics: Complexity

  11. arXiv:2309.05961  [pdf, other

    cs.SI cs.CL cs.IR cs.LG

    Evaluating the Ebb and Flow: An In-depth Analysis of Question-Answering Trends across Diverse Platforms

    Authors: Rima Hazra, Agnik Saha, Somnath Banerjee, Animesh Mukherjee

    Abstract: Community Question Answering (CQA) platforms steadily gain popularity as they provide users with fast responses to their queries. The swiftness of these responses is contingent on a mixture of query-specific and user-related elements. This paper scrutinizes these contributing factors within the context of six highly popular CQA platforms, identified through their standout answering speed. Our inve… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted as POSTER

  12. arXiv:2309.05035  [pdf, other

    cs.IR cs.SE cs.SI

    Duplicate Question Retrieval and Confirmation Time Prediction in Software Communities

    Authors: Rima Hazra, Debanjan Saha, Amruit Sahoo, Somnath Banerjee, Animesh Mukherjee

    Abstract: Community Question Answering (CQA) in different domains is growing at a large scale because of the availability of several platforms and huge shareable information among users. With the rapid growth of such online platforms, a massive amount of archived data makes it difficult for moderators to retrieve possible duplicates for a new question and identify and confirm existing question pairs as dupl… ▽ More

    Submitted 5 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: Full paper accepted at ASONAM 2023: The 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

  13. arXiv:2308.12682  [pdf, other

    cs.AI

    SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge

    Authors: Rishi Hazra, Pedro Zuidberg Dos Martires, Luc De Raedt

    Abstract: Large Language Models (LLMs) have demonstrated impressive planning abilities due to their vast "world knowledge". Yet, obtaining plans that are both feasible (grounded in affordances) and cost-effective (in plan length), remains a challenge, despite recent progress. This contrasts with heuristic planning methods that employ domain knowledge (formalized in action models such as PDDL) and heuristic… ▽ More

    Submitted 1 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted in AAAI 2024. Website: https://rishihazra.github.io/SayCanPay/

  14. arXiv:2304.08349  [pdf, other

    cs.AI cs.LG

    Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach

    Authors: Rishi Hazra, Luc De Raedt

    Abstract: Despite numerous successes in Deep Reinforcement Learning (DRL), the learned policies are not interpretable. Moreover, since DRL does not exploit symbolic relational representations, it has difficulties in co** with structural changes in its environment (such as increasing the number of objects). Relational Reinforcement Learning, on the other hand, inherits the relational representations from s… ▽ More

    Submitted 14 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: Abridged version accepted in ECML-PKDD 2023

  15. arXiv:2303.16975  [pdf, other

    cs.CV

    EgoTV: Egocentric Task Verification from Natural Language Task Descriptions

    Authors: Rishi Hazra, Brian Chen, Akshara Rai, Nitin Kamra, Ruta Desai

    Abstract: To enable progress towards egocentric agents capable of understanding everyday tasks specified in natural language, we propose a benchmark and a synthetic dataset called Egocentric Task Verification (EgoTV). The goal in EgoTV is to verify the execution of tasks from egocentric videos based on the natural language description of these tasks. EgoTV contains pairs of videos and their task description… ▽ More

    Submitted 25 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted at ICCV 2023

  16. arXiv:2207.00623  [pdf, other

    cs.IR cs.SE

    Is this bug severe? A text-cum-graph based model for bug severity prediction

    Authors: Rima Hazra, Arpit Dwivedi, Animesh Mukherjee

    Abstract: Repositories of large software systems have become commonplace. This massive expansion has resulted in the emergence of various problems in these software platforms including identification of (i) bug-prone packages, (ii) critical bugs, and (iii) severity of bugs. One of the important goals would be to mine these bugs and recommend them to the developers to resolve them. The first step to this is… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted at ECML PKDD 2022, Research and ADS Track

  17. arXiv:2105.05069  [pdf, other

    cs.AI cs.CL

    Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent Protocols

    Authors: Rishi Hazra, Sonu Dixit, Sayambhu Sen

    Abstract: Human language has been described as a system that makes \textit{use of finite means to express an unlimited array of thoughts}. Of particular interest is the aspect of compositionality, whereby, the meaning of a compound language expression can be deduced from the meaning of its constituent parts. If artificial agents can develop compositional communication protocols akin to human language, they… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL). arXiv admin note: substantial text overlap with arXiv:2012.05011

  18. arXiv:2105.03943  [pdf, other

    cs.CL cs.AI

    gComm: An environment for investigating generalization in Grounded Language Acquisition

    Authors: Rishi Hazra, Sonu Dixit

    Abstract: gComm is a step towards develo** a robust platform to foster research in grounded language acquisition in a more challenging and realistic setting. It comprises a 2-d grid environment with a set of agents (a stationary speaker and a mobile listener connected via a communication channel) exposed to a continuous array of tasks in a partially observable setting. The key to solving these tasks lies… ▽ More

    Submitted 15 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL). arXiv admin note: substantial text overlap with arXiv:2012.05011

  19. arXiv:2103.06490   

    cs.CL cs.AI cs.HC cs.LG cs.NE

    Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

    Authors: Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati

    Abstract: While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation… ▽ More

    Submitted 3 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Two of the authors had published similar manuscripts on arXiv. So withdrawing this one. All further updations will be reflected at arXiv:1911.00234

  20. arXiv:2101.08729  [pdf, other

    cs.IR cs.SE

    Joint Autoregressive and Graph Models for Software and Developer Social Networks

    Authors: Rima Hazra, Hardik Aggarwal, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti

    Abstract: Social network research has focused on hyperlink graphs, bibliographic citations, friend/follow patterns, influence spread, etc. Large software repositories also form a highly valuable networked artifact, usually in the form of a collection of packages, their developers, dependencies among them, and bug reports. This "social network of code" is rarely studied by social network researchers. We intr… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at ECIR 2021

  21. arXiv:2101.00454  [pdf, other

    cs.DL

    Mining the online infosphere: A survey

    Authors: Sayantan Adak, Souvic Chakraborty, Paramtia Das, Mithun Das, Abhisek Dash, Rima Hazra, Binny Mathew, Punyajoy Saha, Soumya Sarkar, Animesh Mukherjee

    Abstract: The evolution of AI-based system and applications had pervaded everyday life to make decisions that have momentous impact on individuals and society. With the staggering growth of online data, often termed as the Online Infosphere it has become paramount to monitor the infosphere to ensure social good as the AI-based decisions are severely dependent on it. The goal of this survey is to provide a c… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

    Comments: 29 pages

  22. arXiv:2012.05011  [pdf, other

    cs.CL cs.AI

    Intrinsically Motivated Compositional Language Emergence

    Authors: Rishi Hazra, Sonu Dixit, Sayambhu Sen

    Abstract: Recently, there has been a great deal of research in emergent communication on artificial agents interacting in simulated environments. Recent studies have revealed that, in general, emergent languages do not follow the compositionality patterns of natural language. To deal with this, existing works have proposed a limited channel capacity as an important constraint for learning highly composition… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: abridged version Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL) titled "gComm: An environment for investigating generalization in Grounded Language Acquisition"

  23. Characterising authors on the extent of their paper acceptance: A case study of the Journal of High Energy Physics

    Authors: Rima Hazra, Aryan, Hardik Aggarwal, Matteo Marsili, Animesh Mukherjee

    Abstract: New researchers are usually very curious about the recipe that could accelerate the chances of their paper getting accepted in a reputed forum (journal/conference). In search of such a recipe, we investigate the profile and peer review text of authors whose papers almost always get accepted at a venue (Journal of High Energy Physics in our current work). We find authors with high acceptance rate a… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted in JCDL'2020

  24. arXiv:2004.02780  [pdf, other

    cs.MA cs.AI

    Networked Multi-Agent Reinforcement Learning with Emergent Communication

    Authors: Shubham Gupta, Rishi Hazra, Ambedkar Dukkipati

    Abstract: Multi-Agent Reinforcement Learning (MARL) methods find optimal policies for agents that operate in the presence of other learning agents. Central to achieving this is how the agents coordinate. One way to coordinate is by learning to communicate with each other. Can the agents develop a language while learning to perform a common task? In this paper, we formulate and study a MARL problem where coo… ▽ More

    Submitted 9 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: An abridged version of this paper has been accepted as a short paper at AAMAS 2020

  25. arXiv:1911.00234  [pdf, other

    cs.LG cs.IR stat.ML

    Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

    Authors: Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati

    Abstract: While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation… ▽ More

    Submitted 6 April, 2021; v1 submitted 1 November, 2019; originally announced November 2019.

    Comments: Accepted in NAACL-HLT 2021

  26. arXiv:1908.03793  [pdf, other

    cs.DL

    The rise and rise of interdisciplinary research: Understanding the interaction dynamics of three major fields -- Physics, Mathematics & Computer Science

    Authors: Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

    Abstract: The distinction between sciences is becoming increasingly more artificial -- an approach from one area can be easily applied to the other. More exciting research nowadays is happening perhaps at the interfaces of disciplines like Physics, Mathematics and Computer Science. How do these interfaces emerge and interact? For instance, is there a specific pattern in which these fields cite each other? I… ▽ More

    Submitted 10 August, 2019; originally announced August 2019.

    Comments: This work has been accepted in ICADL-2019 conference