Skip to main content

Showing 1–7 of 7 results for author: Raha, T

.
  1. arXiv:2405.16129  [pdf, other

    cs.CL

    iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

    Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

    Abstract: This paper describes our approach for SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense. The BRAINTEASER task comprises multiple-choice Question Answering designed to evaluate the models' lateral thinking capabilities. It consists of Sentence Puzzle and Word Puzzle subtasks that require models to defy default common-sense associations and exhibit unconventional thinking. We propo… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2404.14779  [pdf, other

    cs.CL

    Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches

    Authors: Clément Christophe, Praveen K Kanithi, Prateek Munjal, Tathagata Raha, Nasir Hayat, Ronnie Rajan, Ahmed Al-Mahrooqi, Avani Gupta, Muhammad Umar Salman, Gurpreet Gosal, Bhargav Kanakiya, Charles Chen, Natalia Vassilieva, Boulbaba Ben Amor, Marco AF Pimentel, Shadab Khan

    Abstract: This study presents a comprehensive analysis and comparison of two predominant fine-tuning methodologies - full-parameter fine-tuning and parameter-efficient tuning - within the context of medical Large Language Models (LLMs). We developed and refined a series of LLMs, based on the Llama-2 architecture, specifically designed to enhance medical knowledge retrieval, reasoning, and question-answering… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Published at AAAI 2024 Spring Symposium - Clinical Foundation Models

  3. arXiv:2306.08872  [pdf, other

    cs.CL cs.AI

    Neural models for Factual Inconsistency Classification with Explanations

    Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma

    Abstract: Factual consistency is one of the most important requirements when editing high quality documents. It is extremely important for automatic text generation systems like summarization, question answering, dialog modeling, and language modeling. Still, automated factual inconsistency detection is rather under-studied. Existing work has focused on (a) finding fake news kee** a knowledge base in cont… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ECML-PKDD 2023

  4. arXiv:2101.11954  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Identifying COVID-19 Fake News in Social Media

    Authors: Tathagata Raha, Vijayasaradhi Indurthi, Aayush Upadhyaya, Jeevesh Kataria, Pramud Bommakanti, Vikram Keswani, Vasudeva Varma

    Abstract: The evolution of social media platforms have empowered everyone to access information easily. Social media users can easily share information with the rest of the world. This may sometimes encourage spread of fake news, which can result in undesirable consequences. In this work, we train models which can identify health news related to COVID-19 pandemic as real or fake. Our models achieve a high F… ▽ More

    Submitted 1 February, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: CONSTRAINT@AAAI

  5. arXiv:2101.03382  [pdf, other

    cs.CL cs.IR cs.LG

    Task Adaptive Pretraining of Transformers for Hostility Detection

    Authors: Tathagata Raha, Sayar Ghosh Roy, Ujwal Narayan, Zubair Abid, Vasudeva Varma

    Abstract: Identifying adverse and hostile content on the web and more particularly, on social media, has become a problem of paramount interest in recent years. With their ever increasing popularity, fine-tuning of pretrained Transformer-based encoder models with a classifier head are gradually becoming the new baseline for natural language classification tasks. In our work, we explore the gains attributed… ▽ More

    Submitted 9 January, 2021; originally announced January 2021.

    Comments: To be published in: Proceedings of the First Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situation (CONSTRAINT) at AAAI 2021

  6. arXiv:2101.03207  [pdf, other

    cs.CL cs.AI cs.CY cs.IR cs.LG

    Leveraging Multilingual Transformers for Hate Speech Detection

    Authors: Sayar Ghosh Roy, Ujwal Narayan, Tathagata Raha, Zubair Abid, Vasudeva Varma

    Abstract: Detecting and classifying instances of hate in social media text has been a problem of interest in Natural Language Processing in the recent years. Our work leverages state of the art Transformer language models to identify hate speech in a multilingual setting. Capturing the intent of a post or a comment on social media involves careful evaluation of the language style, semantic content and addit… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: To be published in: FIRE (Working Notes) 2020, Hate Speech and Offensive Content Identification in Indo-European Languages, HASOC 2020

  7. arXiv:2007.14576  [pdf, other

    cs.CL

    Development of POS tagger for English-Bengali Code-Mixed data

    Authors: Tathagata Raha, Sainik Kumar Mahata, Dipankar Das, Sivaji Bandyopadhyay

    Abstract: Code-mixed texts are widespread nowadays due to the advent of social media. Since these texts combine two languages to formulate a sentence, it gives rise to various research problems related to Natural Language Processing. In this paper, we try to excavate one such problem, namely, Parts of Speech tagging of code-mixed texts. We have built a system that can POS tag English-Bengali code-mixed data… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted and published in The sixteenth International Conference on Natural Language Processing (ICON-2019)