Showing 1–50 of 207 results for author: Bhattacharyya, P

  1. arXiv:2407.03076  [pdf, other]

    cs.CL

    A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

    Authors: Ramakrishna Appicharla, Baban Gain, Santanu Pal, Asif Ekbal, Pushpak Bhattacharyya

    Abstract: In document-level neural machine translation (DocNMT), multi-encoder approaches are common in encoding context and source sentences. Recent studies \cite{li-etal-2020-multi-encoder} have shown that the context encoder generates noise and makes the model robust to the choice of context. This paper further investigates this observation by explicitly modelling context encoding through multi-task lear…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted to EAMT 2024 (poster)

  2. arXiv:2406.15470  [pdf, other]

    cs.CL cs.AI cs.SI

    Mental Disorder Classification via Temporal Representation of Text

    Authors: Raja Kumar, Kishan Maharaj, Ashita Saxena, Pushpak Bhattacharyya

    Abstract: Mental disorders pose a global challenge, aggravated by the shortage of qualified mental health professionals. Mental disorder prediction from social media posts by current LLMs is challenging due to the complexities of sequential text data and the limited context length of language models. Current language model-based approaches split a single data instance into multiple chunks to compensate for…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: RK and KM contributed equally to this work, 15 pages, 5 figures, 9 tables

  3. arXiv:2406.14284  [pdf]

    cs.CL cs.AI

    VAIYAKARANA : A Benchmark for Automatic Grammar Correction in Bangla

    Authors: Pramit Bhattacharyya, Arnab Bhattacharya

    Abstract: Bangla (Bengali) is the fifth most spoken language globally and, yet, the problem of automatic grammar correction in Bangla is still in its nascent stage. This is mostly due to the need for a large corpus of grammatically incorrect sentences, with their corresponding correct counterparts. The present state-of-the-art techniques to curate a corpus for grammatically wrong sentences involve random sw…

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.13332  [pdf, other]

    cs.CL

    How effective is Multi-source pivoting for Translation of Low Resource Indian Languages?

    Authors: Pranav Gaikwad, Meet Doshi, Raj Dabre, Pushpak Bhattacharyya

    Abstract: Machine Translation (MT) between linguistically dissimilar languages is challenging, especially due to the scarcity of parallel corpora. Prior works suggest that pivoting through a high-resource language can help translation into a related low-resource language. However, existing works tend to discard the source sentence when pivoting. Taking the case of English to Indian language MT, this paper e…

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.11925  [pdf, other]

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by…

    Submitted 3 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.10993  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving

    Authors: Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya

    Abstract: Code-switching is a widely prevalent linguistic phenomenon in multilingual societies like India. Building speech-to-text models for code-switched speech is challenging due to limited availability of datasets. In this work, we focus on the problem of spoken translation (ST) of code-switched speech in Indian languages to English text. We present a new end-to-end model architecture COSTA that scaffol…

    Submitted 16 June, 2024; originally announced June 2024.

  7. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.10561  [pdf, other]

    cs.CL

    We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation

    Authors: Palash Moon, Pushpak Bhattacharyya

    Abstract: The detection of depression through non-verbal cues has gained significant attention. Previous research predominantly centred on identifying depression within the confines of controlled laboratory environments, often with the supervision of psychologists or counsellors. Unfortunately, datasets generated in such controlled settings may struggle to account for individual behaviours in real-life situ…

    Submitted 15 June, 2024; originally announced June 2024.

  9. arXiv:2406.10560  [pdf, other]

    cs.CL

    Facts-and-Feelings: Capturing both Objectivity and Subjectivity in Table-to-Text Generation

    Authors: Tathagata Dey, Pushpak Bhattacharyya

    Abstract: Table-to-text generation, a long-standing challenge in natural language generation, has remained unexplored through the lens of subjectivity. Subjectivity here encompasses the comprehension of information derived from the table that cannot be described solely by objective data. Given the absence of pre-existing datasets, we introduce the Ta2TS dataset with 3849 data instances. We perform the task…

    Submitted 15 June, 2024; originally announced June 2024.

  10. arXiv:2406.09994  [pdf, other]

    cs.CL

    Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models

    Authors: Manas Jhalani, Annervaz K M, Pushpak Bhattacharyya

    Abstract: In the realm of multimodal tasks, Visual Question Answering (VQA) plays a crucial role by addressing natural language questions grounded in visual content. Knowledge-Based Visual Question Answering (KBVQA) advances this concept by adding external knowledge along with images to respond to questions. We introduce an approach for KBVQA, augmenting the existing vision-language transformer encoder-deco…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 16 pages, 12 figures

  11. arXiv:2406.05881  [pdf, other]

    cs.LG cs.CL cs.RO

    LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning

    Authors: Utsav Singh, Pramit Bhattacharyya, Vinay P. Namboodiri

    Abstract: Developing interactive systems that leverage natural language instructions to solve complex robotic control tasks has been a long-desired goal in the robotics community. Large Language Models (LLMs) have demonstrated exceptional abilities in handling complex tasks, including logical reasoning, in-context learning, and code generation. However, predicting low-level robotic actions using LLMs poses…

    Submitted 16 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  12. arXiv:2406.05344  [pdf, other]

    cs.CL

    MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention

    Authors: Prince Jha, Raghav Jain, Konika Mandal, Aman Chadha, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: In the digital world, memes present a unique challenge for content moderation due to their potential to spread harmful content. Although detection methods have improved, proactive solutions such as intervention are still limited, with current research focusing mostly on text-based content, neglecting the widespread influence of multimodal content like memes. Addressing this gap, we present \textit…

    Submitted 8 June, 2024; originally announced June 2024.

  13. arXiv:2406.04886  [pdf, other]

    cs.CV cs.AI cs.CL

    Seeing the Unseen: Visual Metaphor Captioning for Videos

    Authors: Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Sumit Shekhar

    Abstract: Metaphors are a common communication tool used in our day-to-day life. The detection and generation of metaphors in textual form have been studied extensively but metaphors in other forms have been under-explored. Recent studies have shown that Vision-Language (VL) models cannot understand visual metaphors in memes and adverts. As of now, no probing studies have been done that involve complex lang…

    Submitted 7 June, 2024; originally announced June 2024.

  14. arXiv:2405.20628  [pdf, other]

    cs.AI cs.CL cs.CV

    ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos

    Authors: Krishanu Maity, A. S. Poornash, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: In an era of rapidly evolving internet technology, the surge in multimodal content, including videos, has expanded the horizons of online communication. However, the detection of toxic content in this diverse landscape, particularly in low-resource code-mixed languages, remains a critical challenge. While substantial research has addressed toxic content detection in textual data, the realm of vide…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: ACL Findings 2024

  15. arXiv:2405.09854  [pdf, other]

    cs.CL

    On the relevance of pre-neural approaches in natural language processing pedagogy

    Authors: Aditya Joshi, Jake Renzella, Pushpak Bhattacharyya, Saurav Jha, Xiangyu Zhang

    Abstract: While neural approaches using deep learning are the state-of-the-art for natural language processing (NLP) today, pre-neural algorithms and approaches still find a place in NLP textbooks and courses of recent years. In this paper, we compare two introductory NLP courses taught in Australia and India, and examine how Transformer and pre-neural approaches are balanced within the lecture plan and ass…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Under review at Teaching NLP workshop at ACL 2024; 8 pages

  16. arXiv:2404.12742  [pdf, other]

    cond-mat.str-el

    Kitaev-Heisenberg cobaltates: Coulomb exchange as leading nearest-neighbor interaction mechanism

    Authors: Pritam Bhattacharyya, Thorben Petersen, Satoshi Nishimoto, Liviu Hozoi

    Abstract: A range of honeycomb Co oxide compounds has been proposed and investigated in the search for a topological Kitaev spin liquid. Analyzing the quantum chemistry of interacting magnetic moments in Na$_3$Co$_2$SbO$_6$, a representative $LS$-coupled $t_{2g}^5e_g^2$ magnet, we find that the Kitaev and off-diagonal $Γ$ interactions are sizable and antiferromagnetic but still weaker than the Heisenberg co…

    Submitted 19 April, 2024; originally announced April 2024.

  17. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  18. arXiv:2404.04530  [pdf, other]

    cs.CL

    A Morphology-Based Investigation of Positional Encodings

    Authors: Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya

    Abstract: Contemporary deep learning models effectively handle languages with diverse morphology despite not being directly integrated into them. Morphology and word order are closely linked, with the latter incorporated into transformer-based models through positional encodings. This prompts a fundamental inquiry: Is there a correlation between the morphological complexity of a language and the utilization…

    Submitted 30 May, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Work in Progress

  19. arXiv:2403.20147  [pdf, other]

    cs.CL

    IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context

    Authors: Nihar Ranjan Sahoo, Pranamya Prashant Kulkarni, Narjis Asad, Arif Ahmad, Tanu Goyal, Aparna Garimella, Pushpak Bhattacharyya

    Abstract: The pervasive influence of social biases in language data has sparked the need for benchmark datasets that capture and evaluate these biases in Large Language Models (LLMs). Existing efforts predominantly focus on English language and the Western context, leaving a void for a reliable dataset that encapsulates India's unique socio-cultural nuances. To bridge this gap, we introduce IndiBias, a comp…

    Submitted 3 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  20. arXiv:2403.13638  [pdf, other]

    cs.CL

    Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese

    Authors: Meet Doshi, Raj Dabre, Pushpak Bhattacharyya

    Abstract: In this paper, we explore the utility of Translationese as synthetic data created using machine translation for pre-training language models (LMs). Pre-training requires vast amounts of monolingual data, which is mostly unavailable for languages other than English. Recently, there has been a growing interest in using synthetic data to address this data scarcity. We take the case of English and Ind…

    Submitted 21 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  21. arXiv:2402.17806  [pdf, other]

    cs.LG cond-mat.mtrl-sci stat.ML

    Material Microstructure Design Using VAE-Regression with Multimodal Prior

    Authors: Avadhut Sardeshmukh, Sreedhar Reddy, BP Gautham, Pushpak Bhattacharyya

    Abstract: We propose a variational autoencoder (VAE)-based model for building forward and inverse structure-property linkages, a problem of paramount importance in computational materials science. Our model systematically combines VAE with regression, linking the two models through a two-level prior conditioned on the regression variables. The regression loss is optimized jointly with the reconstruction los…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages main paper, 9 pages appendix. 10 tables and 11 figures. Accepted for publication in PAKDD 2024

  22. arXiv:2402.15478  [pdf, other]

    cs.LG stat.ML

    Transformers are Expressive, But Are They Expressive Enough for Regression?

    Authors: Swaroop Nath, Harshad Khadilkar, Pushpak Bhattacharyya

    Abstract: Transformers have become pivotal in Natural Language Processing, demonstrating remarkable success in applications like Machine Translation and Summarization. Given their widespread adoption, several works have attempted to analyze the expressivity of Transformers. Expressivity of a neural network is the class of functions it can approximate. A neural network is fully expressive if it can act as a…

    Submitted 7 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 18 pages, 10 figures, 3 tables

  23. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  24. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  25. arXiv:2401.09899  [pdf, other]

    cs.CL

    Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations

    Authors: Prince Jha, Krishanu Maity, Raghav Jain, Apoorv Verma, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: Internet memes have gained significant influence in communicating political, psychological, and sociocultural ideas. While memes are often humorous, there has been a rise in the use of memes for trolling and cyberbullying. Although a wide variety of effective deep learning-based models have been developed for detecting offensive multimodal memes, only a few works have been done on explainability a…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: EACL2024

  26. arXiv:2401.09023  [pdf, other]

    cs.CL

    Explain Thyself Bully: Sentiment Aided Cyberbullying Detection with Explanation

    Authors: Krishanu Maity, Prince Jha, Raghav Jain, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: Cyberbullying has become a big issue with the popularity of different social media networks and online communication apps. While plenty of research is going on to develop better models for cyberbullying detection in monolingual language, there is very little research on the code-mixed languages and explainability aspect of cyberbullying. Recent laws like "right to explanations" of General Data Pro…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: ICDAR 2023

  27. arXiv:2401.07729  [pdf, other]

    cs.CV cs.AI cs.RO

    SSL-Interactions: Pretext Tasks for Interactive Trajectory Prediction

    Authors: Prarthana Bhattacharyya, Chengjie Huang, Krzysztof Czarnecki

    Abstract: This paper addresses motion forecasting in multi-agent environments, pivotal for ensuring safety of autonomous vehicles. Traditional as well as recent data-driven marginal trajectory prediction methods struggle to properly learn non-linear agent-to-agent interactions. We present SSL-Interactions that proposes pretext tasks to enhance interaction modeling for trajectory prediction. We introduce fou…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 13 pages, 5 figures, submitted to IV-2024

  28. arXiv:2401.07078  [pdf, other]

    cs.CL

    PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities

    Authors: Settaluri Lakshmi Sravanthi, Meet Doshi, Tankala Pavan Kalyan, Rudra Murthy, Pushpak Bhattacharyya, Raj Dabre

    Abstract: LLMs have demonstrated remarkable capability for understanding semantics, but they often struggle with understanding pragmatics. To demonstrate this fact, we release a Pragmatics Understanding Benchmark (PUB) dataset consisting of fourteen tasks in four pragmatics phenomena, namely, Implicature, Presupposition, Reference, and Deixis. We curated high-quality test sets for each task, consisting of M…

    Submitted 13 January, 2024; originally announced January 2024.

  29. arXiv:2401.06103  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.soft

    In situ coherent X-ray scattering reveals polycrystalline structure and discrete annealing events in strongly-coupled nanocrystal superlattices

    Authors: Matthew J. Hurley, Christian P. N. Tanner, Joshua Portner, James K. Utterback, Igor Coropceanu, Garth J. Williams, Avishek Das, Andrei Fluerasu, Yanwen Sun, Sanghoon Song, Leo M. Hamerlynck, Alexander H. Miller, Priyadarshini Bhattacharyya, Dmitri V. Talapin, Naomi S. Ginsberg, Samuel W. Teitelbaum

    Abstract: Solution-phase bottom up self-assembly of nanocrystals into superstructures such as ordered superlattices is an attractive strategy to generate functional materials of increasing complexity, including very recent advances that incorporate strong interparticle electronic coupling. While the self-assembly kinetics in these systems have been elucidated and related to the product characteristics, the…

    Submitted 3 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 26 pages, 20 figures

  30. arXiv:2401.05134  [pdf, other]

    cs.AI cs.CL

    Yes, this is what I was looking for! Towards Multi-modal Medical Consultation Concern Summary Generation

    Authors: Abhisek Tiwari, Shreyangshu Bera, Sriparna Saha, Pushpak Bhattacharyya, Samrat Ghosh

    Abstract: Over the past few years, the use of the Internet for healthcare-related tasks has grown by leaps and bounds, posing a challenge in effectively managing and processing information to ensure its efficient utilization. During moments of emotional turmoil and psychological challenges, we frequently turn to the internet as our initial source of support, choosing this over discussing our feelings with o…

    Submitted 10 January, 2024; originally announced January 2024.

  31. arXiv:2312.11312  [pdf, other]

    cs.CL

    APE-then-QE: Correcting then Filtering Pseudo Parallel Corpora for MT Training Data Creation

    Authors: Akshay Batheja, Sourabh Deoghare, Diptesh Kanojia, Pushpak Bhattacharyya

    Abstract: Automatic Post-Editing (APE) is the task of automatically identifying and correcting errors in the Machine Translation (MT) outputs. We propose a repair-filter-use methodology that uses an APE system to correct errors on the target side of the MT training data. We select the sentence pairs from the original and corrected sentence pairs based on the quality scores computed using a Quality Estimatio…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.03507

  32. arXiv:2312.09508  [pdf, ps, other]

    cs.IR cs.CL

    IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages

    Authors: Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya

    Abstract: In this paper, we introduce Neural Information Retrieval resources for 11 widely spoken Indian Languages (Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Oriya, Punjabi, Tamil, and Telugu) from two major Indian language families (Indo-Aryan and Dravidian). These resources include (a) INDIC-MARCO, a multilingual version of the MSMARCO dataset in 11 Indian Languages created using Ma…

    Submitted 14 December, 2023; originally announced December 2023.

  33. arXiv:2311.17514  [pdf, other]

    cs.CL cs.AI

    Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning

    Authors: Swaroop Nath, Harshad Khadilkar, Pushpak Bhattacharyya

    Abstract: Query-focused Summarization (QfS) deals with systems that generate summaries from document(s) based on a query. Motivated by the insight that Reinforcement Learning (RL) provides a generalization to Supervised Learning (SL) for Natural Language Generation, and thereby performs better (empirically) than SL, we use an RL-based approach for this task of QfS. Additionally, we also resolve the conflict…

    Submitted 29 November, 2023; originally announced November 2023.

  34. arXiv:2311.01621  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Spin-orbit-lattice entangled state in A$_2$MgReO$_6$ (A = Ca, Sr, Ba) revealed by resonant inelastic X-ray scattering

    Authors: Felix I. Frontini, Graham H. J. Johnstone, Naoya Iwahara, Pritam Bhattacharyya, Nikolay A. Bogdanov, Liviu Hozoi, Mary H. Upton, Diego M. Casa, Daigorou Hirai, Young-June Kim

    Abstract: The $5d^1$ ordered double perovskites present an exotic playground for studying novel multi-polar physics due to large spin-orbit coupling. We present Re L3 edge resonant inelastic X-ray scattering (RIXS) results that reveal the presence of the dynamic Jahn-Teller effect in the A$_2$MgReO$_6$ (A = Ca, Sr, Ba) family of $5d^1$ double perovskites. The spin-orbit excitations in these materials show a…

    Submitted 2 November, 2023; originally announced November 2023.

  35. arXiv:2310.18930  [pdf, other]

    cs.CL

    Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning

    Authors: Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya

    Abstract: We present a novel retrofitting method to induce emotion aspects into pre-trained language models (PLMs) such as BERT and RoBERTa. Our method updates pre-trained network weights using contrastive learning so that the text fragments exhibiting similar emotions are encoded nearby in the representation space, and the fragments with different emotion content are pushed apart. While doing so, it also e…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Camera Ready Version

  36. arXiv:2310.17818  [pdf, other]

    cond-mat.str-el

    Distinct contiguous versus separated triplet-pair multiexcitons in an intramolecular singlet fission chromophore

    Authors: R. Chesler, P. Bhattacharyya, A. Shukla, S. Mazumdar

    Abstract: We show from many-body quantum mechanical calculations that there occur structurally distinct triplet-pair eigenstates in the intramolecular singlet fission (iSF) compound pentacene-tetracene-pentacene. Triplet excitons occupy neighboring pentacene and tetracene monomers in the higher energy doubly degenerate triplet-triplet multiexcitons, and terminal pentacene chromophores in the lower energy mul…

    Submitted 26 October, 2023; originally announced October 2023.

  37. arXiv:2310.16749  [pdf, other]

    cs.CL cs.HC

    DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages

    Authors: Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

    Abstract: Disfluency correction (DC) is the process of removing disfluent elements like fillers, repetitions and corrections from spoken utterances to create readable and interpretable text. DC is a vital post-processing step applied to Automatic Speech Recognition (ASR) outputs, before subsequent processing by downstream language understanding tasks. Existing DC research has primarily focused on English du…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Findings

  38. arXiv:2310.01430  [pdf, other]

    cs.CL cs.AI

    Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

    Authors: Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

    Abstract: The introduction of the MUStARD dataset, and its emotion recognition extension MUStARD++, have identified sarcasm to be a multi-modal phenomenon -- expressed not only in natural language text, but also through manners of speech (like tonality and intonation) and visual cues (facial expression). With this work, we aim to perform a rigorous benchmarking of the MUStARD++ dataset by considering state-…

    Submitted 29 September, 2023; originally announced October 2023.

  39. Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

    Authors: Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

    Abstract: With the advancement of telemedicine, both researchers and medical practitioners are working hand-in-hand to develop various techniques to automate various medical operations, such as diagnosis report generation. In this paper, we first present a multi-modal clinical conversation summary generation task that takes a clinician-patient interaction (both textual and visual information) and generates…

    Submitted 27 September, 2023; originally announced September 2023.

  40. arXiv:2309.05804  [pdf, other]

    cs.CL

    Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric

    Authors: Abhisek Tiwari, Muhammed Sinan, Kaushik Roy, Amit Sheth, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: Over the past two decades, dialogue modeling has made significant strides, moving from simple rule-based responses to personalized and persuasive response generation. However, despite these advancements, the objective functions and evaluation metrics for dialogue generation have remained stagnant. These lexical-based metrics, e.g., cross-entropy and BLEU, have two key limitations: (a) word-to-word…

    Submitted 29 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  41. arXiv:2308.07973  [pdf, other]

    cs.CL

    "Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing

    Authors: Sandeep Singamsetty, Nishtha Madaan, Sameep Mehta, Varad Bhatnagar, Pushpak Bhattacharyya

    Abstract: The prevalence of half-truths, which are statements containing some truth but that are ultimately deceptive, has risen with the increasing use of the internet. To help combat this problem, we have created a comprehensive pipeline consisting of a half-truth detection model and a claim editing model. Our approach utilizes the T5 model for controlled claim editing; "controlled" here means precise adj…

    Submitted 15 August, 2023; originally announced August 2023.

  42. arXiv:2308.03638  [pdf, other]

    cs.CL

    KITLM: Domain-Specific Knowledge InTegration into Language Models for Question Answering

    Authors: Ankush Agarwal, Sakharam Gawade, Amar Prakash Azad, Pushpak Bhattacharyya

    Abstract: Large language models (LLMs) have demonstrated remarkable performance in a wide range of natural language tasks. However, as these models continue to grow in size, they face significant challenges in terms of computational costs. Additionally, LLMs often lack efficient domain-specific understanding, which is particularly crucial in specialized fields such as aviation and healthcare. To boost the d…

    Submitted 7 August, 2023; originally announced August 2023.

  43. arXiv:2308.03150  [pdf, other

    cs.AI

    "We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations

    Authors: N V S Abhishek, Pushpak Bhattacharyya

    Abstract: Speech Emotion Recognition (SER) is the task of identifying the emotion expressed in a spoken utterance. Emotion recognition is essential in building robust conversational agents in domains such as law, healthcare, education, and customer support. Most of the studies published on SER use datasets created by employing professional actors in a noise-free environment. In natural settings such as a cu…

    Submitted 6 August, 2023; originally announced August 2023.

  44. arXiv:2308.03122  [pdf, other

    cs.CL

    "Kurosawa": A Script Writer's Assistant

    Authors: Prerak Gandhi, Vishal Pramanik, Pushpak Bhattacharyya

    Abstract: Storytelling is the lifeline of the entertainment industry -- movies, TV shows, and stand-up comedies, all need stories. A good and gripping script is the lifeline of storytelling and demands creativity and resource investment. Good scriptwriters are rare to find and often work under severe time pressure. Consequently, entertainment media are actively looking for automation. In this paper, we pres…

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 6 pages, 9 figures, 1 table

  45. arXiv:2308.02984  [pdf, other

    cs.IR

    Decision Knowledge Graphs: Construction of and Usage in Question Answering for Clinical Practice Guidelines

    Authors: Vasudhan Varma Kandula, Pushpak Bhattacharyya

    Abstract: In the medical domain, several disease treatment procedures have been documented properly as a set of instructions known as Clinical Practice Guidelines (CPGs). CPGs have been developed over the years on the basis of past treatments, and are updated frequently. A doctor treating a particular patient can use these CPGs to know how past patients with similar conditions were treated successfully and…

    Submitted 5 August, 2023; originally announced August 2023.

  46. Vacaspati: A Diverse Corpus of Bangla Literature

    Authors: Pramit Bhattacharyya, Joydeep Mondal, Subhadip Maji, Arnab Bhattacharya

    Abstract: Bangla (or Bengali) is the fifth most spoken language globally; yet, the state-of-the-art NLP in Bangla is lagging for even simple tasks such as lemmatization, POS tagging, etc. This is partly due to lack of a varied quality corpus. To alleviate this need, we build Vacaspati, a diverse corpus of Bangla literature. The literary works are collected from various websites; only those works that are pu…

    Submitted 11 July, 2023; originally announced July 2023.

    Report number: Accepted at IJCNLP-AACL 2023 main

  47. Anisotropic Coulomb exchange as source of Kitaev and off-diagonal symmetric anisotropic couplings

    Authors: Pritam Bhattacharyya, Thorben Petersen, Nikolay A. Bogdanov, Liviu Hozoi

    Abstract: Exchange underpins the magnetic properties of quantum matter. In its most basic form, it occurs through the interplay of Pauli's exclusion principle and Coulomb repulsion, being referred to as Coulomb exchange. Pauli's exclusion principle combined with inter-atomic electron hopping additionally leads to kinetic exchange and superexchange. Here we disentangle the different exchange channels in anis…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.00540, arXiv:2212.09365

    Journal ref: Commun. Phys. 7, 121 (2024)

  48. arXiv:2306.17180  [pdf, other

    cs.CL cs.AI cs.CV

    Replace and Report: NLP Assisted Radiology Report Generation

    Authors: Kaveri Kale, Pushpak Bhattacharyya, Kshitij Jadhav

    Abstract: Clinical practice frequently uses medical imaging for diagnosis and treatment. A significant challenge for automatic radiology report generation is that the radiology reports are long narratives consisting of multiple sentences for both abnormal and normal findings. Therefore, applying conventional image captioning approaches to generate the whole report proves to be insufficient, as these are des…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: The 61st Annual Meeting of the Association for Computational Linguistics

  49. arXiv:2306.06384  [pdf, other

    cs.CL

    Adversarial Training For Low-Resource Disfluency Correction

    Authors: Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

    Abstract: Disfluencies commonly occur in conversational speech. Speech with disfluencies can result in noisy Automatic Speech Recognition (ASR) transcripts, which affects downstream tasks like machine translation. In this paper, we propose an adversarially-trained sequence-tagging model for Disfluency Correction (DC) that utilizes a small amount of labeled real disfluent data in conjunction with a large amo…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted for Findings of ACL 2023

  50. arXiv:2306.03507  [pdf, other

    cs.CL

    "A Little is Enough": Few-Shot Quality Estimation based Corpus Filtering improves Machine Translation

    Authors: Akshay Batheja, Pushpak Bhattacharyya

    Abstract: Quality Estimation (QE) is the task of evaluating the quality of a translation when reference translation is not available. The goal of QE aligns with the task of corpus filtering, where we assign the quality score to the sentence pairs present in the pseudo-parallel corpus. We propose a Quality Estimation based Filtering approach to extract high-quality parallel data from the pseudo-parallel corp…

    Submitted 6 June, 2023; originally announced June 2023.