Skip to main content

Showing 1–50 of 200 results for author: Goyal, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00550  [pdf, other

    cs.NI cs.DC

    Challenging the Need for Packet Spraying in Large-Scale Distributed Training

    Authors: Vamsi Addanki, Prateesh Goyal, Ilias Marinos

    Abstract: Large-scale distributed training in production datacenters constitutes a challenging workload bottlenecked by network communication. In response, both major industry players (e.g., Ultra Ethernet Consortium) and parts of academia have surprisingly, and almost unanimously, agreed that packet spraying is necessary to improve the performance of large-scale distributed training workloads. In this pa… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.03986  [pdf, other

    cs.CL cs.IR

    On The Persona-based Summarization of Domain-Specific Documents

    Authors: Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku

    Abstract: In an ever-expanding world of domain-specific knowledge, the increasing complexity of consuming, and storing information necessitates the generation of summaries from large information repositories. However, every persona of a domain has different requirements of information and hence their summarization. For example, in the healthcare domain, a persona-based (such as Doctor, Nurse, Patient etc.)… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: ACL 2024 Findings (Association for Computational Linguistics)

  3. arXiv:2405.06671  [pdf, other

    cs.CL cs.CE cs.LG

    Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling

    Authors: Subhendu Khatuya, Rajdeep Mukherjee, Akash Ghosh, Manjunath Hegde, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

    Abstract: We study the problem of automatically annotating relevant numerals (GAAP metrics) occurring in the financial documents with their corresponding XBRL tags. Different from prior works, we investigate the feasibility of solving this extreme classification problem using a generative paradigm through instruction tuning of Large Language Models (LLMs). To this end, we leverage metric metadata informatio… ▽ More

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: This work has been accepted to appear at North American Chapter of the Association for Computational Linguistics (NAACL), 2024

  4. arXiv:2405.06669  [pdf, other

    cs.CL cs.CE cs.IR cs.LG

    Instruction-Guided Bullet Point Summarization of Long Financial Earnings Call Transcripts

    Authors: Subhendu Khatuya, Koushiki Sinha, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

    Abstract: While automatic summarization techniques have made significant advancements, their primary focus has been on summarizing short news articles or documents that have clear structural patterns like scientific articles or government reports. There has not been much exploration into develo** efficient methods for summarizing financial documents, which often contain complex facts and figures. Here, we… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted in SIGIR 2024

  5. arXiv:2405.04732  [pdf, other

    cs.RO cs.AI

    S-EQA: Tackling Situational Queries in Embodied Question Answering

    Authors: Vishnu Sashank Dorbala, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadhan

    Abstract: We present and tackle the problem of Embodied Question Answering (EQA) with Situational Queries (S-EQA) in a household environment. Unlike prior EQA work tackling simple queries that directly reference target objects and quantifiable properties pertaining them, EQA with situational queries (such as "Is the bathroom clean and dry?") is more challenging, as the agent needs to figure out not just wha… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 8 Pages

  6. arXiv:2405.04235  [pdf, other

    cs.RO cs.LG

    LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning

    Authors: Zeyu Feng, Hao Luan, Pranav Goyal, Harold Soh

    Abstract: Operating effectively in complex environments while complying with specified constraints is crucial for the safe and successful deployment of robots that interact with and operate around people. In this work, we focus on generating long-horizon trajectories that adhere to novel static and temporally-extended constraints/instructions at test time. We propose a data-driven diffusion-based framework,… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  7. arXiv:2404.17912  [pdf, other

    cs.CL cs.AI cs.LG

    SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models

    Authors: Manav Nitin Kapadnis, Sohan Patnaik, Abhilash Nandy, Sourjyadip Ray, Pawan Goyal, Debdoot Sheet

    Abstract: Radiology Report Generation (R2Gen) demonstrates how Multi-modal Large Language Models (MLLMs) can automate the creation of accurate and coherent radiological reports. Existing methods often hallucinate details in text-based reports that don't accurately reflect the image content. To mitigate this, we introduce a novel strategy, SERPENT-VLM (SElf Refining Radiology RePort GENeraTion using Vision L… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024

  8. arXiv:2404.08827  [pdf, other

    cs.RO cs.CV

    "Don't forget to put the milk back!" Dataset for Enabling Embodied Agents to Detect Anomalous Situations

    Authors: James F. Mullen Jr, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan

    Abstract: Home robots intend to make their users lives easier. Our work assists in this goal by enabling robots to inform their users of dangerous or unsanitary anomalies in their home. Some examples of these anomalies include the user leaving their milk out, forgetting to turn off the stove, or leaving poison accessible to children. To move towards enabling home robots with these abilities, we have created… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  9. arXiv:2404.04676  [pdf, other

    cs.CL

    Order-Based Pre-training Strategies for Procedural Text Understanding

    Authors: Abhilash Nandy, Yash Kulkarni, Pawan Goyal, Niloy Ganguly

    Abstract: In this paper, we propose sequence-based pretraining methods to enhance procedural understanding in natural language processing. Procedural text, containing sequential instructions to accomplish a task, is difficult to understand due to the changing attributes of entities in the context. We focus on recipes, which are commonly represented as ordered instructions, and use this order as a supervisio… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 8 pages (Accepted for publication at NAACL 2024 (Main Conference))

  10. arXiv:2404.03598  [pdf, other

    cs.CL

    Intent Detection and Entity Extraction from BioMedical Literature

    Authors: Ankan Mullick, Mukur Gupta, Pawan Goyal

    Abstract: Biomedical queries have become increasingly prevalent in web searches, reflecting the growing interest in accessing biomedical literature. Despite recent research on large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable. In this paper, we address th… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted to CL4Health LREC-COLING 2024

  11. arXiv:2404.00401  [pdf, other

    cs.CL

    How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

    Authors: Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

    Abstract: Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientif… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  12. arXiv:2403.04547  [pdf, other

    cs.LG cs.AI

    CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?

    Authors: Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner, Priya Goyal, Alexander D'Amour, Xiaohua Zhai

    Abstract: We study the effectiveness of data-balancing for mitigating biases in contrastive language-image pretraining (CLIP), identifying areas of strength and limitation. First, we reaffirm prior conclusions that CLIP models can inadvertently absorb societal stereotypes. To counter this, we present a novel algorithm, called Multi-Modal Moment Matching (M4), designed to reduce both representation and assoc… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 32 pages, 20 figures, 7 tables

    Journal ref: ICLR 2024

  13. arXiv:2403.00646  [pdf, other

    cs.LG math.DS math.OC

    Stability-Certified Learning of Control Systems with Quadratic Nonlinearities

    Authors: Igor Pontes Duff, Pawan Goyal, Peter Benner

    Abstract: This work primarily focuses on an operator inference methodology aimed at constructing low-dimensional dynamical models based on a priori hypotheses about their structure, often informed by established physics or expert insights. Stability is a fundamental attribute of dynamical systems, yet it is not always assured in models derived through inference. Our main objective is to develop a method tha… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures

  14. arXiv:2402.17698  [pdf, other

    math.NA cs.LG

    Learning reduced-order Quadratic-Linear models in Process Engineering using Operator Inference

    Authors: Ion Victor Gosea, Luisa Peterson, Pawan Goyal, Jens Bremer, Kai Sundmacher, Peter Benner

    Abstract: In this work, we address the challenge of efficiently modeling dynamical systems in process engineering. We use reduced-order model learning, specifically operator inference. This is a non-intrusive, data-driven method for learning dynamical systems from time-domain data. The application in our study is carbon dioxide methanation, an important reaction within the Power-to-X framework, to demonstra… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 10 pages, 3 figures

  15. arXiv:2402.16994  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    GEM3D: GEnerative Medial Abstractions for 3D Shape Synthesis

    Authors: Dmitry Petrov, Pradyumn Goyal, Vikas Thamizharasan, Vladimir G. Kim, Matheus Gadelha, Melinos Averkiou, Siddhartha Chaudhuri, Evangelos Kalogerakis

    Abstract: We introduce GEM3D -- a new deep, topology-aware generative model of 3D shapes. The key ingredient of our method is a neural skeleton-based representation encoding information on both shape topology and geometry. Through a denoising diffusion probabilistic model, our method first generates skeleton-based representations following the Medial Axis Transform (MAT), then generates surfaces through a s… ▽ More

    Submitted 10 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Webpage: https://lodurality.github.io/GEM3D/ -- Cond. accept. to SIGGRAPH 2024 (conf. track) -- Changes (based on reviews): changed style to sigconf; rearranged figures for readability; added missing citations; fixed misaligned centers in Fig. 3; added failure cases (Fig. 10); rewrote discussion; added categories averages to Tab. 8; added Tab. 10 with model capacities

  16. arXiv:2402.16986  [pdf, other

    cs.CL cs.IR

    Long Dialog Summarization: An Analysis

    Authors: Ankan Mullick, Ayan Kumar Bhowmick, Raghav R, Ravi Kokku, Prasenjit Dey, Pawan Goyal, Niloy Ganguly

    Abstract: Dialog summarization has become increasingly important in managing and comprehending large-scale conversations across various domains. This task presents unique challenges in capturing the key points, context, and nuances of multi-turn long conversations for summarization. It is worth noting that the summarization techniques may vary based on specific requirements such as in a shop**-chatbot sce… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  17. MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction

    Authors: Ankan Mullick, Akash Ghosh, G Sai Chaitanya, Samir Ghui, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal

    Abstract: Material science literature is a rich source of factual information about various categories of entities (like materials and compositions) and various relations between these entities, such as conductivity, voltage, etc. Automatically extracting this information to generate a material science knowledge base is a challenging task. In this paper, we propose MatSciRE (Material Science Relation Extrac… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Journal ref: Computational Material Science 2023 (Elsevier)

  18. arXiv:2312.17254  [pdf, other

    cs.CL

    Faithful Model Evaluation for Model-Based Metrics

    Authors: Palash Goyal, Qian Hu, Rahul Gupta

    Abstract: Statistical significance testing is used in natural language processing (NLP) to determine whether the results of a study or experiment are likely to be due to chance or if they reflect a genuine relationship. A key step in significance testing is the estimation of confidence interval which is a function of sample variance. Sample variance calculation is straightforward when evaluating against gro… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  19. arXiv:2312.11779  [pdf, other

    cs.CL cs.AI cs.LG

    Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies

    Authors: Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta

    Abstract: Gender-inclusive NLP research has documented the harmful limitations of gender binary-centric large language models (LLM), such as the inability to correctly use gender-diverse English neopronouns (e.g., xe, zir, fae). While data scarcity is a known culprit, the precise mechanisms through which scarcity affects this behavior remain underexplored. We discover LLM misgendering is significantly influ… ▽ More

    Submitted 6 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to NAACL 2024 findings

  20. arXiv:2312.05671  [pdf, other

    cs.CL

    Hate Speech and Offensive Content Detection in Indo-Aryan Languages: A Battle of LSTM and Transformers

    Authors: Nikhil Narayan, Mrutyunjay Biswal, Pramod Goyal, Abhranta Panigrahi

    Abstract: Social media platforms serve as accessible outlets for individuals to express their thoughts and experiences, resulting in an influx of user-generated data spanning all age groups. While these platforms enable free expression, they also present significant challenges, including the proliferation of hate speech and offensive content. Such objectionable language disrupts objective discourse and can… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 14 pages, 3 figures. Accepted Working Notes at HASOC-FIRE 2023, to be published in CEUR Working Notes of FIRE

  21. arXiv:2311.09473  [pdf, other

    cs.AI cs.CL

    JAB: Joint Adversarial Prompting and Belief Augmentation

    Authors: Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Jwala Dhamala, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

    Abstract: With the recent surge of language models in different applications, attention to safety and robustness of these models has gained significant importance. Here we introduce a joint framework in which we simultaneously probe and improve the robustness of a black-box target model via adversarial prompting and belief augmentation using iterative feedback loops. This framework utilizes an automated red… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  22. arXiv:2311.04978  [pdf, other

    cs.CL

    On the steerability of large language models toward data-driven personas

    Authors: Junyi Li, Ninareh Mehrabi, Charith Peris, Palash Goyal, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

    Abstract: Large language models (LLMs) are known to generate biased responses where the opinions of certain groups and populations are underrepresented. Here, we present a novel approach to achieve controllable generation of specific viewpoints using LLMs, that can be leveraged to produce multiple perspectives and to reflect the diverse opinions. Moving beyond the traditional reliance on demographics like a… ▽ More

    Submitted 2 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  23. arXiv:2310.20274  [pdf, other

    cs.IR cs.CL cs.LG

    Extracting Entities of Interest from Comparative Product Reviews

    Authors: Jatin Arora, Sumit Agrawal, Pawan Goyal, Sayan Pathak

    Abstract: This paper presents a deep learning based approach to extract product comparison information out of user reviews on various e-commerce websites. Any comparative product review has three major entities of information: the names of the products being compared, the user opinion (predicate) and the feature or aspect under comparison. All these informing entities are dependent on each other and bound b… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Source Code: https://github.com/jatinarora2702/Review-Information-Extraction

    ACM Class: I.2.7; H.3.3

    Journal ref: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Pages 1975 - 1978

  24. arXiv:2310.15577  [pdf, other

    cs.CL cs.AI

    CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction

    Authors: Rajdeep Mukherjee, Nithish Kannen, Saurabh Kumar Pandey, Pawan Goyal

    Abstract: Existing works on Aspect Sentiment Triplet Extraction (ASTE) explicitly focus on develo** more efficient fine-tuning techniques for the task. Instead, our motivation is to come up with a generic approach that can improve the downstream performances of multiple ABSA tasks simultaneously. Towards this, we present CONTRASTE, a novel pre-training strategy using CONTRastive learning to enhance the AS… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted as a Long Paper at EMNLP 2023 (Findings); 16 pages; Codes: https://github.com/nitkannen/CONTRASTE/

    ACM Class: I.2.7

  25. arXiv:2310.14326  [pdf, other

    cs.CL cs.AI

    CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text

    Authors: Abhilash Nandy, Manav Nitin Kapadnis, Pawan Goyal, Niloy Ganguly

    Abstract: In this paper, we propose CLMSM, a domain-specific, continual pre-training framework, that learns from a large set of procedural recipes. CLMSM uses a Multi-Task Learning Framework to optimize two objectives - a) Contrastive Learning using hard triplets to learn fine-grained differences across entities in the procedures, and b) a novel Mask-Step Modelling objective to learn step-wise context of a… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023, 14 pages, 4 figures

  26. arXiv:2310.09501  [pdf, other

    cs.CL

    DepNeCTI: Dependency-based Nested Compound Type Identification for Sanskrit

    Authors: Jivnesh Sandhan, Yaswanth Narsupalli, Sreevatsa Muppirala, Sriram Krishnan, Pavankumar Satuluri, Amba Kulkarni, Pawan Goyal

    Abstract: Multi-component compounding is a prevalent phenomenon in Sanskrit, and understanding the implicit structure of a compound's components is crucial for deciphering its meaning. Earlier approaches in Sanskrit have focused on binary compounds and neglected the multi-component compound setting. This work introduces the novel task of nested compound type identification (NeCTI), which aims to identify ne… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 9 Pages, Camera-ready version accepted at EMNLP23 (Findings)

  27. arXiv:2309.10811  [pdf, other

    cs.DL cs.CL

    Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

    Authors: Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

    Abstract: Interdisciplinarity has over the recent years have gained tremendous importance and has become one of the key ways of doing cutting edge research. In this paper we attempt to model the citation flow across three different fields -- Physics (PHY), Mathematics (MA) and Computer Science (CS). For instance, is there a specific pattern in which these fields cite one another? We carry out experiments on… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted at Journal of Physics: Complexity

  28. arXiv:2309.07193  [pdf, other

    math.DS cs.LG

    A Robust SINDy Approach by Combining Neural Networks and an Integral Form

    Authors: Ali Forootani, Pawan Goyal, Peter Benner

    Abstract: The discovery of governing equations from data has been an active field of research for decades. One widely used methodology for this purpose is sparse regression for nonlinear dynamics, known as SINDy. Despite several attempts, noisy and scarce data still pose a severe challenge to the success of the SINDy approach. In this work, we discuss a robust method to discover nonlinear governing equation… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  29. arXiv:2308.13835  [pdf, other

    cs.LG cs.AI math.DS

    Deep Learning for Structure-Preserving Universal Stable Koopman-Inspired Embeddings for Nonlinear Canonical Hamiltonian Dynamics

    Authors: Pawan Goyal, Süleyman Yıldız, Peter Benner

    Abstract: Discovering a suitable coordinate transformation for nonlinear systems enables the construction of simpler models, facilitating prediction, control, and optimization for complex nonlinear systems. To that end, Koopman operator theory offers a framework for global linearization for nonlinear systems, thereby allowing the usage of linear tools for design studies. In this work, we focus on the identi… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  30. arXiv:2308.13819  [pdf, other

    cs.LG math.DS math.NA

    Guaranteed Stable Quadratic Models and their applications in SINDy and Operator Inference

    Authors: Pawan Goyal, Igor Pontes Duff, Peter Benner

    Abstract: Scientific machine learning for inferring dynamical systems combines data-driven modeling, physics-based modeling, and empirical knowledge. It plays an essential role in engineering design and digital twinning. In this work, we primarily focus on an operator inference methodology that builds dynamical models, preferably in low-dimension, with a prior hypothesis on the model structure, often determ… ▽ More

    Submitted 7 January, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

  31. arXiv:2308.07081  [pdf, other

    cs.CL

    Aesthetics of Sanskrit Poetry from the Perspective of Computational Linguistics: A Case Study Analysis on Siksastaka

    Authors: Jivnesh Sandhan, Amruta Barbadikar, Malay Maity, Pavankumar Satuluri, Tushar Sandhan, Ravi M. Gupta, Pawan Goyal, Laxmidhar Behera

    Abstract: Sanskrit poetry has played a significant role in sha** the literary and cultural landscape of the Indian subcontinent for centuries. However, not much attention has been devoted to uncovering the hidden beauty of Sanskrit poetry in computational linguistics. This article explores the intersection of Sanskrit poetry and computational linguistics by proposing a roadmap of an interpretable framewor… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 15 pages

  32. arXiv:2308.05221  [pdf, other

    cs.HC cs.AI cs.RO

    Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI

    Authors: Hangjie Shi, Leslie Ball, Govind Thattai, Desheng Zhang, Lucy Hu, Qiaozi Gao, Suhaila Shakiah, Xiaofeng Gao, Aishwarya Padmakumar, Bofei Yang, Cadence Chung, Dinakar Guthy, Gaurav Sukhatme, Karthika Arumugam, Matthew Wen, Osman Ipek, Patrick Lange, Rohan Khanna, Shreyas Pansare, Vasu Sharma, Chao Zhang, Cris Flagg, Daniel Pressel, Lavina Vaz, Luke Dai , et al. (17 additional authors not shown)

    Abstract: The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge. As conversational agents increasingly appear in multimodal and embodied contexts, it is important to explore the affordances of conversational interaction augmented wi… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  33. arXiv:2308.04265  [pdf, other

    cs.AI

    FLIRT: Feedback Loop In-context Red Teaming

    Authors: Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

    Abstract: Warning: this paper contains content that may be inappropriate or offensive. As generative models become available for public use in various applications, testing and analyzing vulnerabilities of these models has become a priority. Here we propose an automatic red teaming framework that evaluates a given model and exposes its vulnerabilities against unsafe and inappropriate content generation. O… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  34. arXiv:2308.02562   

    cs.CV cs.AI cs.CY cs.LG

    Food Classification using Joint Representation of Visual and Textual Data

    Authors: Prateek Mittal, Puneet Goyal, Joohi Chauhan

    Abstract: Food classification is an important task in health care. In this work, we propose a multimodal classification framework that uses the modified version of EfficientNet with the Mish activation function for image classification, and the traditional BERT transformer-based network is used for text classification. The proposed network and the other state-of-the-art methods are evaluated on a large open… ▽ More

    Submitted 30 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Updated results and discussions to be posted and some sections needed to be expanded

  35. arXiv:2308.01084  [pdf, other

    cs.LG

    Data-Driven Identification of Quadratic Representations for Nonlinear Hamiltonian Systems using Weakly Symplectic Liftings

    Authors: Süleyman Yildiz, Pawan Goyal, Thomas Bendokat, Peter Benner

    Abstract: We present a framework for learning Hamiltonian systems using data. This work is based on a lifting hypothesis, which posits that nonlinear Hamiltonian systems can be written as nonlinear systems with cubic Hamiltonians. By leveraging this, we obtain quadratic dynamics that are Hamiltonian in a transformed coordinate system. To that end, for given generalized position and momentum data, we propose… ▽ More

    Submitted 8 February, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  36. arXiv:2307.05700  [pdf, other

    cs.CV eess.IV

    SepHRNet: Generating High-Resolution Crop Maps from Remote Sensing imagery using HRNet with Separable Convolution

    Authors: Priyanka Goyal, Sohan Patnaik, Adway Mitra, Manjira Sinha

    Abstract: The accurate map** of crop production is crucial for ensuring food security, effective resource management, and sustainable agricultural practices. One way to achieve this is by analyzing high-resolution satellite imagery. Deep Learning has been successful in analyzing images, including remote sensing imagery. However, capturing intricate crop patterns is challenging due to their complexity and… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  37. arXiv:2307.05390  [pdf, other

    cond-mat.mtrl-sci cs.LG

    CrysMMNet: Multimodal Representation for Crystal Property Prediction

    Authors: Kishalay Das, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly

    Abstract: Machine Learning models have emerged as a powerful tool for fast and accurate prediction of different crystalline properties. Exiting state-of-the-art models rely on a single modality of crystal data i.e. crystal graph structure, where they construct multi-graph by establishing edges between nearby atoms in 3D space and apply GNN to learn materials representation. Thereby, they encode local chemic… ▽ More

    Submitted 9 June, 2023; originally announced July 2023.

    Comments: 14 pages, 4 fifures

  38. arXiv:2306.06190  [pdf, other

    cs.CL cs.LG

    $FastDoc$: Domain-Specific Fast Pre-training Technique using Document-Level Metadata and Taxonomy

    Authors: Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, Yash Parag Butala, Pawan Goyal, Niloy Ganguly

    Abstract: As the demand for sophisticated Natural Language Processing (NLP) models continues to grow, so does the need for efficient pre-training techniques. Current NLP models undergo resource-intensive pre-training. In response, we introduce $FastDoc$ (Fast Pre-training Technique using Document-Level Metadata and Taxonomy), a novel approach designed to significantly reduce computational demands.… ▽ More

    Submitted 14 November, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 38 pages, 7 figures

    MSC Class: 68T50 ACM Class: I.2.7

  39. arXiv:2306.03736  [pdf, ps, other

    cs.CL

    FinRED: A Dataset for Relation Extraction in Financial Domain

    Authors: Soumya Sharma, Tapas Nayak, Arusarka Bose, Ajay Kumar Meena, Koustuv Dasgupta, Niloy Ganguly, Pawan Goyal

    Abstract: Relation extraction models trained on a source domain cannot be applied on a different target domain due to the mismatch between relation sets. In the current literature, there is no extensive open-source relation extraction dataset specific to the finance domain. In this paper, we release FinRED, a relation extraction dataset curated from financial news and earning call transcripts containing rel… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted at FinWeb at WWW'22

  40. arXiv:2306.03723  [pdf, other

    cs.CL cs.AI cs.CE

    Financial Numeric Extreme Labelling: A Dataset and Benchmarking for XBRL Tagging

    Authors: Soumya Sharma, Subhendu Khatuya, Manjunath Hegde, Afreen Shaikh. Koustuv Dasgupta, Pawan Goyal, Niloy Ganguly

    Abstract: The U.S. Securities and Exchange Commission (SEC) mandates all public companies to file periodic financial statements that should contain numerals annotated with a particular label from a taxonomy. In this paper, we formulate the task of automating the assignment of a label to a particular numeral span in a sentence from an extremely large label set. Towards this task, we release a dataset, Financ… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL'23 Findings Paper

  41. arXiv:2305.14919  [pdf, other

    cs.CL

    Frugal Prompting for Dialog Models

    Authors: Bishal Santra, Sakya Basak, Abhinandan De, Manish Gupta, Pawan Goyal

    Abstract: The use of large language models (LLMs) in natural language processing (NLP) tasks is rapidly increasing, leading to changes in how researchers approach problems in the field. To fully utilize these models' abilities, a better understanding of their behavior for different input protocols is required. With LLMs, users can directly interact with the models through a text-based interface to define an… ▽ More

    Submitted 5 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint, To appear in EMNLP 2023 (Findings); First two authors have equal contribution

  42. arXiv:2305.09941  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    "I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

    Authors: Anaelia Ovalle, Palash Goyal, Jwala Dhamala, Zachary Jaggers, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

    Abstract: Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life. Given the recent popularity and adoption of language generation technologies, the potential to further marginalize this population only grows. Although a multitude of NLP fairness literature focuses on illuminating and addressing gender biases, assessing gender harms for TGNB i… ▽ More

    Submitted 1 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    ACM Class: I.2; I.7; K.4

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  43. arXiv:2303.16139  [pdf, other

    cs.NI

    DBO: Response Time Fairness for Cloud-Hosted Financial Exchanges

    Authors: Prateesh Goyal, Eashan Gupta, Ilias Marinos, Chenxingyu Zhao, Radhika Mittal, Ranveer Chandra

    Abstract: In this paper, we consider the problem of hosting financial exchanges in the cloud. Financial exchanges require predictable, equal latency to all market participants to ensure fairness for various tasks, such as high speed trading. However, it is extremely difficult to ensure equal latency to all market participants in existing cloud deployments, because of various reasons, such as congestion, and… ▽ More

    Submitted 29 March, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  44. arXiv:2302.09685  [pdf, other

    cs.IR cs.CL

    Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages

    Authors: Ankan Mullick, Ishani Mondal, Sourjyadip Ray, R Raghav, G Sai Chaitanya, Pawan Goyal

    Abstract: Scarcity of data and technological limitations for resource-poor languages in develo** countries like India poses a threat to the development of sophisticated NLU systems for healthcare. To assess the current status of various state-of-the-art language models in healthcare, this paper studies the problem by initially proposing two different Healthcare datasets, Indian Healthcare Query Intent-Web… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Journal ref: EACL 2023 Findings Full Paper

  45. arXiv:2302.09527  [pdf, other

    cs.CL

    SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes

    Authors: Jivnesh Sandhan, Anshul Agarwal, Laxmidhar Behera, Tushar Sandhan, Pawan Goyal

    Abstract: We present a neural Sanskrit Natural Language Processing (NLP) toolkit named SanskritShala (a school of Sanskrit) to facilitate computational linguistic analyses for several tasks such as word segmentation, morphological tagging, dependency parsing, and compound type identification. Our systems currently report state-of-the-art performance on available benchmark datasets for all tasks. SanskritSha… ▽ More

    Submitted 29 May, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: 7 pages, Accepted at ACL23 (Demo track) to be held at Toronto, Canada

  46. arXiv:2301.10060  [pdf, other

    cs.LG math.DS

    Inference of Continuous Linear Systems from Data with Guaranteed Stability

    Authors: Pawan Goyal, Igor Pontes Duff, Peter Benner

    Abstract: Machine-learning technologies for learning dynamical systems from data play an important role in engineering design. This research focuses on learning continuous linear models from data. Stability, a key feature of dynamic systems, is especially important in design tasks such as prediction and control. Thus, there is a need to develop methodologies that provide stability guarantees. To that end, w… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  47. arXiv:2301.09770  [pdf, other

    cs.AI

    Language-guided Task Adaptation for Imitation Learning

    Authors: Prasoon Goyal, Raymond J. Mooney, Scott Niekum

    Abstract: We introduce a novel setting, wherein an agent needs to learn a task from a demonstration of a related task with the difference between the tasks communicated in natural language. The proposed setting allows reusing demonstrations from other tasks, by providing low effort language descriptions, and can also be used to provide feedback to correct agent errors, which are both important desiderata fo… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  48. arXiv:2301.05852  [pdf, other

    cs.LG cond-mat.mtrl-sci

    CrysGNN : Distilling pre-trained knowledge to enhance property prediction for crystalline materials

    Authors: Kishalay Das, Bidisha Samanta, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly

    Abstract: In recent years, graph neural network (GNN) based approaches have emerged as a powerful technique to encode complex topological structure of crystal materials in an enriched representation space. These models are often supervised in nature and using the property-specific training data, learn relationship between crystal structure and different properties like formation energy, bandgap, bulk modulu… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 16 Pages,5 figures

  49. arXiv:2211.12503  [pdf, other

    cs.CL cs.CV cs.LG cs.MM

    Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

    Authors: Ninareh Mehrabi, Palash Goyal, Apurv Verma, Jwala Dhamala, Varun Kumar, Qian Hu, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Rahul Gupta

    Abstract: Natural language often contains ambiguities that can lead to misinterpretation and miscommunication. While humans can handle ambiguities effectively by asking clarifying questions and/or relying on contextual cues and common-sense knowledge, resolving ambiguities can be notoriously hard for machines. In this work, we study ambiguities that arise in text-to-image generative models. We curate a benc… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  50. arXiv:2211.00357  [pdf, other

    math.DS cs.LG

    Generalized Quadratic Embeddings for Nonlinear Dynamics using Deep Learning

    Authors: Pawan Goyal, Peter Benner

    Abstract: The engineering design process often relies on mathematical modeling that can describe the underlying dynamic behavior. In this work, we present a data-driven methodology for modeling the dynamics of nonlinear systems. To simplify this task, we aim to identify a coordinate transformation that allows us to represent the dynamics of nonlinear systems using a common, simple model structure. The advan… ▽ More

    Submitted 4 January, 2024; v1 submitted 1 November, 2022; originally announced November 2022.