Skip to main content

Showing 1–12 of 12 results for author: Kabra, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.09553  [pdf, other

    cs.AI

    Program-Aided Reasoners (better) Know What They Know

    Authors: Anubha Kabra, Sanketh Rangreji, Yash Mathur, Aman Madaan, Emmy Liu, Graham Neubig

    Abstract: Prior work shows that program-aided reasoning, in which large language models (LLMs) are combined with programs written in programming languages such as Python, can significantly improve accuracy on various reasoning tasks. However, while accuracy is essential, it is also important for such reasoners to "know what they know", which can be quantified through the calibration of the model. In this pa… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2311.09538  [pdf, other

    cs.CL cs.HC

    Reducing Privacy Risks in Online Self-Disclosures with Language Models

    Authors: Yao Dou, Isadora Krsek, Tarek Naous, Anubha Kabra, Sauvik Das, Alan Ritter, Wei Xu

    Abstract: Self-disclosure, while being common and rewarding in social media interaction, also poses privacy risks. In this paper, we take the initiative to protect the user-side privacy associated with online self-disclosure through detection and abstraction. We develop a taxonomy of 19 self-disclosure categories and curate a large corpus consisting of 4.8K annotated disclosure spans. We then fine-tune a la… ▽ More

    Submitted 23 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at ACL 2024

  3. CESAR: Control Envelope Synthesis via Angelic Refinements

    Authors: Aditi Kabra, Jonathan Laurent, Stefan Mitsch, André Platzer

    Abstract: This paper presents an approach for synthesizing provably correct control envelopes for hybrid systems. Control envelopes characterize families of safe controllers and are used to monitor untrusted controllers at runtime. Our algorithm fills in the blanks of a hybrid system's sketch specifying the desired shape of the control envelope, the possible control actions, and the system's differential eq… ▽ More

    Submitted 4 April, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

  4. arXiv:2310.15113  [pdf

    cs.CL

    Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

    Authors: Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schütze, Kemal Oflazer, David R. Mortensen

    Abstract: Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities of the latest generation of LLMs, and those studies that do exist (i) ignore the remarkable ability of humans to generalize, (ii) focus only on English, and (i… ▽ More

    Submitted 26 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  5. arXiv:2305.16171  [pdf

    cs.CL

    Multi-lingual and Multi-cultural Figurative Language Understanding

    Authors: Anubha Kabra, Emmy Liu, Simran Khanuja, Alham Fikri Aji, Genta Indra Winata, Samuel Cahyawijaya, Anuoluwapo Aremu, Perez Ogayo, Graham Neubig

    Abstract: Figurative language permeates human communication, but at the same time is relatively understudied in NLP. Datasets have been created in English to accelerate progress towards measuring and improving figurative language processing in language models (LMs). However, the use of figurative language is an expression of our cultural and societal experiences, making it difficult for these phrases to be… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Findings

  6. arXiv:2305.14208  [pdf, other

    cs.CL cs.LG

    Domain Private Transformers for Multi-Domain Dialog Systems

    Authors: Anmol Kabra, Ethan R. Elenberg

    Abstract: Large, general purpose language models have demonstrated impressive performance across many different conversational domains. While multi-domain language models achieve low overall perplexity, their outputs are not guaranteed to stay within the domain of a given input prompt. This paper proposes domain privacy as a novel way to quantify how likely a conditional language model will leak across doma… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of EMNLP 2023 (short paper). Code available at https://github.com/asappresearch/domain-private-transformers

  7. arXiv:2112.14195  [pdf, other

    cs.LG stat.ML

    Exponential Family Model-Based Reinforcement Learning via Score Matching

    Authors: Gene Li, Junbo Li, Anmol Kabra, Nathan Srebro, Zhaoran Wang, Zhuoran Yang

    Abstract: We propose an optimistic model-based algorithm, dubbed SMRL, for finite-horizon episodic reinforcement learning (RL) when the transition model is specified by exponential family distributions with $d$ parameters and the reward is bounded and known. SMRL uses score matching, an unnormalized density estimation technique that enables efficient estimation of the model parameter by ridge regression. Un… ▽ More

    Submitted 8 January, 2023; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: NeurIPS 2022

  8. arXiv:2110.09393  [pdf

    cs.CL cs.AI

    Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language

    Authors: Arushi Sharma, Anubha Kabra, Minni Jain

    Abstract: Social media has become a bedrock for people to voice their opinions worldwide. Due to the greater sense of freedom with the anonymity feature, it is possible to disregard social etiquette online and attack others without facing severe consequences, inevitably propagating hate speech. The current measures to sift the online content and offset the hatred spread do not go far enough. One factor cont… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Accepted in Elsevier Journal of Information Processing and Management. Sharma and Kabra made equal contribution

  9. arXiv:2012.02291  [pdf, other

    cs.IR cs.AI cs.LG

    Cluster Based Deep Contextual Reinforcement Learning for top-k Recommendations

    Authors: Anubha Kabra, Anu Agarwal, Anil Singh Parihar

    Abstract: Rapid advancements in the E-commerce sector over the last few decades have led to an imminent need for personalised, efficient and dynamic recommendation systems. To sufficiently cater to this need, we propose a novel method for generating top-k recommendations by creating an ensemble of clustering with reinforcement learning. We have incorporated DB Scan clustering to tackle vast item space, henc… ▽ More

    Submitted 29 November, 2020; originally announced December 2020.

    Comments: To be published in : Springer Lecture Notes in Networks and Systems ISSN 2367-3370

  10. arXiv:2009.01571  [pdf, other

    cs.LG stat.ML

    MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

    Authors: Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji K

    Abstract: Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem. Such imbalanced datasets are standard in real-world situations such as fraud detection, medical diagnosis, and computational advertising. We propose an iterative data augmentation method, MixBoost, which intelligently selects (Boost) and then combines (Mix) ins… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Work done as part of internship at MDSR

  11. arXiv:2008.06890  [pdf, other

    cs.NI cs.CR

    Efficient, Flexible and Secure Group Key Management Protocol for Dynamic IoT Settings

    Authors: Adhirath Kabra, Sumit Kumar, Gaurav S. Kasbekar

    Abstract: Many Internet of Things (IoT) scenarios require communication to and data acquisition from multiple devices with similar functionalities. For such scenarios, group communication in the form of multicasting and broadcasting has proven to be effective. Group Key Management (GKM) involves the handling, revocation, updation and distribution of cryptographic keys to members of various groups. Classical… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

  12. arXiv:2007.06796  [pdf, other

    cs.CL cs.AI

    Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring Systems

    Authors: Anubha Kabra, Mehar Bhatia, Yaman Kumar, Junyi Jessy Li, Rajiv Ratn Shah

    Abstract: Automatic scoring engines have been used for scoring approximately fifteen million test-takers in just the last three years. This number is increasing further due to COVID-19 and the associated automation of education and testing. Despite such wide usage, the AI-based testing literature of these "intelligent" models is highly lacking. Most of the papers proposing new models rely only on quadratic… ▽ More

    Submitted 14 November, 2021; v1 submitted 13 July, 2020; originally announced July 2020.