Skip to main content

Showing 1–3 of 3 results for author: Kannen, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.15577  [pdf, other

    cs.CL cs.AI

    CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction

    Authors: Rajdeep Mukherjee, Nithish Kannen, Saurabh Kumar Pandey, Pawan Goyal

    Abstract: Existing works on Aspect Sentiment Triplet Extraction (ASTE) explicitly focus on develo** more efficient fine-tuning techniques for the task. Instead, our motivation is to come up with a generic approach that can improve the downstream performances of multiple ABSA tasks simultaneously. Towards this, we present CONTRASTE, a novel pre-training strategy using CONTRastive learning to enhance the AS… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted as a Long Paper at EMNLP 2023 (Findings); 16 pages; Codes: https://github.com/nitkannen/CONTRASTE/

    ACM Class: I.2.7

  2. arXiv:2203.11054  [pdf, other

    cs.CL cs.AI

    Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases

    Authors: Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) systems have the goal of answering complex natural language questions by reasoning over relevant facts retrieved from Knowledge Bases (KB). One of the major challenges faced by these systems is their inability to retrieve all relevant facts due to factors such as incomplete KB and entity/relation linking errors. In this paper, we address this particular cha… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    ACM Class: I.2.7; I.2.4

  3. arXiv:2112.13237  [pdf, other

    cs.CL cs.AI cs.IR

    CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction

    Authors: Nithish Kannen, Divyanshu Sheth, Abhranil Chandra, Shubhraneel Pal

    Abstract: Acronyms and long-forms are commonly found in research documents, more so in documents from scientific and legal domains. Many acronyms used in such documents are domain-specific and are very rarely found in normal text corpora. Owing to this, transformer-based NLP models often detect OOV (Out of Vocabulary) for acronym tokens, especially for non-English languages, and their performance suffers wh… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.