Showing 1–13 of 13 results for author: Pal, K K

  1. arXiv:2307.01420  [pdf, other]

    cs.CL

    Modeling Tag Prediction based on Question Tagging Behavior Analysis of CommunityQA Platform Users

    Authors: Kuntal Kumar Pal, Michael Gamon, Nirupama Chandrasekaran, Silviu Cucerzan

    Abstract: In community question-answering platforms, tags play essential roles in effective information organization and retrieval, better question routing, faster response to questions, and assessment of topic popularity. Hence, automatic assistance for predicting and suggesting tags for posts is of high utility to users of such platforms. To develop better tag prediction across diverse communities and dom…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 20 pages

  2. arXiv:2305.16357  [pdf, other]

    cs.CL

    EDM3: Event Detection as Multi-task Text Generation

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral

    Abstract: Event detection refers to identifying event occurrences in a text and comprises two subtasks: event identification and classification. We present EDM3, a novel approach for Event Detection that formulates three generative tasks: identification, classification, and combined detection. We show that EDM3 helps to learn transferable knowledge that can be leveraged to perform Event Detection and its…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 10 tables, 5 Page appendix

  3. arXiv:2303.05400  [pdf, other]

    cs.CL cs.AI cs.CR

    Prompt-Based Learning for Thread Structure Prediction in Cybersecurity Forums

    Authors: Kazuaki Kashihara, Kuntal Kumar Pal, Chitta Baral, Robert P Trevino

    Abstract: With recent trends indicating cyber crimes increasing in both frequency and cost, it is imperative to develop new methods that leverage data-rich hacker forums to assist in combating ever evolving cyber threats. Defining interactions within these forums is critical as it facilitates identifying highly skilled users, which can improve prediction of novel threats and future cyber attacks. We propose…

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: 16 pages, 7 figures, submitted to IntelliSys 2023

  4. arXiv:2302.10346  [pdf, other]

    cs.CL cs.AI cs.CR

    Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain

    Authors: Kuntal Kumar Pal, Kazuaki Kashihara, Ujjwala Anantheswaran, Kirby C. Kuznia, Siddhesh Jagtap, Chitta Baral

    Abstract: With the increase in cybersecurity vulnerabilities of software systems, the ways to exploit them are also increasing. Besides these, malware threats, irregular network interactions, and discussions about exploits in public forums are also on the rise. To identify these threats faster, to detect potentially relevant entities from any texts, and to be aware of software vulnerabilities, automated app…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 8 pages

  5. arXiv:2210.07471  [pdf, other]

    cs.CL

    "John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility

    Authors: Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral

    Abstract: In current NLP research, large-scale language models and their abilities are widely being discussed. Some recent works have also found notable failures of these models. Often these failure examples involve complex reasoning abilities. This work focuses on a simple commonsense ability, reasoning about when an action (or its effect) is feasible. To this end, we introduce FeasibilityQA, a question-an…

    Submitted 2 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL 2023

  6. arXiv:2204.07705  [pdf, other]

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,…

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  7. arXiv:2109.04672  [pdf, other]

    cs.CL cs.AI cs.LG

    Investigating Numeracy Learning Ability of a Text-to-Text Transfer Model

    Authors: Kuntal Kumar Pal, Chitta Baral

    Abstract: The transformer-based pre-trained language models have been tremendously successful in most of the conventional NLP tasks. But they often struggle in those tasks where numerical understanding is required. Some possible reasons can be the tokenizers and pre-training objectives which are not specifically designed to learn and preserve numeracy. Here we investigate the ability of text-to-text transfe…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 7 pages, 10 figures, 5 tables, Accepted in the Findings of EMNLP 2021

  8. arXiv:2105.14357  [pdf, other]

    cs.CL cs.AI cs.CR

    Constructing Flow Graphs from Procedural Cybersecurity Texts

    Authors: Kuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang, Chitta Baral

    Abstract: Following procedural texts written in natural languages is challenging. We must read the whole text to identify the relevant information or identify the instruction flows to complete a task, which is prone to failures. If such texts are structured, we can readily visualize instruction-flows, reason or infer a particular step, or even build automated systems to help novice agents achieve a goal. Ho…

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: 13 pages, 5 pages, accepted in the Findings of ACL 2021

  9. arXiv:2103.12801  [pdf, other]

    cs.LG cs.CL cs.CR

    Variable Name Recovery in Decompiled Binary Code using Constrained Masked Language Modeling

    Authors: Pratyay Banerjee, Kuntal Kumar Pal, Fish Wang, Chitta Baral

    Abstract: Decompilation is the procedure of transforming binary programs into a high-level representation, such as source code, for human analysts to examine. While modern decompilers can reconstruct and recover much information that is discarded during compilation, inferring variable names is still extremely difficult. Inspired by recent advances in natural language processing, we propose a novel solution…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Work In Progress

  10. arXiv:2003.03446  [pdf, other]

    cs.CL cs.AI cs.LG

    Natural Language QA Approaches using Reasoning with External Knowledge

    Authors: Chitta Baral, Pratyay Banerjee, Kuntal Kumar Pal, Arindam Mitra

    Abstract: Question answering (QA) in natural language (NL) has been an important aspect of AI from its early days. Winograd's "councilmen" example in his 1972 paper and McCarthy's Mr. Hug example of 1976 highlight the role of external knowledge in NL understanding. While Machine Learning has been the go-to approach in NL processing as well as NL question answering (NLQA) for the last 30 years, recently t…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 6 pages, 3 figures, Work in Progress

  11. arXiv:1911.03869  [pdf, other]

    cs.CL cs.IR cs.LG

    Knowledge Guided Named Entity Recognition for BioMedical Text

    Authors: Pratyay Banerjee, Kuntal Kumar Pal, Murthy Devarakonda, Chitta Baral

    Abstract: In this work, we formulate the NER task as a multi-answer knowledge guided QA task (KGQA) which helps to predict entities only by assigning B, I and O tags without associating entity types with the tags. We provide different knowledge contexts, such as, entity types, questions, definitions and examples along with the text and train on a combined dataset of 18 biomedical corpora. This formulation (…

    Submitted 18 September, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: 6 pages, 2 figures, 5 tables, WIP

  12. arXiv:1909.08855  [pdf, other]

    cs.CL cs.IR cs.LG

    How Additional Knowledge can Improve Natural Language Commonsense Question Answering?

    Authors: Arindam Mitra, Pratyay Banerjee, Kuntal Kumar Pal, Swaroop Mishra, Chitta Baral

    Abstract: Recently several datasets have been proposed to encourage research in Question Answering domains where commonsense knowledge is expected to play an important role. Recent language models such as ROBERTA, BERT and GPT that have been pre-trained on Wikipedia articles and books have shown reasonable performance with little fine-tuning on several such Multiple Choice Question-Answering (MCQ) datasets.…

    Submitted 17 April, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

    Comments: 14 pages, 14 figures, 3 tables

  13. arXiv:1907.10738  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Careful Selection of Knowledge to solve Open Book Question Answering

    Authors: Pratyay Banerjee, Kuntal Kumar Pal, Arindam Mitra, Chitta Baral

    Abstract: Open book question answering is a type of natural language based QA (NLQA) where questions are expected to be answered with respect to a given set of open book facts, and common knowledge about a topic. Recently a challenge involving such QA, OpenBookQA, has been proposed. Unlike most other NLQA tasks that focus on linguistic understanding, OpenBookQA requires deeper reasoning involving linguistic…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: Accepted to ACL 2019