Skip to main content

Showing 1–17 of 17 results for author: Mullick, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03986  [pdf, other

    cs.CL cs.IR

    On The Persona-based Summarization of Domain-Specific Documents

    Authors: Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku

    Abstract: In an ever-expanding world of domain-specific knowledge, the increasing complexity of consuming, and storing information necessitates the generation of summaries from large information repositories. However, every persona of a domain has different requirements of information and hence their summarization. For example, in the healthcare domain, a persona-based (such as Doctor, Nurse, Patient etc.)… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: ACL 2024 Findings (Association for Computational Linguistics)

  2. arXiv:2405.03513  [pdf, other

    cs.CR cs.CE

    QBER: Quantifying Cyber Risks for Strategic Decisions

    Authors: Muriel Figueredo Franco, Aiatur Rahaman Mullick, Santosh Jha

    Abstract: Quantifying cyber risks is essential for organizations to grasp their vulnerability to threats and make informed decisions. However, current approaches still need to work on blending economic viewpoints to provide insightful analysis. To bridge this gap, we introduce QBER approach to offer decision-makers measurable risk metrics. The QBER evaluates losses from cyberattacks, performs detailed risk… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, 9 equations, 3 tables, 2 figures

  3. arXiv:2404.03598  [pdf, other

    cs.CL

    Intent Detection and Entity Extraction from BioMedical Literature

    Authors: Ankan Mullick, Mukur Gupta, Pawan Goyal

    Abstract: Biomedical queries have become increasingly prevalent in web searches, reflecting the growing interest in accessing biomedical literature. Despite recent research on large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable. In this paper, we address th… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted to CL4Health LREC-COLING 2024

  4. arXiv:2402.16986  [pdf, other

    cs.CL cs.IR

    Long Dialog Summarization: An Analysis

    Authors: Ankan Mullick, Ayan Kumar Bhowmick, Raghav R, Ravi Kokku, Prasenjit Dey, Pawan Goyal, Niloy Ganguly

    Abstract: Dialog summarization has become increasingly important in managing and comprehending large-scale conversations across various domains. This task presents unique challenges in capturing the key points, context, and nuances of multi-turn long conversations for summarization. It is worth noting that the summarization techniques may vary based on specific requirements such as in a shop**-chatbot sce… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  5. MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction

    Authors: Ankan Mullick, Akash Ghosh, G Sai Chaitanya, Samir Ghui, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal

    Abstract: Material science literature is a rich source of factual information about various categories of entities (like materials and compositions) and various relations between these entities, such as conductivity, voltage, etc. Automatically extracting this information to generate a material science knowledge base is a challenging task. In this paper, we propose MatSciRE (Material Science Relation Extrac… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Journal ref: Computational Material Science 2023 (Elsevier)

  6. arXiv:2304.11058  [pdf, ps, other

    cs.CL cs.IR

    Novel Intent Detection and Active Learning Based Classification (Student Abstract)

    Authors: Ankan Mullick

    Abstract: Novel intent class detection is an important problem in real world scenario for conversational agents for continuous interaction. Several research works have been done to detect novel intents in a mono-lingual (primarily English) texts and images. But, current systems lack an end-to-end universal framework to detect novel intents across various different languages with less human annotation effort… ▽ More

    Submitted 22 February, 2023; originally announced April 2023.

    Comments: AAAI 2023 Student Abstract

  7. arXiv:2302.09685  [pdf, other

    cs.IR cs.CL

    Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages

    Authors: Ankan Mullick, Ishani Mondal, Sourjyadip Ray, R Raghav, G Sai Chaitanya, Pawan Goyal

    Abstract: Scarcity of data and technological limitations for resource-poor languages in develo** countries like India poses a threat to the development of sophisticated NLU systems for healthcare. To assess the current status of various state-of-the-art language models in healthcare, this paper studies the problem by initially proposing two different Healthcare datasets, Indian Healthcare Query Intent-Web… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Journal ref: EACL 2023 Findings Full Paper

  8. arXiv:2209.02881  [pdf, other

    eess.IV cs.LG

    Improving Self-supervised Learning for Out-of-distribution Task via Auxiliary Classifier

    Authors: Harshita Boonlia, Tanmoy Dam, Md Meftahul Ferdaus, Sreenatha G. Anavatti, Ankan Mullick

    Abstract: In real world scenarios, out-of-distribution (OOD) datasets may have a large distributional shift from training datasets. This phenomena generally occurs when a trained classifier is deployed on varying dynamic environments, which causes a significant drop in performance. To tackle this issue, we are proposing an end-to-end deep multi-task network in this work. Observing a strong relationship betw… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: The shorter version is accepted at the 29th IEEE International Conference on Image Processing (IEEE ICIP 2022)

  9. arXiv:2205.08478  [pdf, other

    cs.CL cs.IR cs.LG

    An Evaluation Framework for Legal Document Summarization

    Authors: Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav, Roshni Kar

    Abstract: A law practitioner has to go through numerous lengthy legal case proceedings for their practices of various categories, such as land dispute, corruption, etc. Hence, it is important to summarize these documents, and ensure that summaries contain phrases with intent matching the category of the case. To the best of our knowledge, there is no evaluation metric that evaluates a summary based on its i… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 7 pages, 7 figures, 5 tables, To appear in LREC 2022

  10. arXiv:2205.03509  [pdf, other

    cs.CL cs.IR cs.LG

    Fine-grained Intent Classification in the Legal Domain

    Authors: Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav

    Abstract: A law practitioner has to go through a lot of long legal case proceedings. To understand the motivation behind the actions of different parties/individuals in a legal case, it is essential that the parts of the document that express an intent corresponding to the case be clearly understood. In this paper, we introduce a dataset of 93 legal documents, belonging to the case categories of either Murd… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: 4 pages, 7 tables, 1 figure, appeared in the AAAI-22 workshop on Scientific Document Understanding

  11. arXiv:2205.02005  [pdf, other

    cs.CL cs.AI

    A Framework to Generate High-Quality Datapoints for Multiple Novel Intent Detection

    Authors: Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly

    Abstract: Systems like Voice-command based conversational agents are characterized by a pre-defined set of skills or intents to perform user specified tasks. In the course of time, newer intents may emerge requiring retraining. However, the newer intents may not be explicitly announced and need to be inferred dynamically. Thus, there are two important tasks at hand (a). identifying emerging new intents, (b)… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted as Full Paper at Findings of NAACL, 2022

  12. arXiv:2108.08184  [pdf, other

    cs.CL

    RTE: A Tool for Annotating Relation Triplets from Text

    Authors: Ankan Mullick, Animesh Bera, Tapas Nayak

    Abstract: In this work, we present a Web-based annotation tool `Relation Triplets Extractor' \footnote{https://abera87.github.io/annotate/} (RTE) for annotating relation triplets from the text. Relation extraction is an important task for extracting structured information about real-world entities from the unstructured text available on the Web. In relation extraction, we focus on binary relation that refer… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

  13. arXiv:2105.11412  [pdf, other

    cs.CL cs.AI

    Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation

    Authors: Kiran Purohit, Owais Iqbal, Ankan Mullick

    Abstract: The presented report evaluates Contextualizing Hate Speech Classifiers with Post-hoc Explanation paper within the scope of ML Reproducibility Challenge 2020. Our work focuses on both aspects constituting the paper: the method itself and the validity of the stated results. In the following sections, we have described the paper, related works, algorithmic frameworks, our experiments and evaluations.

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: 10 pages

  14. arXiv:2009.06819  [pdf, other

    cs.CL

    MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature

    Authors: Souradip Guha, Ankan Mullick, Jatin Agrawal, Swetarekha Ram, Samir Ghui, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal

    Abstract: The number of published articles in the field of materials science is growing rapidly every year. This comparatively unstructured data source, which contains a large amount of information, has a restriction on its re-usability, as the information needed to carry out further calculations using the data in it must be extracted manually. It is very important to obtain valid and contextually correct i… ▽ More

    Submitted 22 January, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: 13 pages, 8 figures, Accepted for publication in Computational Material Science

    Journal ref: Computational Material Science, 2021

  15. arXiv:1902.07946  [pdf, other

    cs.IR

    Public Sphere 2.0: Targeted Commenting in Online News Media

    Authors: Ankan Mullick, Sayan Ghosh, Ritam Dutt, Avijit Ghosh, Abhijnan Chakraborty

    Abstract: With the increase in online news consumption, to maximize advertisement revenue, news media websites try to attract and retain their readers on their sites. One of the most effective tools for reader engagement is commenting, where news readers post their views as comments against the news articles. Traditionally, it has been assumed that the comments are mostly made against the full article. In t… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: Accepted at ECIR 2019

  16. arXiv:1805.10774  [pdf, other

    cs.CY

    Understanding Psycholinguistic Behavior of predominant drunk texters in Social Media

    Authors: Suman Kalyan Maity, Ankan Mullick, Surjya Ghosh, Anil Kumar, Sunny Dhamnani, Sudhanshu Bahety, Animesh Mukherjee

    Abstract: In the last decade, social media has evolved as one of the leading platform to create, share, or exchange information; it is commonly used as a way for individuals to maintain social connections. In this online digital world, people use to post texts or pictures to express their views socially and create user-user engagement through discussions and conversations. Thus, social media has established… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: 6 pages, 8 Figures, ISCC 2018 Workshops - ICTS4eHealth 2018

  17. Understanding Book Popularity on Goodreads

    Authors: Suman Kalyan Maity, Ayush Kumar, Ankan Mullick, Vishnu Choudhary, Animesh Mukherjee

    Abstract: Goodreads has launched the Readers Choice Awards since 2009 where users are able to nominate/vote books of their choice, released in the given year. In this work, we question if the number of votes that a book would receive (aka the popularity of the book) can be predicted based on the characteristics of various entities on Goodreads. We are successful in predicting the popularity of the books wit… ▽ More

    Submitted 14 February, 2018; originally announced February 2018.

    Comments: 5 pages, 4 Tables, GROUP '18