Skip to main content

Showing 1–11 of 11 results for author: Chowdhury, M F M

.
  1. arXiv:2210.13952  [pdf, other

    cs.CL cs.AI cs.IR

    KnowGL: Knowledge Generation and Linking from Text

    Authors: Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Nandana Mihindukulasooriya, Owen Cornec, Alfio Massimiliano Gliozzo

    Abstract: We propose KnowGL, a tool that allows converting text into structured relational data represented as a set of ABox assertions compliant with the TBox of a given Knowledge Graph (KG), such as Wikidata. We address this problem as a sequence generation task by leveraging pre-trained sequence-to-sequence language models, e.g. BART. Given a sentence, we fine-tune such models to detect pairs of entity m… ▽ More

    Submitted 22 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: AAAI-23 Demo Track

  2. arXiv:2207.06300  [pdf, other

    cs.CL cs.AI cs.IR

    Re2G: Retrieve, Rerank, Generate

    Authors: Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Rajaram Naik, Pengshan Cai, Alfio Gliozzo

    Abstract: As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. Th… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at NAACL 2022

  3. arXiv:2207.05188  [pdf, other

    cs.AI cs.CL cs.IR

    Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

    Authors: Nandana Mihindukulasooriya, Mike Sava, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Irene Yachbes, Aditya Gidh, Jillian Duckwitz, Kovit Nisar, Michael Santos, Alfio Gliozzo

    Abstract: A research division plays an important role of driving innovation in an organization. Drawing insights, following trends, kee** abreast of new research, and formulating strategies are increasingly becoming more challenging for both researchers and executives as the amount of information grows in both velocity and volume. In this paper we present a use case of how a corporate research community,… ▽ More

    Submitted 15 September, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted at ISWC 2022

    MSC Class: 68T01; 68T30 ACM Class: I.2.7; I.2.4; H.5

  4. arXiv:2204.03985  [pdf, other

    cs.CL cs.AI cs.LG

    KGI: An Integrated Framework for Knowledge Intensive Language Tasks

    Authors: Md Faisal Mahbub Chowdhury, Michael Glass, Gaetano Rossiello, Alfio Gliozzo, Nandana Mihindukulasooriya

    Abstract: In this paper, we present a system to showcase the capabilities of the latest state-of-the-art retrieval augmented generation models trained on knowledge-intensive language tasks, such as slot filling, open domain question answering, dialogue, and fact-checking. Moreover, given a user query, we show how the output from these different models can be combined to cross-examine the outputs of each oth… ▽ More

    Submitted 21 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: EMNLP 2022 Demo Track

  5. arXiv:2201.05302  [pdf, other

    cs.CL cs.AI

    Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

    Authors: Md Faisal Mahbub Chowdhury, Gaetano Rossiello, Michael Glass, Nandana Mihindukulasooriya, Alfio Gliozzo

    Abstract: In recent years, a number of keyphrase generation (KPG) approaches were proposed consisting of complex model architectures, dedicated training paradigms and decoding strategies. In this work, we opt for simplicity and show how a commonly used seq2seq language model, BART, can be easily adapted to generate keyphrases from the text in a single batch computation using a simple training procedure. Emp… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  6. arXiv:2108.13934  [pdf, other

    cs.CL cs.AI cs.IR

    Robust Retrieval Augmented Generation for Zero-shot Slot Filling

    Authors: Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Alfio Gliozzo

    Abstract: Automatically inducing high quality knowledge graphs from a given collection of documents still remains a challenging problem in AI. One way to make headway for this problem is through advancements in a related task known as slot filling. In this task, given an entity query in form of [Entity, Slot, ?], a system is asked to fill the slot by generating or extracting the missing value exploiting evi… ▽ More

    Submitted 13 September, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted at EMNLP 2021. arXiv admin note: substantial text overlap with arXiv:2104.08610

  7. arXiv:2011.03722  [pdf, other

    cs.AI cs.CL cs.LG cs.NE

    Template Controllable keywords-to-text Generation

    Authors: Abhijit Mishra, Md Faisal Mahbub Chowdhury, Sagar Manohar, Dan Gutfreund, Karthik Sankaranarayanan

    Abstract: This paper proposes a novel neural model for the understudied task of generating text from keywords. The model takes as input a set of un-ordered keywords, and part-of-speech (POS) based template instructions. This makes it ideal for surface realization in any NLG setup. The framework is based on the encode-attend-decode paradigm, where keywords and templates are encoded first, and the decoder jud… ▽ More

    Submitted 7 November, 2020; originally announced November 2020.

  8. arXiv:1909.10572  [pdf, other

    cs.AI cs.CL cs.LG

    Hypernym Detection Using Strict Partial Order Networks

    Authors: Sarthak Dash, Md Faisal Mahbub Chowdhury, Alfio Gliozzo, Nandana Mihindukulasooriya, Nicolas Rodolfo Fauceglia

    Abstract: This paper introduces Strict Partial Order Networks (SPON), a novel neural network architecture designed to enforce asymmetry and transitive properties as soft constraints. We apply it to induce hypernymy relations by training with is-a pairs. We also present an augmented variant of SPON that can generalize type information learned for in-vocabulary terms to previously unseen ones. An extensive ev… ▽ More

    Submitted 22 November, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 8 pages

    Journal ref: AAAI 2020

  9. arXiv:1905.09761  [pdf, other

    cs.DS cs.CL cs.IR

    An Efficient Approach for Super and Nested Term Indexing and Retrieval

    Authors: Md Faisal Mahbub Chowdhury, Robert Farrell

    Abstract: This paper describes a new approach, called Terminological Bucket Indexing (TBI), for efficient indexing and retrieval of both nested and super terms using a single method. We propose a hybrid data structure for facilitating faster indexing building. An evaluation of our approach with respect to widely used existing approaches on several publicly available dataset is provided. Compared to Trie bas… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: 6 pages including references

  10. arXiv:1804.08057  [pdf, ps, other

    cs.CL cs.IR

    A Study on Passage Re-ranking in Embedding based Unsupervised Semantic Search

    Authors: Md Faisal Mahbub Chowdhury, Vijil Chenthamarakshan, Rishav Chakravarti, Alfio M. Gliozzo

    Abstract: State of the art approaches for (embedding based) unsupervised semantic search exploits either compositional similarity (of a query and a passage) or pair-wise word (or term) similarity (from the query and the passage). By design, word based approaches do not incorporate similarity in the larger context (query/passage), while compositional similarity based approaches are usually unable to take adv… ▽ More

    Submitted 13 March, 2019; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: Fixed latex compiling issues

  11. arXiv:1709.08074  [pdf, other

    cs.CL

    Language Independent Acquisition of Abbreviations

    Authors: Michael R. Glass, Md Faisal Mahbub Chowdhury, Alfio M. Gliozzo

    Abstract: This paper addresses automatic extraction of abbreviations (encompassing acronyms and initialisms) and corresponding long-form expansions from plain unstructured text. We create and are going to release a multilingual resource for abbreviations and their corresponding expansions, built automatically by exploiting Wikipedia redirect and disambiguation pages, that can be used as a benchmark for eval… ▽ More

    Submitted 23 September, 2017; originally announced September 2017.

    Comments: 9 pages, 7 figues, 2 tables