Skip to main content

Showing 1–8 of 8 results for author: Gajbhiye, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.14793  [pdf, other

    cs.CL cs.AI cs.LG

    What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies

    Authors: Amit Gajbhiye, Zied Bouraoui, Na Li, Usashi Chatterjee, Luis Espinosa Anke, Steven Schockaert

    Abstract: Concepts play a central role in many applications. This includes settings where concepts have to be modelled in the absence of sentence context. Previous work has therefore focused on distilling decontextualised concept embeddings from language models. But concepts can be modelled from different perspectives, whereas concept embeddings typically mostly capture taxonomic structure. To address this… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP 2023

  2. arXiv:2310.05481  [pdf, other

    cs.CL cs.AI

    Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces

    Authors: Usashi Chatterjee, Amit Gajbhiye, Steven Schockaert

    Abstract: The theory of Conceptual Spaces is an influential cognitive-linguistic framework for representing the meaning of concepts. Conceptual spaces are constructed from a set of quality dimensions, which essentially correspond to primitive perceptual features (e.g. hue or size). These quality dimensions are usually learned from human judgements, which means that applications of conceptual spaces tend to… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP 2023

  3. arXiv:2210.02771  [pdf, other

    cs.CL cs.AI cs.LG

    Modelling Commonsense Properties using Pre-Trained Bi-Encoders

    Authors: Amit Gajbhiye, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Gras** the commonsense properties of everyday concepts is an important prerequisite to language understanding. While contextualised language models are reportedly capable of predicting such commonsense properties with human-level accuracy, we argue that such results have been inflated because of the high similarity between training and test concepts. This means that models which capture concept… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: COLING 2022

  4. arXiv:2108.01589  [pdf, other

    cs.CL

    ExBERT: An External Knowledge Enhanced BERT for Natural Language Inference

    Authors: Amit Gajbhiye, Noura Al Moubayed, Steven Bradley

    Abstract: Neural language representation models such as BERT, pre-trained on large-scale unstructured corpora lack explicit grounding to real-world commonsense knowledge and are often unable to remember facts required for reasoning and inference. Natural Language Inference (NLI) is a challenging reasoning task that relies on common human understanding of language and real-world commonsense knowledge. We int… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  5. arXiv:2107.00411  [pdf, other

    cs.CL

    Knowledge Distillation for Quality Estimation

    Authors: Amit Gajbhiye, Marina Fomicheva, Fernando Alva-Manchego, Frédéric Blain, Abiola Obamuyide, Nikolaos Aletras, Lucia Specia

    Abstract: Quality Estimation (QE) is the task of automatically predicting Machine Translation quality in the absence of reference translations, making it applicable in real-time settings, such as translating online social media conversations. Recent success in QE stems from the use of multilingual pre-trained representations, where very large models lead to impressive results. However, the inference time, d… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: ACL Findings 2021

  6. Bilinear Fusion of Commonsense Knowledge with Attention-Based NLI Models

    Authors: Amit Gajbhiye, Thomas Winterbottom, Noura Al Moubayed, Steven Bradley

    Abstract: We consider the task of incorporating real-world commonsense knowledge into deep Natural Language Inference (NLI) models. Existing external knowledge incorporation methods are limited to lexical level knowledge and lack generalization across NLI models, datasets, and commonsense knowledge sources. To address these issues, we propose a novel NLI model-independent neural framework, BiCAM. BiCAM inco… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Published in Lecture Notes in Computer Science, Springer International Publishing

  7. An Exploration of Dropout with RNNs for Natural Language Inference

    Authors: Amit Gajbhiye, Sardar Jaf, Noura Al Moubayed, A. Stephen McGough, Steven Bradley

    Abstract: Dropout is a crucial regularization technique for the Recurrent Neural Network (RNN) models of Natural Language Inference (NLI). However, dropout has not been evaluated for the effectiveness at different layers and dropout rates in NLI models. In this paper, we propose a novel RNN model for NLI and empirically evaluate the effect of applying dropout at different layers in the model. We also invest… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: Accepted in International Conference on Artificial Neural Networks, 2018

  8. arXiv:1806.02397  [pdf

    cs.DC

    Resource Provisioning and Scheduling Algorithm for Meeting Cost and Deadline-Constraints of Scientific Workflows in IaaS Clouds

    Authors: Amit Gajbhiye, Shailendra Singh

    Abstract: Infrastructure as a Service model of cloud computing is a desirable platform for the execution of cost and deadline constrained workflow applications as the elasticity of cloud computing allows large-scale complex scientific workflow applications to scale dynamically according to their deadline requirements. However, scheduling of these multitask workflow jobs in a distributed computing environmen… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: 15 pages, 8 figures, This work is done in the year 2015 when the first author was part of NITTTR, Bhopal, India