Showing 1–24 of 24 results for author: Aralikatte, R

  1. arXiv:2403.12596  [pdf, other]

    cs.CL

    Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

    Authors: Victor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma

    Abstract: Vision-language models (VLMs) are achieving increasingly strong performance on multimodal tasks. However, reasoning capabilities remain limited particularly for smaller VLMs, while those of large-language models (LLMs) have seen numerous improvements. We propose a technique to transfer capabilities from LLMs to VLMs. On the recently introduced ChartQA, our method obtains state-of-the-art performan…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Findings of NAACL 2024

  2. arXiv:2310.08394  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Better Evaluation of Instruction-Following: A Case-Study in Summarization

    Authors: Ondrej Skopek, Rahul Aralikatte, Sian Gooding, Victor Carbune

    Abstract: Despite recent advances, evaluating how well large language models (LLMs) follow user instructions remains an open problem. While evaluation methods of language models have seen a rise in prompt-based approaches, limited work on the correctness of these methods has been conducted. In this work, we perform a meta-evaluation of a variety of metrics to quantify how accurately they measure the instruc…

    Submitted 20 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: CoNLL 2023 camera-ready version

  3. arXiv:2305.05858  [pdf, other]

    cs.CL

    Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages

    Authors: Rahul Aralikatte, Ziling Cheng, Sumanth Doddapaneni, Jackie Chi Kit Cheung

    Abstract: We present Vārta, a large-scale multilingual dataset for headline generation in Indic languages. This dataset includes 41.8 million news articles in 14 different Indic languages (and English), which come from a variety of high-quality sources. To the best of our knowledge, this is the largest collection of curated articles for Indic languages currently available. We use the data collected in a ser…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  4. arXiv:2212.05409  [pdf, other]

    cs.CL

    Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages

    Authors: Sumanth Doddapaneni, Rahul Aralikatte, Gowtham Ramesh, Shreya Goyal, Mitesh M. Khapra, Anoop Kunchukuttan, Pratyush Kumar

    Abstract: Building Natural Language Understanding (NLU) capabilities for Indic languages, which have a collective speaker base of more than one billion speakers is absolutely crucial. In this work, we aim to improve the NLU capabilities of Indic languages by making contributions along 3 important axes (i) monolingual corpora (ii) NLU testsets (iii) multilingual LLMs focusing on Indic languages. Specifically…

    Submitted 24 May, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  5. arXiv:2108.03509  [pdf]

    cs.CL

    Compositional Generalization in Multilingual Semantic Parsing over Wikidata

    Authors: Ruixiang Cui, Rahul Aralikatte, Heather Lent, Daniel Hershcovich

    Abstract: Semantic parsing (SP) allows humans to leverage vast knowledge resources through natural interaction. However, parsers are mostly designed for and evaluated on English resources, such as CFQ (Keysers et al., 2020), the current standard benchmark based on English data generated from grammar rules and oriented towards Freebase, an outdated knowledge base. We propose a method for creating a multiling…

    Submitted 31 May, 2022; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted to TACL; Authors' final version, pre-MIT Press publication; Previous title: Multilingual Compositional Wikidata Questions

  6. arXiv:2106.03269  [pdf, other]

    cs.CL

    Itihasa: A large-scale corpus for Sanskrit to English translation

    Authors: Rahul Aralikatte, Miryam de Lhoneux, Anoop Kunchukuttan, Anders Søgaard

    Abstract: This work introduces Itihasa, a large-scale translation dataset containing 93,000 pairs of Sanskrit shlokas and their English translations. The shlokas are extracted from two Indian epics viz., The Ramayana and The Mahabharata. We first describe the motivation behind the curation of such a dataset and follow up with empirical analysis to bring out its nuances. We then benchmark the performance of…

    Submitted 5 October, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Fixed typo

  7. arXiv:2106.01051  [pdf, other]

    cs.CL

    Minimax and Neyman-Pearson Meta-Learning for Outlier Languages

    Authors: Edoardo Maria Ponti, Rahul Aralikatte, Disha Shrivastava, Siva Reddy, Anders Søgaard

    Abstract: Model-agnostic meta-learning (MAML) has been recently put forth as a strategy to learn resource-poor languages in a sample-efficient fashion. Nevertheless, the properties of these languages are often not well represented by those available during training. Hence, we argue that the i.i.d. assumption ingrained in MAML makes it ill-suited for cross-lingual NLP. In fact, under a decision-theoretic fra…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  8. arXiv:2105.11921  [pdf, other]

    cs.CL

    Focus Attention: Promoting Faithfulness and Diversity in Summarization

    Authors: Rahul Aralikatte, Shashi Narayan, Joshua Maynez, Sascha Rothe, Ryan McDonald

    Abstract: Professional summaries are written with document-level information, such as the theme of the document, in mind. This is in contrast with most seq2seq decoders which simultaneously learn to focus on salient content, while deciding what to generate, at each decoding step. With the motivation to narrow this gap, we introduce Focus Attention Mechanism, a simple yet effective method to encourage decode…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  9. arXiv:2010.05567  [pdf, other]

    cs.CL

    Joint Semantic Analysis with Document-Level Cross-Task Coherence Rewards

    Authors: Rahul Aralikatte, Mostafa Abdou, Heather Lent, Daniel Hershcovich, Anders Søgaard

    Abstract: Coreference resolution and semantic role labeling are NLP tasks that capture different aspects of semantics, indicating respectively, which expressions refer to the same entity, and what semantic roles expressions serve in the sentence. However, they are often closely interdependent, and both generally necessitate natural language understanding. Do they form a coherent abstract representation of d…

    Submitted 12 October, 2020; originally announced October 2020.

  10. arXiv:1909.04402  [pdf, other]

    cs.LG cs.CL cs.CV stat.ML

    Compositional Generalization in Image Captioning

    Authors: Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Aralikatte, Desmond Elliott

    Abstract: Image captioning models are usually evaluated on their ability to describe a held-out set of images, not on their ability to generalize to unseen concepts. We study the problem of compositional generalization, which measures how well a model composes unseen combinations of concepts when describing images. State-of-the-art image captioning models show poor generalization performance on this task. W…

    Submitted 16 September, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: To appear at CoNLL 2019, EMNLP

    Journal ref: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pp. 87–98, ACL, 2019

  11. arXiv:1909.02392  [pdf, other]

    cs.CL

    Rewarding Coreference Resolvers for Being Consistent with World Knowledge

    Authors: Rahul Aralikatte, Heather Lent, Ana Valeria Gonzalez, Daniel Hershcovich, Chen Qiu, Anders Sandholm, Michael Ringaard, Anders Søgaard

    Abstract: Unresolved coreference is a bottleneck for relation extraction, and high-quality coreference resolvers may produce an output that makes it a lot easier to extract knowledge triples. We show how to improve coreference resolvers by forwarding their input to a relation extraction system and reward the resolvers for producing triples that are found in knowledge bases. Since relation extraction systems…

    Submitted 11 November, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: To appear in EMNLP 2019 (with corrected Fig. 2)

  12. arXiv:1908.11141  [pdf, other]

    cs.CL

    Ellipsis Resolution as Question Answering: An Evaluation

    Authors: Rahul Aralikatte, Matthew Lamm, Daniel Hardt, Anders Søgaard

    Abstract: Most, if not all forms of ellipsis (e.g., so does Mary) are similar to reading comprehension questions (what does Mary do), in that in order to resolve them, we need to identify an appropriate text span in the preceding discourse. Following this observation, we present an alternative approach for English ellipsis resolution relying on architectures developed for question answering (QA). We present…

    Submitted 19 January, 2021; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: To appear in EACL 2021

  13. arXiv:1908.05111  [pdf, other]

    cs.CL

    X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension

    Authors: Mostafa Abdou, Cezar Sas, Rahul Aralikatte, Isabelle Augenstein, Anders Søgaard

    Abstract: Although the vast majority of knowledge bases KBs are heavily biased towards English, Wikipedias do cover very different topics in different languages. Exploiting this, we introduce a new multilingual dataset (X-WikiRE), framing relation extraction as a multilingual machine reading problem. We show that by leveraging this resource it is possible to robustly transfer models cross-lingually and that…

    Submitted 15 August, 2019; v1 submitted 14 August, 2019; originally announced August 2019.

  14. arXiv:1906.10724  [pdf, other]

    cs.CL

    Model-based annotation of coreference

    Authors: Rahul Aralikatte, Anders Søgaard

    Abstract: Humans do not make inferences over texts, but over models of what texts are about. When annotators are asked to annotate coreferent spans of text, it is therefore a somewhat unnatural task. This paper presents an alternative in which we preprocess documents, linking entities to a knowledge base, and turn the coreference annotation task -- in our case limited to pronouns -- into an annotation task…

    Submitted 1 March, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: To appear in LREC 2020

  15. arXiv:1905.02486  [pdf, other]

    cs.HC cs.LG

    A Visual Programming Paradigm for Abstract Deep Learning Model Development

    Authors: Srikanth Tamilselvam, Naveen Panwar, Shreya Khare, Rahul Aralikatte, Anush Sankaran, Senthil Mani

    Abstract: Deep learning is one of the fastest growing technologies in computer science with a plethora of applications. But this unprecedented growth has so far been limited to the consumption of deep learning experts. The primary challenge being a steep learning curve for learning the programming libraries and the lack of intuitive systems enabling non-experts to consume deep learning. Towards this goal, w…

    Submitted 19 August, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

  16. arXiv:1811.01312  [pdf, other]

    cs.CR cs.LG cs.NE

    Adversarial Black-Box Attacks on Automatic Speech Recognition Systems using Multi-Objective Evolutionary Optimization

    Authors: Shreya Khare, Rahul Aralikatte, Senthil Mani

    Abstract: Fooling deep neural networks with adversarial input have exposed a significant vulnerability in the current state-of-the-art systems in multiple domains. Both black-box and white-box approaches have been used to either replicate the model itself or to craft examples which cause the model to fail. In this work, we propose a framework which uses multi-objective evolutionary optimization to perform b…

    Submitted 3 July, 2019; v1 submitted 3 November, 2018; originally announced November 2018.

    Comments: Published in Interspeech 2019

  17. arXiv:1804.07927  [pdf, other]

    cs.CL

    DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension

    Authors: Amrita Saha, Rahul Aralikatte, Mitesh M. Khapra, Karthik Sankaranarayanan

    Abstract: We propose DuoRC, a novel dataset for Reading Comprehension (RC) that motivates several new challenges for neural approaches in language understanding beyond those offered by existing RC datasets. DuoRC contains 186,089 unique question-answer pairs created from a collection of 7680 pairs of movie plots where each pair in the collection reflects two versions of the same movie - one from Wikipedia a…

    Submitted 10 October, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: Accepted in ACL 2018

  18. arXiv:1801.01275  [pdf, other]

    cs.SE cs.LG

    DeepTriage: Exploring the Effectiveness of Deep Learning for Bug Triaging

    Authors: Senthil Mani, Anush Sankaran, Rahul Aralikatte

    Abstract: For a given software bug report, identifying an appropriate developer who could potentially fix the bug is the primary task of a bug triaging process. A bug title (summary) and a detailed description is present in most of the bug tracking systems. Automatic bug triaging algorithm can be formulated as a classification problem, with the bug title and description as the input, mapping it to one of th…

    Submitted 4 January, 2018; originally announced January 2018.

  19. arXiv:1801.00428  [pdf, other]

    cs.CL

    Sanskrit Sandhi Splitting using seq2(seq)^2

    Authors: Rahul Aralikatte, Neelamadhav Gantayat, Naveen Panwar, Anush Sankaran, Senthil Mani

    Abstract: In Sanskrit, small words (morphemes) are combined to form compound words through a process known as Sandhi. Sandhi splitting is the process of splitting a given compound word into its constituent morphemes. Although rules governing word splitting exists in the language, it is highly challenging to identify the location of the splits in a compound word. Though existing Sandhi splitting systems inco…

    Submitted 15 July, 2019; v1 submitted 1 January, 2018; originally announced January 2018.

    Comments: Accepted in EMNLP 2018

  20. arXiv:1711.02012  [pdf, other]

    cs.CL cs.AI

    Hi, how can I help you?: Automating enterprise IT support help desks

    Authors: Senthil Mani, Neelamadhav Gantayat, Rahul Aralikatte, Monika Gupta, Sampath Dechu, Anush Sankaran, Shreya Khare, Barry Mitchell, Hemamalini Subramanian, Hema Venkatarangan

    Abstract: Question answering is one of the primary challenges of natural language understanding. In realizing such a system, providing complex long answers to questions is a challenging task as opposed to factoid answering as the former needs context disambiguation. The different methods explored in the literature can be broadly classified into three categories namely: 1) classification based, 2) knowledge…

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: To appear in IAAI 2018

  21. Fault in your stars: An Analysis of Android App Reviews

    Authors: Rahul Aralikatte, Giriprasad Sridhara, Neelamadhav Gantayat, Senthil Mani

    Abstract: Mobile app distribution platforms such as Google Play Store allow users to share their feedback about downloaded apps in the form of a review comment and a corresponding star rating. Typically, the star rating ranges from one to five stars, with one star denoting a high sense of dissatisfaction with the app and five stars denoting a high sense of satisfaction. Unfortunately, due to a variety of…

    Submitted 11 August, 2018; v1 submitted 16 August, 2017; originally announced August 2017.

    Comments: Accepted in CoDS-COMAD 2018. Preprint

  22. arXiv:1708.04923  [pdf, other]

    cs.CL cs.LG

    mAnI: Movie Amalgamation using Neural Imitation

    Authors: Naveen Panwar, Shreya Khare, Neelamadhav Gantayat, Rahul Aralikatte, Senthil Mani, Anush Sankaran

    Abstract: Cross-modal data retrieval has been the basis of various creative tasks performed by Artificial Intelligence (AI). One such highly challenging task for AI is to convert a book into its corresponding movie, which most of the creative film makers do as of today. In this research, we take the first step towards it by visualizing the content of a book using its corresponding movie visuals. Given a set…

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Accepted in ML4Creativity workshop in KDD 2017. Preprint

  23. DARVIZ: Deep Abstract Representation, Visualization, and Verification of Deep Learning Models

    Authors: Anush Sankaran, Rahul Aralikatte, Senthil Mani, Shreya Khare, Naveen Panwar, Neelamadhav Gantayat

    Abstract: Traditional software engineering programming paradigms are mostly object or procedure oriented, driven by deterministic algorithms. With the advent of deep learning and cognitive sciences there is an emerging trend for data-driven programming, creating a shift in the programming paradigm among the software engineering communities. Visualizing and interpreting the execution of a current large scale…

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Accepted in ICSE NIER 2017. Preprint

  24. arXiv:1603.09051  [pdf, other]

    cs.AI cs.NE

    Phoenix: A Self-Optimizing Chess Engine

    Authors: Rahul Aralikatte, G Srinivasaraghavan

    Abstract: Since the advent of computers, many tasks which required humans to spend a lot of time and energy have been trivialized by the computers' ability to perform repetitive tasks extremely quickly. Playing chess is one such task. It was one of the first games which was 'solved' using AI. With the advent of deep learning, chess playing agents can surpass human ability with relative ease. However algorit…

    Submitted 20 August, 2017; v1 submitted 30 March, 2016; originally announced March 2016.

    Comments: Accepted in CICN 2015. Preprint