Skip to main content

Showing 1–25 of 25 results for author: Sankaranarayanan, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.08194  [pdf

    cond-mat.mtrl-sci cs.CV cs.LG

    Machine learning for classifying and interpreting coherent X-ray speckle patterns

    Authors: Mingren Shen, Dina Sheyfer, Troy David Loeffler, Subramanian K. R. S. Sankaranarayanan, G. Brian Stephenson, Maria K. Y. Chan, Dane Morgan

    Abstract: Speckle patterns produced by coherent X-ray have a close relationship with the internal structure of materials but quantitative inversion of the relationship to determine structure from speckle patterns is challenging. Here, we investigate the link between coherent X-ray speckle patterns and sample structures using a model 2D disk system and explore the ability of machine learning to learn aspects… ▽ More

    Submitted 1 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  2. arXiv:2109.10534  [pdf, other

    cs.CL

    Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages

    Authors: Tejas Indulal Dhamecha, Rudra Murthy V, Samarth Bharadwaj, Karthik Sankaranarayanan, Pushpak Bhattacharyya

    Abstract: We explore the impact of leveraging the relatedness of languages that belong to the same family in NLP models using multilingual fine-tuning. We hypothesize and validate that multilingual fine-tuning of pre-trained language models can yield better performance on downstream NLP applications, compared to models fine-tuned on individual languages. A first of its kind detailed study is presented to tr… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted in EMNLP 2021

  3. arXiv:2109.07377  [pdf, other

    cs.CL cs.AI

    Topic Transferable Table Question Answering

    Authors: Saneem Ahmed Chemmengath, Vishwajeet Kumar, Samarth Bharadwaj, Jaydeep Sen, Mustafa Canim, Soumen Chakrabarti, Alfio Gliozzo, Karthik Sankaranarayanan

    Abstract: Weakly-supervised table question-answering(TableQA) models have achieved state-of-art performance by using pre-trained BERT transformer to jointly encoding a question and a table to produce structured query for the question. However, in practical settings TableQA systems are deployed over table corpora having topic and word distributions quite distinct from BERT's pretraining corpus. In this work… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: To appear at EMNLP 2021

  4. Representation based meta-learning for few-shot spoken intent recognition

    Authors: Ashish Mittal, Samarth Bharadwaj, Shreya Khare, Saneem Chemmengath, Karthik Sankaranarayanan, Brian Kingsbury

    Abstract: Spoken intent detection has become a popular approach to interface with various smart devices with ease. However, such systems are limited to the preset list of intents-terms or commands, which restricts the quick customization of personal devices to new intents. This paper presents a few-shot spoken intent classification approach with task-agnostic representations via meta-learning paradigm. Spec… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: Accepted paper at Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October, 2020

  5. arXiv:2106.12944  [pdf, other

    cs.CL cs.AI

    AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry

    Authors: Yannis Katsis, Saneem Chemmengath, Vishwajeet Kumar, Samarth Bharadwaj, Mustafa Canim, Michael Glass, Alfio Gliozzo, Feifei Pan, Jaydeep Sen, Karthik Sankaranarayanan, Soumen Chakrabarti

    Abstract: Recent advances in transformers have enabled Table Question Answering (Table QA) systems to achieve high accuracy and SOTA results on open domain datasets like WikiTableQuestions and WikiSQL. Such transformers are frequently pre-trained on open-domain content such as Wikipedia, where they effectively encode questions and corresponding tables from Wikipedia as seen in Table QA dataset. However, web… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  6. Multilingual and code-switching ASR challenges for low resource Indian languages

    Authors: Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan, Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, Tejaswi Seeram, Basil Abraham

    Abstract: Recently, there is increasing interest in multilingual automatic speech recognition (ASR) where a speech recognition system caters to multiple low resource languages by taking advantage of low amounts of labeled corpora in multiple languages. With multilingualism becoming common in today's world, there has been increasing interest in code-switching ASR as well. In code-switching, multiple language… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 6 pages

  7. arXiv:2011.03722  [pdf, other

    cs.AI cs.CL cs.LG cs.NE

    Template Controllable keywords-to-text Generation

    Authors: Abhijit Mishra, Md Faisal Mahbub Chowdhury, Sagar Manohar, Dan Gutfreund, Karthik Sankaranarayanan

    Abstract: This paper proposes a novel neural model for the understudied task of generating text from keywords. The model takes as input a set of un-ordered keywords, and part-of-speech (POS) based template instructions. This makes it ideal for surface realization in any NLG setup. The framework is based on the encode-attend-decode paradigm, where keywords and templates are encoded first, and the decoder jud… ▽ More

    Submitted 7 November, 2020; originally announced November 2020.

  8. arXiv:2009.00202  [pdf, other

    cs.AR

    Helper Without Threads: Customized Prefetching for Delinquent Irregular Loads

    Authors: Karthik Sankaranarayanan, Chit-Kwan Lin, Gautham Chinya

    Abstract: The growing memory footprints of cloud and big data applications mean that data center CPUs can spend significant time waiting for memory. An attractive approach to improving performance in such centralized compute settings is to employ prefetchers that are customized per application, where gains can be easily scaled across thousands of machines. Helper thread prefetching is such a technique but h… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 13 pages, 10 figures

  9. arXiv:2003.00845  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing target shift in zero-shot learning using grouped adversarial learning

    Authors: Saneem Ahmed Chemmengath, Soumava Paul, Samarth Bharadwaj, Suranjana Samanta, Karthik Sankaranarayanan

    Abstract: Zero-shot learning (ZSL) algorithms typically work by exploiting attribute correlations to be able to make predictions in unseen classes. However, these correlations do not remain intact at test time in most practical settings and the resulting change in these correlations lead to adverse effects on zero-shot learning performance. In this paper, we present a new paradigm for ZSL that: (i) utilizes… ▽ More

    Submitted 16 June, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Under submission at Neurips 2020

  10. arXiv:2002.10401  [pdf

    cs.CE cond-mat.mes-hall cond-mat.mtrl-sci

    BLAST: Bridging Length/time scales via Atomistic Simulation Toolkit

    Authors: Henry Chan, Badri Narayanan, Mathew Cherukara, Troy D. Loeffler, Michael G. Sternberg, Anthony Avarca, Subramanian K. R. S. Sankaranarayanan

    Abstract: The ever-increasing power of supercomputers coupled with highly scalable simulation codes have made molecular dynamics an indispensable tool in applications ranging from predictive modeling of materials to computational design and discovery of new materials for a broad range of applications. Multi-fidelity scale bridging between the various flavors of molecular dynamics i.e. ab-initio, classical a… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

  11. A Visual Analytics Framework for Adversarial Text Generation

    Authors: Brandon Laughlin, Christopher Collins, Karthik Sankaranarayanan, Khalil El-Khatib

    Abstract: This paper presents a framework which enables a user to more easily make corrections to adversarial texts. While attack algorithms have been demonstrated to automatically build adversaries, changes made by the algorithms can often have poor semantics or syntax. Our framework is designed to facilitate human intervention by aiding users in making corrections. The framework extends existing attack al… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

    Journal ref: 2019 IEEE Symposium on Visualization for Cyber Security (VizSec)

  12. arXiv:1906.05062  [pdf, other

    cs.CL cs.LG

    Unified Semantic Parsing with Weak Supervision

    Authors: Priyanka Agrawal, Parag Jain, Ayushi Dalmia, Abhishek Bansal, Ashish Mittal, Karthik Sankaranarayanan

    Abstract: Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains. However, the fundamental challenge lies in obtaining high-quality annotations of (utterance, program) pairs across various domains needed for training such models. To overcome this, we propose a novel framework to build a unified multi-domain enabled semantic… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: Association for Computational Linguistics (ACL) 2019

  13. arXiv:1810.11975  [pdf, other

    cs.LG cs.CL stat.ML

    On Controllable Sparse Alternatives to Softmax

    Authors: Anirban Laha, Saneem A. Chemmengath, Priyanka Agrawal, Mitesh M. Khapra, Karthik Sankaranarayanan, Harish G. Ramaswamy

    Abstract: Converting an n-dimensional vector to a probability distribution over n objects is a commonly used component in many machine learning tasks like multiclass classification, multilabel classification, attention mechanisms etc. For this, several probability map** functions have been proposed and employed in literature such as softmax, sum-normalization, spherical softmax, and sparsemax, but there i… ▽ More

    Submitted 30 October, 2018; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: To appear in NIPS 2018, Total 16 pages including appendix

  14. arXiv:1810.07931  [pdf, other

    cs.CL cs.LG

    Unsupervised Neural Text Simplification

    Authors: Sai Surya, Abhijit Mishra, Anirban Laha, Parag Jain, Karthik Sankaranarayanan

    Abstract: The paper presents a first attempt towards unsupervised neural text simplification that relies only on unlabeled text corpora. The core framework is composed of a shared encoder and a pair of attentional-decoders and gains knowledge of simplification through discrimination based-losses and denoising. The framework is trained using unlabeled text collected from en-Wikipedia dump. Our analysis (both… ▽ More

    Submitted 21 August, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: ACL 2019

  15. arXiv:1810.02889  [pdf, other

    cs.CL

    Scalable Micro-planned Generation of Discourse from Structured Data

    Authors: Anirban Laha, Parag Jain, Abhijit Mishra, Karthik Sankaranarayanan

    Abstract: We present a framework for generating natural language description from structured data such as tables; the problem comes under the category of data-to-text natural language generation (NLG). Modern data-to-text NLG systems typically employ end-to-end statistical and neural architectures that learn from a limited amount of task-specific labeled data, and therefore, exhibit limited scalability, dom… ▽ More

    Submitted 4 October, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: Accepted for Computational Linguistics journal on 17 Sep 2019

  16. arXiv:1809.04556  [pdf, other

    cs.CL cs.LG

    Unsupervised Controllable Text Formalization

    Authors: Parag Jain, Abhijit Mishra, Amar Prakash Azad, Karthik Sankaranarayanan

    Abstract: We propose a novel framework for controllable natural language transformation. Realizing that the requirement of parallel corpus is practically unsustainable for controllable generation tasks, an unsupervised training scheme is introduced. The crux of the framework is a deep neural encoder-decoder that is reinforced with text-transformation knowledge through auxiliary modules (called scorers). The… ▽ More

    Submitted 20 February, 2019; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: AAAI

  17. arXiv:1809.00410  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Modeling Topical Coherence in Discourse without Supervision

    Authors: Disha Shrivastava, Abhijit Mishra, Karthik Sankaranarayanan

    Abstract: Coherence of text is an important attribute to be measured for both manually and automatically generated discourse; but well-defined quantitative metrics for it are still elusive. In this paper, we present a metric for scoring topical coherence of an input paragraph on a real-valued scale by analyzing its underlying topical structure. We first extract all possible topics that the sentences of a pa… ▽ More

    Submitted 2 September, 2018; originally announced September 2018.

    Comments: 9 pages

  18. Neural Machine Translation for Low Resource Languages using Bilingual Lexicon Induced from Comparable Corpora

    Authors: Sree Harsha Ramesh, Krishna Prasad Sankaranarayanan

    Abstract: Resources for the non-English languages are scarce and this paper addresses this problem in the context of machine translation, by automatically extracting parallel sentence pairs from the multilingual articles available on the Internet. In this paper, we have used an end-to-end Siamese bidirectional recurrent neural network to generate parallel sentences from comparable multilingual articles in W… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

    Comments: 8 pages, 3 figures, 4 tables, NAACL-SRW (2018)

  19. arXiv:1804.07927  [pdf, other

    cs.CL

    DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension

    Authors: Amrita Saha, Rahul Aralikatte, Mitesh M. Khapra, Karthik Sankaranarayanan

    Abstract: We propose DuoRC, a novel dataset for Reading Comprehension (RC) that motivates several new challenges for neural approaches in language understanding beyond those offered by existing RC datasets. DuoRC contains 186,089 unique question-answer pairs created from a collection of 7680 pairs of movie plots where each pair in the collection reflects two versions of the same movie - one from Wikipedia a… ▽ More

    Submitted 10 October, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: Accepted in ACL 2018

  20. arXiv:1804.07790  [pdf, other

    cs.CL cs.AI

    A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

    Authors: Parag Jain, Anirban Laha, Karthik Sankaranarayanan, Preksha Nema, Mitesh M. Khapra, Shreyas Shetty

    Abstract: Structured data summarization involves generation of natural language summaries from structured input data. In this work, we consider summarizing structured data occurring in the form of tables as they are prevalent across a wide variety of domains. We formulate the standard table summarization problem, which deals with tables conforming to a single predefined schema. To this end, we propose a mix… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: Accepted in NAACL-HLT 2018 (Short paper)

  21. arXiv:1804.07789  [pdf, other

    cs.CL cs.AI cs.LG

    Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

    Authors: Preksha Nema, Shreyas Shetty, Parag Jain, Anirban Laha, Karthik Sankaranarayanan, Mitesh M. Khapra

    Abstract: In this work, we focus on the task of generating natural language descriptions from a structured table of facts containing fields (such as nationality, occupation, etc) and values (such as Indian, actor, director, etc). One simple choice is to treat the table as a sequence of fields and values and then use a standard seq2seq model for this task. However, such a model is too generic and does not ex… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: Accepted in NAACL-HLT 2018

  22. arXiv:1801.10314  [pdf, other

    cs.CL

    Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph

    Authors: Amrita Saha, Vardaan Pahuja, Mitesh M. Khapra, Karthik Sankaranarayanan, Sarath Chandar

    Abstract: While conversing with chatbots, humans typically tend to ask many questions, a significant portion of which can be answered by referring to large-scale knowledge graphs (KG). While Question Answering (QA) and dialog systems have been studied independently, there is a need to study them closely to evaluate such real-world scenarios faced by bots involving both these tasks. Towards this end, we intr… ▽ More

    Submitted 4 October, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

    Comments: Accepted in AAAI'18

  23. arXiv:1707.05501  [pdf, other

    cs.CL

    Story Generation from Sequence of Independent Short Descriptions

    Authors: Parag Jain, Priyanka Agrawal, Abhijit Mishra, Mohak Sukhwani, Anirban Laha, Karthik Sankaranarayanan

    Abstract: Existing Natural Language Generation (NLG) systems are weak AI systems and exhibit limited capabilities when language generation tasks demand higher levels of creativity, originality and brevity. Effective solutions or, at least evaluations of modern NLG paradigms for such creative tasks have been elusive, unfortunately. This paper introduces and addresses the task of coherent story generation fro… ▽ More

    Submitted 21 August, 2017; v1 submitted 18 July, 2017; originally announced July 2017.

    Comments: Accepted in SIGKDD Workshop on Machine Learning for Creativity (ML4Creativity), 2017

  24. arXiv:1707.05499  [pdf, other

    cs.LG cs.AI stat.ML

    A Machine Learning Approach for Evaluating Creative Artifacts

    Authors: Disha Shrivastava, Saneem Ahmed CG, Anirban Laha, Karthik Sankaranarayanan

    Abstract: Much work has been done in understanding human creativity and defining measures to evaluate creativity. This is necessary mainly for the reason of having an objective and automatic way of quantifying creative artifacts. In this work, we propose a regression-based learning framework which takes into account quantitatively the essential criteria for creativity like novelty, influence, value and unex… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: Accepted at SIGKDD Workshop on Machine Learning for Creativity (ML4Creativity), 2017

  25. arXiv:1704.00200  [pdf, other

    cs.CL

    Towards Building Large Scale Multimodal Domain-Aware Conversation Systems

    Authors: Amrita Saha, Mitesh Khapra, Karthik Sankaranarayanan

    Abstract: While multimodal conversation agents are gaining importance in several domains such as retail, travel etc., deep learning research in this area has been limited primarily due to the lack of availability of large-scale, open chatlogs. To overcome this bottleneck, in this paper we introduce the task of multimodal, domain-aware conversations, and propose the MMD benchmark dataset. This dataset was ga… ▽ More

    Submitted 31 January, 2018; v1 submitted 1 April, 2017; originally announced April 2017.