Showing 101–150 of 255 results for author: Goyal, P

  1. arXiv:2109.05897  [pdf, other]

    cs.CL cs.IR cs.LG

    Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework

    Authors: Abhilash Nandy, Soumya Sharma, Shubham Maddhashiya, Kapil Sachdeva, Pawan Goyal, Niloy Ganguly

    Abstract: Answering questions asked from instructional corpora such as E-manuals, recipe books, etc., has been far less studied than open-domain factoid context-based question answering. This can be primarily attributed to the absence of standard benchmark datasets. In this paper we meticulously create a large amount of data connected with E-manuals and develop suitable algorithm to exploit it. We collect E… ▽ More

    Submitted 14 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: EMNLP Findings 2021, Long

  2. The geometrical meaning of statistical isotropy of smooth random fields in two dimensions

    Authors: Pravabati Chingangbam, Priya Goyal, K. P. Yogendran, Stephen Appleby

    Abstract: We revisit the geometrical meaning of statistical isotropy that is manifest in excursion sets of smooth random fields in two dimensions. Using the contour Minkowski tensor, $\W_1$, as our basic tool we first examine geometrical properties of single structures. For simple closed curves in two dimensions we show that $\W_1$ is proportional to the identity matrix if the curve has $m$-fold symmetry, w… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 14 pages

  3. arXiv:2109.04997  [pdf, other]

    cs.CL cs.LG

    Box Embeddings: An open-source library for representation learning using geometric structures

    Authors: Tejas Chheda, Purujit Goyal, Trang Tran, Dhruvesh Patel, Michael Boratko, Shib Sankar Dasgupta, Andrew McCallum

    Abstract: A major factor contributing to the success of modern representation learning is the ease of performing various vector operations. Recently, objects with geometric structures (eg. distributions, complex or hyperbolic vectors, or regions such as cones, disks, or boxes) have been explored for their alternative inductive biases and additional representational capacities. In this work, we introduce Box… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: The source code and the usage and API documentation for the library is available at https://github.com/iesl/box-embeddings and https://www.iesl.cs.umass.edu/box-embeddings/main/index.html

  4. arXiv:2108.04366  [pdf, other]

    cs.CL

    COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews

    Authors: Shruti Singh, Mayank Singh, Pawan Goyal

    Abstract: Comparing research papers is a conventional method to demonstrate progress in experimental research. We present COMPARE, a taxonomy and a dataset of comparison discussions in peer reviews of research papers in the domain of experimental deep learning. From a thorough observation of a large set of review sentences, we build a taxonomy of categories in comparison discussions and present a detailed a… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 4 pages, JCDL 2021

  5. arXiv:2108.00524  [pdf, other]

    cs.SI cs.CL cs.LG

    You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights

    Authors: Mithun Das, Punyajoy Saha, Ritam Dutt, Pawan Goyal, Animesh Mukherjee, Binny Mathew

    Abstract: Hate speech is regarded as one of the crucial issues plaguing the online social media. The current literature on hate speech detection leverages primarily the textual content to find hateful posts and subsequently identify hateful users. However, this methodology disregards the social connections between users. In this paper, we run a detailed exploration of the problem space and investigate an ar… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    Comments: Extended Version of this paper has been accepted at ACM HT'21. Link to the Code: https://github.com/hate-alert/Hateful-users-detection

  6. arXiv:2107.12950  [pdf, ps, other]

    eess.SY math.OC

    A Greedy Data Collection Scheme For Linear Dynamical Systems

    Authors: Karim Cherifi, Pawan Goyal, Peter Benner

    Abstract: Mathematical models are essential to analyze and understand the dynamics of complex systems. Recently, data-driven methodologies have got a lot of attention which is leveraged by advancements in sensor technology. However, the quality of obtained data plays a vital role in learning a good and reliable model. Therefore, in this paper, we propose an efficient heuristic methodology to collect data bo… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

  7. arXiv:2106.10862  [pdf, other]

    cs.CL

    ArgFuse: A Weakly-Supervised Framework for Document-Level Event Argument Aggregation

    Authors: Debanjana Kar, Sudeshna Sarkar, Pawan Goyal

    Abstract: Most of the existing information extraction frameworks (Wadden et al., 2019; Veyseh et al., 2020) focus on sentence-level tasks and are hardly able to capture the consolidated information from a given document. In our endeavour to generate precise document-level information frames from lengthy textual records, we introduce the task of Information Aggregation or Argument Aggregation. More specifical… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 11 pages, 8 figures, Accepted in Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE) @ACL-IJCNLP 2021

    ACM Class: I.2.7

  8. arXiv:2106.05852  [pdf]

    eess.AS cs.CL cs.LG cs.SD

    Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

    Authors: Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, Pawan Goyal

    Abstract: Automatic speech recognition (ASR) in Sanskrit is interesting, owing to the various linguistic peculiarities present in the language. The Sanskrit language is lexically productive, undergoes euphonic assimilation of phones at the word boundaries and exhibits variations in spelling conventions and in pronunciations. In this work, we propose the first large scale study of automatic speech recognitio… ▽ More

    Submitted 23 July, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted paper at the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021 Findings)

  9. arXiv:2106.05468  [pdf, other]

    cs.LG

    Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

    Authors: Vaikkunth Mugunthan, Pawan Goyal, Lalana Kagal

    Abstract: Vertical Federated Learning (VFL) refers to the collaborative training of a model on a dataset where the features of the dataset are split among multiple data owners, while label information is owned by a single data owner. In this paper, we propose a novel method, Multi Vertical Federated Learning (Multi-VFL), to train VFL models when there are multiple data and label owners. Our approach is the… ▽ More

    Submitted 16 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  10. arXiv:2106.02972  [pdf, other]

    cs.AI cs.CL cs.LG

    Zero-shot Task Adaptation using Natural Language

    Authors: Prasoon Goyal, Raymond J. Mooney, Scott Niekum

    Abstract: Imitation learning and instruction-following are two common approaches to communicate a user's intent to a learning agent. However, as the complexity of tasks grows, it could be beneficial to use both demonstrations and language to communicate with an agent. In this work, we propose a novel setting where an agent is given both a demonstration and a description, and must combine information from bo… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

  11. arXiv:2105.10473  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Synthesis and water permeation studies of polysulfone based composite membranes having vertically aligned CNTs

    Authors: Bhakti Hirani, P. S. Goyal

    Abstract: Polymeric membranes, including Polysulfone (PSf) membranes, are routinely used for water treatment. It is known for quite some time that water permeability of above membranes can be improved if one incorporates carbon nanotubes (single-walled, SWCNTs or multi-walled, MWCNTs) in to the membrane and aligns them in direction of flow of water. This paper reports a method of synthesizing polymeric memb… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 17 pages, 9 figures, 2 tables

  12. Discovery of Nonlinear Dynamical Systems using a Runge-Kutta Inspired Dictionary-based Sparse Regression Approach

    Authors: Pawan Goyal, Peter Benner

    Abstract: Discovering dynamical models to describe underlying dynamical behavior is essential to draw decisive conclusions and engineering studies, e.g., optimizing a process. Experimental data availability notwithstanding has increased significantly, but interpretable and explainable models in science and engineering yet remain incomprehensible. In this work, we blend machine learning and dictionary-based… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  13. arXiv:2105.00477  [pdf, other]

    cs.CL

    Event Argument Extraction using Causal Knowledge Structures

    Authors: Debanjana Kar, Sudeshna Sarkar, Pawan Goyal

    Abstract: Event Argument extraction refers to the task of extracting structured information from unstructured text for a particular event of interest. The existing works exhibit poor capabilities to extract causal event arguments like Reason and After Effects. Furthermore, most of the existing works model this task at a sentence level, restricting the context to a local scope. While it may be effective for… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: 10 pages, 6 figures, Accepted in 17th International Conference on Natural Language Processing (ICON 2020)

  14. arXiv:2104.10869  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    CrysXPP:An Explainable Property Predictor for Crystalline Materials

    Authors: Kishalay Das, Bidisha Samanta, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly

    Abstract: We present a deep-learning framework, CrysXPP, to allow rapid prediction of electronic, magnetic and elastic properties of a wide range of materials with reasonable precision. Although our work is consistent with several recent attempts to build deep learning-based property predictors, it overcomes some of their limitations. CrysXPP lowers the need for a large volume of tagged data to train a deep… ▽ More

    Submitted 2 February, 2022; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: To be published in NPJ Computational Materials

  15. Local patch analysis for testing statistical isotropy of the Planck convergence map

    Authors: Priya Goyal, Pravabati Chingangbam

    Abstract: The small but measurable effect of weak gravitational lensing on the cosmic microwave background radiation provides information about the large-scale distribution of matter in the universe. We use the all sky distribution of matter, as represented by the convergence map that is inferred from CMB lensing measurement by Planck survey, to test the fundamental assumption of Statistical Isotropy (… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 23 pages, 6 figures

  16. arXiv:2104.00270  [pdf, other]

    cs.CL

    Evaluating Neural Word Embeddings for Sanskrit

    Authors: Jivnesh Sandhan, Om Adideva, Digumarthi Komal, Laxmidhar Behera, Pawan Goyal

    Abstract: Recently, the supervised learning paradigm's surprisingly remarkable performance has garnered considerable attention from Sanskrit Computational Linguists. As a result, the Sanskrit community has put laudable efforts to build task-specific labeled data for various downstream Natural Language Processing (NLP) tasks. The primary component of these approaches comes from representations of word embedd… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 14 pages, The work is submitted at WSC 2022, Canberra, Australia

  17. Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey

    Authors: Tapas Nayak, Navonil Majumder, Pawan Goyal, Soujanya Poria

    Abstract: Recently, with the advances made in continuous representation of words (word embeddings) and deep neural architectures, many research works are published in the area of relation extraction and it is very difficult to keep track of so many papers. To help future research, we present a comprehensive review of the recently published research works in relation extraction. We mostly focus on relation e… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: A survey paper for relation extraction. Cogn Comput (2021)

  18. arXiv:2103.10043  [pdf, other]

    cs.CV cs.LG cs.MM

    Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training

    Authors: Saurabh Sahu, Palash Goyal

    Abstract: The introduction of Transformer model has led to tremendous advancements in sequence modeling, especially in text domain. However, the use of attention-based models for video understanding is still relatively unexplored. In this paper, we introduce Gated Adversarial Transformer (GAT) to enhance the applicability of attention-based models to videos. GAT uses a multi-level attention gate to model th… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  19. arXiv:2103.02249  [pdf, other]

    cs.LG cs.AI math.DS

    LQResNet: A Deep Neural Network Architecture for Learning Dynamic Processes

    Authors: Pawan Goyal, Peter Benner

    Abstract: Mathematical modeling is an essential step, for example, to analyze the transient behavior of a dynamical process and to perform engineering studies such as optimization and control. With the help of first-principles and expert knowledge, a dynamic model can be built, but for complex dynamic processes, appearing, e.g., in biology, chemical plants, neuroscience, financial markets, this often remain… ▽ More

    Submitted 27 March, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  20. arXiv:2103.01988  [pdf, other]

    cs.CV cs.AI

    Self-supervised Pretraining of Visual Features in the Wild

    Authors: Priya Goyal, Mathilde Caron, Benjamin Lefaudeux, Min Xu, Pengchao Wang, Vivek Pai, Mannat Singh, Vitaliy Liptchinsky, Ishan Misra, Armand Joulin, Piotr Bojanowski

    Abstract: Recently, self-supervised learning methods like MoCo, SimCLR, BYOL and SwAV have reduced the gap with supervised methods. These results have been achieved in a control environment, that is the highly curated ImageNet dataset. However, the premise of self-supervised learning is that it can learn from any random image and from any unbounded dataset. In this work, we explore if self-supervision lives… ▽ More

    Submitted 5 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  21. arXiv:2103.01314  [pdf, other]

    cs.NI

    SWP: Microsecond Network SLOs Without Priorities

    Authors: Kevin Zhao, Prateesh Goyal, Mohammad Alizadeh, Thomas E. Anderson

    Abstract: The increasing use of cloud computing for latency-sensitive applications has sparked renewed interest in providing tight bounds on network tail latency. Achieving this in practice at reasonable network utilization has proved elusive, due to a combination of highly bursty application demand, faster link speeds, and heavy-tailed message sizes. While priority scheduling can be used to reduce tail lat… ▽ More

    Submitted 2 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  22. Random Sampling in Reproducing Kernel Subspace of Mixed Lebesgue Spaces

    Authors: Prashant Goyal, Dhiraj Patel, Sivananthan Sampath

    Abstract: In this article, we consider the random sampling in the image space $V$ of mixed Lebesgue space $L^{p,q}(\mathbb{R}^{n+1})$ under an idempotent integral operator. We assume some decay and regularity conditions of the kernel and approximate the unit sphere in $V$ on a bounded cube $C_{R,S}$ by a finite-dimensional subspace of $V$. Consequently, the set of concentrated functions is totally bounded.… ▽ More

    Submitted 20 July, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Communicated

  23. arXiv:2102.06551  [pdf, other]

    cs.CL

    A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages

    Authors: Jivnesh Sandhan, Amrith Krishna, Ashim Gupta, Laxmidhar Behera, Pawan Goyal

    Abstract: Neural dependency parsing has achieved remarkable performance for many domains and languages. The bottleneck of massive labeled data limits the effectiveness of these approaches for low resource languages. In this work, we focus on dependency parsing for morphological rich languages (MRLs) in a low-resource setting. Although morphological information is essential for the dependency parsing task, t… ▽ More

    Submitted 12 April, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: 6 pages, The work is accepted at EACL-SRW, 2021, Kyiv, Ukraine. Typos corrected in Section 3.2

  24. arXiv:2101.09743  [pdf, other]

    cs.CL

    A Novel Two-stage Framework for Extracting Opinionated Sentences from News Articles

    Authors: Rajkumar Pujari, Swara Desai, Niloy Ganguly, Pawan Goyal

    Abstract: This paper presents a novel two-stage framework to extract opinionated sentences from a given news article. In the first stage, Naive Bayes classifier by utilizing the local features assigns a score to each sentence - the score signifies the probability of the sentence to be opinionated. In the second stage, we use this prior within the HITS (Hyperlink-Induced Topic Search) schema to exploit the g… ▽ More

    Submitted 24 January, 2021; originally announced January 2021.

    Comments: Presented as a talk at TextGraphs-9: the workshop on Graph-based Methods for Natural Language Processing at EMNLP 2014

  25. Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Sentiment Analysis in the Wild

    Authors: Rajdeep Mukherjee, Shreyas Shetty, Subrata Chattopadhyay, Subhadeep Maji, Samik Datta, Pawan Goyal

    Abstract: With the exponential growth of online marketplaces and user-generated content therein, aspect-based sentiment analysis has become more important than ever. In this work, we critically review a representative sample of the models published during the past six years through the lens of a practitioner, with an eye towards deployment in production. First, our rigorous empirical evaluation reveals poor… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: 12 pages, accepted at ECIR 2021

    ACM Class: I.2.7

  26. arXiv:2101.08729  [pdf, other]

    cs.IR cs.SE

    Joint Autoregressive and Graph Models for Software and Developer Social Networks

    Authors: Rima Hazra, Hardik Aggarwal, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti

    Abstract: Social network research has focused on hyperlink graphs, bibliographic citations, friend/follow patterns, influence spread, etc. Large software repositories also form a highly valuable networked artifact, usually in the form of a collection of packages, their developers, dependencies among them, and bug reports. This "social network of code" is rarely studied by social network researchers. We intr… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at ECIR 2021

  27. Medical Entity Linking using Triplet Network

    Authors: Ishani Mondal, Sukannya Purkayastha, Sudeshna Sarkar, Pawan Goyal, Jitesh Pillai, Amitava Bhattacharyya, Mahanandeeshwar Gattu

    Abstract: Entity linking (or Normalization) is an essential task in text mining that maps the entity mentions in the medical text to standard entities in a given Knowledge Base (KB). This task is of great importance in the medical domain. It can also be used for merging different medical and clinical ontologies. In this paper, we center around the problem of disease linking or normalization. This task is ex… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: ClinicalNLP@NAACL 2019

  28. arXiv:2012.10289  [pdf, other]

    cs.CL cs.AI cs.SI

    HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

    Authors: Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

    Abstract: Hate speech is a challenging issue plaguing the online social media. While better models for hate speech detection are continuously being developed, there is little research on the bias and interpretability aspects of hate speech. In this paper, we introduce HateXplain, the first benchmark hate speech dataset covering multiple aspects of the issue. Each post in our dataset is annotated from three… ▽ More

    Submitted 12 April, 2022; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: 12 pages, 7 figures, 8 tables. Accepted at AAAI 2021

  29. arXiv:2012.07553  [pdf, other]

    cs.CL cs.AI

    An End-to-End Solution for Named Entity Recognition in eCommerce Search

    Authors: Xiang Cheng, Mitchell Bowden, Bhushan Ramesh Bhange, Priyanka Goyal, Thomas Packer, Faizan Javed

    Abstract: Named entity recognition (NER) is a critical step in modern search query understanding. In the domain of eCommerce, identifying the key entities, such as brand and product type, can help a search engine retrieve relevant products and therefore offer an engaging shopping experience. Recent research shows promising results on shared benchmark NER tasks using deep learning methods, but there are stil… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI IAAI-2021 Highly Innovative Applications of AI track

  30. arXiv:2011.10337  [pdf, other]

    cs.LG cs.CL cs.IR

    Finding Prerequisite Relations between Concepts using Textbook

    Authors: Shivam Pal, Vipul Arora, Pawan Goyal

    Abstract: A prerequisite is anything that you need to know or understand first before attempting to learn or understand something new. In the current work, we present a method of finding prerequisite relations between concepts using related textbooks. Previous researchers have focused on finding these relations using Wikipedia link structure through unsupervised and supervised learning approaches. In the cu… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  31. arXiv:2011.08067  [pdf, other]

    cs.CL

    Hierarchical Transformer for Task Oriented Dialog Systems

    Authors: Bishal Santra, Potnuru Anusha, Pawan Goyal

    Abstract: Generative models for dialog systems have gained much interest because of the recent success of RNN and Transformer based models in tasks like question answering and summarization. Although the task of dialog response generation is generally seen as a sequence-to-sequence (Seq2Seq) problem, researchers in the past have found it challenging to train dialog systems using the standard Seq2Seq models.… ▽ More

    Submitted 9 May, 2021; v1 submitted 24 October, 2020; originally announced November 2020.

    Comments: v3: Latest camera ready version; 10 pages; Codes: https://github.com/bsantraigi/HIER , https://github.com/bsantraigi/hier-transformer-pytorch v2: To appear in NAACL 2021 (Long Paper) v1: preprint

  32. Site-to-Site Internet Traffic Control

    Authors: Frank Cangialosi, Akshay Narayan, Prateesh Goyal, Radhika Mittal, Mohammad Alizadeh, Hari Balakrishnan

    Abstract: Queues allow network operators to control traffic: where queues build, they can enforce scheduling and shaping policies. In the Internet today, however, there is a mismatch between where queues build and where control is most effectively enforced; queues build at bottleneck links that are often not under the control of the data sender. To resolve this mismatch, we propose a new kind of middlebox,… ▽ More

    Submitted 27 April, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: 15 pages, 14 figures

  33. arXiv:2010.06701  [pdf, other]

    math.DS cs.LG math.NA

    Operator Inference and Physics-Informed Learning of Low-Dimensional Models for Incompressible Flows

    Authors: Peter Benner, Pawan Goyal, Jan Heiland, Igor Pontes Duff

    Abstract: Reduced-order modeling has a long tradition in computational fluid dynamics. The ever-increasing significance of data for the synthesis of low-order models is well reflected in the recent successes of data-driven approaches such as Dynamic Mode Decomposition and Operator Inference. With this work, we suggest a new approach to learning structured low-order models for incompressible flow from data t… ▽ More

    Submitted 7 December, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: 23 pages, 14 figures

    MSC Class: 37N10; 68T05; 76D05; 65F22; 93A15; 93C10

  34. arXiv:2009.06819  [pdf, other]

    cs.CL

    MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature

    Authors: Souradip Guha, Ankan Mullick, Jatin Agrawal, Swetarekha Ram, Samir Ghui, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal

    Abstract: The number of published articles in the field of materials science is growing rapidly every year. This comparatively unstructured data source, which contains a large amount of information, has a restriction on its re-usability, as the information needed to carry out further calculations using the data in it must be extracted manually. It is very important to obtain valid and contextually correct i… ▽ More

    Submitted 22 January, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: 13 pages, 8 figures, Accepted for publication in Computational Material Science

    Journal ref: Computational Material Science, 2021

  35. arXiv:2007.15543  [pdf, other]

    cs.LG cs.AI stat.ML

    PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards

    Authors: Prasoon Goyal, Scott Niekum, Raymond J. Mooney

    Abstract: Reinforcement learning (RL), particularly in sparse reward settings, often requires prohibitively large numbers of interactions with the environment, thereby limiting its applicability to complex problems. To address this, several prior approaches have used natural language to guide the agent's exploration. However, these approaches typically operate on structured representations of the environmen… ▽ More

    Submitted 19 November, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Conference on Robot Learning (CoRL), 2020

  36. arXiv:2007.14079  [pdf, other]

    math.NA

    Data-Driven Learning of Reduced-order Dynamics for a Parametrized Shallow Water Equation

    Authors: Süleyman Yıldız, Pawan Goyal, Peter Benner, Bülent Karasözen

    Abstract: This paper discusses a non-intrusive data-driven model order reduction method that learns low-dimensional dynamical models for a parametrized shallow water equation. We consider the shallow water equation in non-traditional form (NTSWE). We focus on learning low-dimensional models in a non-intrusive way. That means, we assume not to have access to a discretized form of the NTSWE in any form. Inste… ▽ More

    Submitted 4 August, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

  37. arXiv:2007.10784  [pdf, other]

    cs.LG cs.NE stat.ML

    OccamNet: A Fast Neural Model for Symbolic Regression at Scale

    Authors: Owen Dugan, Rumen Dangovski, Allan Costa, Samuel Kim, Pawan Goyal, Joseph Jacobson, Marin Soljačić

    Abstract: Neural networks' expressiveness comes at the cost of complex, black-box models that often extrapolate poorly beyond the domain of the training dataset, conflicting with the goal of finding compact analytic expressions to describe scientific data. We introduce OccamNet, a neural network model that finds interpretable, compact, and sparse symbolic fits to data, à la Occam's razor. Our model defines… ▽ More

    Submitted 27 November, 2023; v1 submitted 16 July, 2020; originally announced July 2020.

  38. Logic Constrained Pointer Networks for Interpretable Textual Similarity

    Authors: Subhadeep Maji, Rohan Kumar, Manish Bansal, Kalyani Roy, Pawan Goyal

    Abstract: Systematically discovering semantic relationships in text is an important and extensively studied area in Natural Language Processing, with various tasks such as entailment, semantic similarity, etc. Decomposability of sentence-level scores via subsequence alignments has been proposed as a way to make models more interpretable. We study the problem of aligning components of sentences leading to an… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted at IJCAI 2020 Main Track. Sole copyright holder is IJCAI, all rights reserved. Available at https://www.ijcai.org/Proceedings/2020/333

    Journal ref: IJCAI 2020, Pages 2405-2411

  39. arXiv:2006.09882  [pdf, other]

    cs.CV

    Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

    Authors: Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin

    Abstract: Unsupervised image representations have significantly reduced the gap with supervised pretraining, notably with the recent achievements of contrastive learning methods. These contrastive methods typically work online and rely on a large number of explicit pairwise feature comparisons, which is computationally challenging. In this paper, we propose an online algorithm, SwAV, that takes advantage of… ▽ More

    Submitted 8 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  40. Read what you need: Controllable Aspect-based Opinion Summarization of Tourist Reviews

    Authors: Rajdeep Mukherjee, Hari Chandana Peruri, Uppada Vishnu, Pawan Goyal, Sourangshu Bhattacharya, Niloy Ganguly

    Abstract: Manually extracting relevant aspects and opinions from large volumes of user-generated text is a time-consuming process. Summaries, on the other hand, help readers with limited time budgets to quickly consume the key ideas from the data. State-of-the-art approaches for multi-document summarization, however, do not consider user preferences while generating summaries. In this work, we argue the nee… ▽ More

    Submitted 9 June, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 4 pages, accepted in the Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020

    ACM Class: H.3.3

  41. arXiv:2006.03629  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Hierarchical Class-Based Curriculum Loss

    Authors: Palash Goyal, Shalini Ghosh

    Abstract: Classification algorithms in machine learning often assume a flat label space. However, most real world data have dependencies between the labels, which can often be captured by using a hierarchy. Utilizing this relation can help develop a model capable of satisfying the dependencies and improving model accuracy and interpretability. Further, as different levels in the hierarchy correspond to diff… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  42. arXiv:2006.03257  [pdf, other]

    cs.CL cs.DL cs.LG

    Aspect-based Sentiment Analysis of Scientific Reviews

    Authors: Souvic Chakraborty, Pawan Goyal, Animesh Mukherjee

    Abstract: Scientific papers are complex and understanding the usefulness of these papers requires prior knowledge. Peer reviews are comments on a paper provided by designated experts on that field and hold a substantial amount of information, not only for the editors and chairs to make the final decision, but also to judge the potential impact of the paper. In this paper, we propose to use aspect-based sent…

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted in JCDL'20

    ACM Class: I.5.4; I.2.6; I.5.2; I.5.4; I.5.1

  43. arXiv:2005.14613  [pdf, other]

    cs.CL cs.IR

    Using Large Pretrained Language Models for Answering User Queries from Product Specifications

    Authors: Kalyani Roy, Smit Shah, Nithish Pai, Jaidam Ramtej, Prajit Prashant Nadkarn, Jyotirmoy Banerjee, Pawan Goyal, Surender Kumar

    Abstract: While buying a product from the e-commerce websites, customers generally have a plethora of questions. From the perspective of both the e-commerce service provider as well as the customers, there must be an effective question answering system to provide immediate answers to the user queries. While certain questions can only be answered after using the product, there are many questions which can be…

    Submitted 29 May, 2020; originally announced May 2020.

    Comments: 5 pages

  44. arXiv:2005.10893  [pdf, other]

    cs.CL

    Evaluating Neural Morphological Taggers for Sanskrit

    Authors: Ashim Gupta, Amrith Krishna, Pawan Goyal, Oliver Hellwig

    Abstract: Neural sequence labelling approaches have achieved state of the art results in morphological tagging. We evaluate the efficacy of four standard sequence labelling models on Sanskrit, a morphologically rich, fusional Indian language. As its label space can theoretically contain more than 40,000 labels, systems that explicitly model the internal structure of a label are more suited for the task, bec…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted to SIGMORPHON Workshop at ACL 2020

  45. arXiv:2005.09371  [pdf, ps, other]

    eess.SY

    A Non-Intrusive Method to Inferring Linear Port-Hamiltonian Realizations using Time-Domain Data

    Authors: Karim Cherifi, Pawan Goyal, Peter Benner

    Abstract: Port-Hamiltonian systems have gained a lot of attention in recent years due to their inherent valuable properties in modeling and control. In this paper, we are interested in constructing linear port-Hamiltonian systems from time-domain input-output data. We discuss a non-intrusive methodology that is comprised of two main ingredients -- (a) inferring frequency response data from time-domain data…

    Submitted 17 November, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

    MSC Class: 93A30; 93B30; 93B15; 93B20

  46. arXiv:2004.08076  [pdf]

    cs.CL

    Neural Approaches for Data Driven Dependency Parsing in Sanskrit

    Authors: Amrith Krishna, Ashim Gupta, Deepak Garasangi, Jivnesh Sandhan, Pavankumar Satuluri, Pawan Goyal

    Abstract: Data-driven approaches for dependency parsing have been of great interest in Natural Language Processing for the past couple of decades. However, Sanskrit still lacks a robust purely data-driven dependency parser, probably with an exception to Krishna (2019). This can primarily be attributed to the lack of availability of task-specific labelled data and the morphologically rich nature of the langu…

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: submitted to WSC 2021

  47. arXiv:2004.05553  [pdf, other]

    cs.SI cs.LG stat.ML

    Exploring Effects of Random Walk Based Minibatch Selection Policy on Knowledge Graph Completion

    Authors: Bishal Santra, Prakhar Sharma, Sumegh Roychowdhury, Pawan Goyal

    Abstract: In this paper, we have explored the effects of different minibatch sampling techniques in Knowledge Graph Completion. Knowledge Graph Completion (KGC) or Link Prediction is the task of predicting missing facts in a knowledge graph. KGC models are usually trained using margin, soft-margin or cross-entropy loss function that promotes assigning a higher score or probability for true fact triplets. Mi…

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 7 pages, 3 figures

  48. arXiv:2003.06928  [pdf, other]

    math.PR math.NA

    Low-dimensional approximations of high-dimensional asset price models

    Authors: Martin Redmann, Christian Bayer, Pawan Goyal

    Abstract: We consider high-dimensional asset price models that are reduced in their dimension in order to reduce the complexity of the problem or the effect of the curse of dimensionality in the context of option pricing. We apply model order reduction (MOR) to obtain a reduced system. MOR has been previously studied for asymptotically stable controlled stochastic systems with zero initial conditions. Howev…

    Submitted 1 April, 2021; v1 submitted 15 March, 2020; originally announced March 2020.

    MSC Class: Primary: 91G20; 91G60; 93A15 Secondary: 60H10; 65C30

  49. arXiv:2003.05698  [pdf, other]

    cs.CV

    Low-Rank and Total Variation Regularization and Its Application to Image Recovery

    Authors: Pawan Goyal, Hussam Al Daas, Peter Benner

    Abstract: In this paper, we study the problem of image recovery from given partial (corrupted) observations. Recovering an image using a low-rank model has been an active research area in data analysis and machine learning. But often, images are not only of low-rank but they also exhibit sparsity in a transformed space. In this work, we propose a new problem formulation in such a way that we seek to recover…

    Submitted 12 March, 2020; originally announced March 2020.

  50. arXiv:2003.03501  [pdf, other]

    cs.CV cs.LG

    Cross-modal Learning for Multi-modal Video Categorization

    Authors: Palash Goyal, Saurabh Sahu, Shalini Ghosh, Chul Lee

    Abstract: Multi-modal machine learning (ML) models can process data in multiple modalities (e.g., video, audio, text) and are useful for video content analysis in a variety of problems (e.g., object detection, scene understanding, activity recognition). In this paper, we focus on the problem of video categorization using a multi-modal ML technique. In particular, we have developed a novel multi-modal ML app…

    Submitted 5 June, 2020; v1 submitted 6 March, 2020; originally announced March 2020.