Skip to main content

Showing 1–8 of 8 results for author: Arjit

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.00046  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    SpliceOut: A Simple and Efficient Audio Augmentation Method

    Authors: Arjit Jain, Pranay Reddy Samala, Deepak Mittal, Preethi Jyoti, Maneesh Singh

    Abstract: Time masking has become a de facto augmentation technique for speech and audio tasks, including automatic speech recognition (ASR) and audio classification, most notably as a part of SpecAugment. In this work, we propose SpliceOut, a simple modification to time masking which makes it computationally more efficient. SpliceOut performs comparably to (and sometimes outperforms) SpecAugment on a wide… ▽ More

    Submitted 13 October, 2021; v1 submitted 30 September, 2021; originally announced October 2021.

  2. arXiv:2108.12134  [pdf, ps, other

    cs.AI

    WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving

    Authors: Arjit Sharma, Sahil Sharma

    Abstract: Urban autonomous driving is an open and challenging problem to solve as the decision-making system has to account for several dynamic factors like multi-agent interactions, diverse scene perceptions, complex road geometries, and other rarely occurring real-world events. On the other side, with deep reinforcement learning (DRL) techniques, agents have learned many complex policies. They have even a… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 10 pages, 8 figures, and 4 tables

  3. arXiv:2106.03837  [pdf, other

    cs.LG cs.AI

    MemStream: Memory-Based Streaming Anomaly Detection

    Authors: Siddharth Bhatia, Arjit Jain, Shivin Srivastava, Kenji Kawaguchi, Bryan Hooi

    Abstract: Given a stream of entries over time in a multi-dimensional data setting where concept drift is present, how can we detect anomalous activities? Most of the existing unsupervised anomaly detection approaches seek to detect anomalous events in an offline fashion and require a large amount of data for training. This is not practical in real-life scenarios where we receive the data in a streaming mann… ▽ More

    Submitted 4 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: The Web Conference (WWW), 2022

  4. arXiv:2104.03986  [pdf, other

    cs.DB cs.AI cs.LG stat.ML

    Deep Indexed Active Learning for Matching Heterogeneous Entity Representations

    Authors: Arjit Jain, Sunita Sarawagi, Prithviraj Sen

    Abstract: Given two large lists of records, the task in entity resolution (ER) is to find the pairs from the Cartesian product of the lists that correspond to the same real world entity. Typically, passive learning methods on such tasks require large amounts of labeled data to yield useful models. Active Learning is a promising approach for ER in low resource settings. However, the search space, to find inf… ▽ More

    Submitted 17 January, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: VLDB 2022

  5. arXiv:2009.08454  [pdf, other

    cs.LG cs.AI stat.ML

    ExGAN: Adversarial Generation of Extreme Samples

    Authors: Siddharth Bhatia, Arjit Jain, Bryan Hooi

    Abstract: Mitigating the risk arising from extreme events is a fundamental goal with many applications, such as the modelling of natural disasters, financial crashes, epidemics, and many others. To manage this risk, a vital step is to be able to understand or generate a wide range of extreme scenarios. Existing approaches based on Generative Adversarial Networks (GANs) excel at generating realistic samples,… ▽ More

    Submitted 15 March, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: AAAI Conference on Artificial Intelligence (AAAI), 2021

  6. arXiv:2009.08451  [pdf, other

    cs.LG cs.AI stat.ML

    MSTREAM: Fast Anomaly Detection in Multi-Aspect Streams

    Authors: Siddharth Bhatia, Arjit Jain, Pan Li, Ritesh Kumar, Bryan Hooi

    Abstract: Given a stream of entries in a multi-aspect data setting i.e., entries having multiple dimensions, how can we detect anomalous activities in an unsupervised manner? For example, in the intrusion detection setting, existing work seeks to detect anomalous events or edges in dynamic graph streams, but this does not allow us to take into account additional attributes of each entry. Our work aims to de… ▽ More

    Submitted 30 March, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: The Web Conference (WWW), 2021

  7. arXiv:1711.05680  [pdf, other

    cs.CL

    An Unsupervised Approach for Map** between Vector Spaces

    Authors: Syed Sarfaraz Akhtar, Arihant Gupta, Avijit Vajpayee, Arjit Srivastava, Madan Gopal Jhawar, Manish Shrivastava

    Abstract: We present a language independent, unsupervised approach for transforming word embeddings from source language to target language using a transformation matrix. Our model handles the problem of data scarcity which is faced by many languages in the world and yields improved word embeddings for words in the target language by relying on transformed embeddings of words of the source language. We init… ▽ More

    Submitted 20 November, 2017; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: CICLing 2017

  8. arXiv:1711.05678  [pdf, other

    cs.CL

    Unsupervised Morphological Expansion of Small Datasets for Improving Word Embeddings

    Authors: Syed Sarfaraz Akhtar, Arihant Gupta, Avijit Vajpayee, Arjit Srivastava, Manish Shrivastava

    Abstract: We present a language independent, unsupervised method for building word embeddings using morphological expansion of text. Our model handles the problem of data sparsity and yields improved word embeddings by relying on training word embeddings on artificially generated sentences. We evaluate our method using small sized training sets on eleven test sets for the word similarity task across seven l… ▽ More

    Submitted 15 November, 2017; originally announced November 2017.

    Comments: CICLing 2017