Skip to main content

Showing 1–50 of 66 results for author: Verma, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07166  [pdf, other

    cs.CV

    Resource Efficient Perception for Vision Systems

    Authors: A V Subramanyam, Niyati Singal, Vinay K Verma

    Abstract: Despite the rapid advancement in the field of image recognition, the processing of high-resolution imagery remains a computational challenge. However, this processing is pivotal for extracting detailed object insights in areas ranging from autonomous vehicle navigation to medical imaging analyses. Our study introduces a framework aimed at mitigating these challenges by leveraging memory efficient… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  2. arXiv:2404.19341  [pdf, other

    cs.CV cs.AI

    Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs

    Authors: Soham Mitra, Atri Sukul, Swalpa Kumar Roy, Pravendra Singh, Vinay Verma

    Abstract: Deep learning models have achieved remarkable success across diverse domains. However, the intricate nature of these models often impedes a clear understanding of their decision-making processes. This is where Explainable AI (XAI) becomes indispensable, offering intuitive explanations for model decisions. In this work, we propose a simple yet highly effective approach, ScoreCAM++, which introduces… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  3. arXiv:2403.20317  [pdf, other

    cs.CV

    Convolutional Prompting meets Language Models for Continual Learning

    Authors: Anurag Roy, Riddhiman Moulick, Vinay K. Verma, Saptarshi Ghosh, Abir Das

    Abstract: Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for overcoming catastrophic forgetting in CL. These approaches rely on a pool of learnable prompts which can be inefficient in sharing knowledge across tasks leading t… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 Camera Ready

  4. arXiv:2401.11932  [pdf, other

    cs.DC

    Accelerating Causal Algorithms for Industrial-scale Data: A Distributed Computing Approach with Ray Framework

    Authors: Vishal Verma, Vinod Reddy, Jaiprakash Ravi

    Abstract: The increasing need for causal analysis in large-scale industrial datasets necessitates the development of efficient and scalable causal algorithms for real-world applications. This paper addresses the challenge of scaling causal algorithms in the context of conducting causal analysis on extensive datasets commonly encountered in industrial settings. Our proposed solution involves enhancing the sc… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    ACM Class: C.4; E.2; I.2.1

  5. arXiv:2401.07465  [pdf, other

    eess.SY cs.LG cs.NE

    Power Flow Analysis Using Deep Neural Networks in Three-Phase Unbalanced Smart Distribution Grids

    Authors: Deepak Tiwari, Mehdi Jabbari Zideh, Veeru Talreja, Vishal Verma, Sarika K. Solanki, Jignesh Solanki

    Abstract: Most power systems' approaches are currently tending towards stochastic and probabilistic methods due to the high variability of renewable sources and the stochastic nature of loads. Conventional power flow (PF) approaches such as forward-backward sweep (FBS) and Newton-Raphson require a high number of iterations to solve non-linear PF equations making them computationally very intensive. PF is th… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  6. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  7. arXiv:2312.01188  [pdf, other

    cs.LG cs.CV stat.ML

    Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning

    Authors: Soumya Roy, Vinay K Verma, Deepak Gupta

    Abstract: This paper proposes a simple but highly efficient expansion-based model for continual learning. The recent feature transformation, masking and factorization-based methods are efficient, but they grow the model only over the global or shared parameter. Therefore, these approaches do not fully utilize the previously learned information because the same task-specific parameter forgets the earlier kno… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: To be Appeared in WACV, 2024

  8. arXiv:2312.01167  [pdf, other

    cs.CV cs.LG stat.ML

    Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning

    Authors: Vinay K Verma, Nikhil Mehta, Kevin J Liang, Aakansha Mishra, Lawrence Carin

    Abstract: Zero-shot learning (ZSL) is a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed state of the art, but these generative models can be slow or computationally expensive to train. Also, these generative models as… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024. arXiv admin note: substantial text overlap with arXiv:2102.11856

  9. arXiv:2311.16496  [pdf, other

    cs.LG

    DPOD: Domain-Specific Prompt Tuning for Multimodal Fake News Detection

    Authors: Debarshi Brahma, Amartya Bhattacharya, Suraj Nagaje Mahadev, Anmol Asati, Vikas Verma, Soma Biswas

    Abstract: The spread of fake news using out-of-context images has become widespread and is a relevant problem in this era of information overload. Such out-of-context fake news may arise across different domains like politics, sports, entertainment, etc. In practical scenarios, an inherent problem of imbalance exists among news articles from such widely varying domains, resulting in a few domains with abund… ▽ More

    Submitted 12 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  10. arXiv:2310.05651  [pdf, other

    cs.LG cs.AI

    FENCE: Fairplay Ensuring Network Chain Entity for Real-Time Multiple ID Detection at Scale In Fantasy Sports

    Authors: Akriti Upreti, Kartavya Kothari, Utkarsh Thukral, Vishal Verma

    Abstract: Dream11 takes pride in being a unique platform that enables over 190 million fantasy sports users to demonstrate their skills and connect deeper with their favorite sports. While managing such a scale, one issue we are faced with is duplicate/multiple account creation in the system. This is done by some users with the intent of abusing the platform, typically for bonus offers. The challenge is to… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures, accepted in AIML Systems 2023

    ACM Class: I.2.1

  11. arXiv:2309.08227  [pdf, other

    cs.LG cs.AI cs.CV

    VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference

    Authors: Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai

    Abstract: Lifelong learning or continual learning is the problem of training an AI agent continuously while also preventing it from forgetting its previously acquired knowledge. Streaming lifelong learning is a challenging setting of lifelong learning with the goal of continuous learning in a dynamic non-stationary environment without forgetting. We introduce a novel approach to lifelong learning, which is… ▽ More

    Submitted 19 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  12. arXiv:2308.11357  [pdf, other

    cs.CV

    Exemplar-Free Continual Transformer with Convolutions

    Authors: Anurag Roy, Vinay Kumar Verma, Sravan Voonna, Kripabandhu Ghosh, Saptarshi Ghosh, Abir Das

    Abstract: Continual Learning (CL) involves training a machine learning model in a sequential manner to learn new information while retaining previously learned tasks without the presence of previous training data. Although there has been significant interest in CL, most recent CL approaches in computer vision have focused on convolutional architectures only. However, with the recent success of vision transf… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023

  13. arXiv:2305.15047  [pdf, other

    cs.CL cs.AI

    Ghostbuster: Detecting Text Ghostwritten by Large Language Models

    Authors: Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein

    Abstract: We introduce Ghostbuster, a state-of-the-art system for detecting AI-generated text. Our method works by passing documents through a series of weaker language models, running a structured search over possible combinations of their features, and then training a classifier on the selected features to predict whether documents are AI-generated. Crucially, Ghostbuster does not require access to token… ▽ More

    Submitted 5 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NAACL 2024

  14. arXiv:2305.12084  [pdf, other

    cs.CL

    Revisiting Entropy Rate Constancy in Text

    Authors: Vivek Verma, Nicholas Tomlin, Dan Klein

    Abstract: The uniform information density (UID) hypothesis states that humans tend to distribute information roughly evenly across an utterance or discourse. Early evidence in support of the UID hypothesis came from Genzel & Charniak (2002), which proposed an entropy rate constancy principle based on the probability of English text under n-gram language models. We re-evaluate the claims of Genzel & Charniak… ▽ More

    Submitted 17 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  15. arXiv:2301.11892  [pdf, other

    cs.LG cs.AI cs.CV

    Streaming LifeLong Learning With Any-Time Inference

    Authors: Soumya Banerjee, Vinay Kumar Verma, Vinay P. Namboodiri

    Abstract: Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups. These methods lack the ability to succeed in a rapidly changing \textit{dynamic} environment, where an AI agent needs to quickly learn new instances in a `single pass' from the non-i.i.d (also possibly t… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.10741

  16. arXiv:2212.13381  [pdf, other

    cs.LG cs.CV

    MixupE: Understanding and Improving Mixup from Directional Derivative Perspective

    Authors: Yingtian Zou, Vikas Verma, Sarthak Mittal, Wai Hoh Tang, Hieu Pham, Juho Kannala, Yoshua Bengio, Arno Solin, Kenji Kawaguchi

    Abstract: Mixup is a popular data augmentation technique for training deep neural networks where additional samples are generated by linearly interpolating pairs of inputs and their labels. This technique is known to improve the generalization performance in many learning paradigms and applications. In this work, we first analyze Mixup and show that it implicitly regularizes infinitely many directional deri… ▽ More

    Submitted 15 October, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: 16 pages, Best Student Paper Award at UAI 2023

  17. arXiv:2210.12818  [pdf, other

    cs.CV

    Pushing the Efficiency Limit Using Structured Sparse Convolutions

    Authors: Vinay Kumar Verma, Nikhil Mehta, Shi**g Si, Ricardo Henao, Lawrence Carin

    Abstract: Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance comparable to the original network. Unfortunately, finding these subnetworks involves iterative stages of training and pruning, which can be computationally expensive.… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2023

  18. arXiv:2210.09505  [pdf, other

    cs.LG stat.ML

    CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

    Authors: Alexia Jolicoeur-Martineau, Alex Lamb, Vikas Verma, Aniket Didolkar

    Abstract: We propose a novel regularizer for supervised learning called Conditioning on Noisy Targets (CNT). This approach consists in conditioning the model on a noisy version of the target(s) (e.g., actions in imitation learning or labels in classification) at a random noise level (from small to large noise). At inference time, since we do not know the target, we run the network with only noise in place o… ▽ More

    Submitted 26 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

  19. arXiv:2111.10970  [pdf, other

    cs.RO cs.AI cs.HC eess.SY

    Operations for Autonomous Spacecraft

    Authors: Rebecca Castano, Tiago Vaquero, Federico Rossi, Vandi Verma, Ellen Van Wyk, Dan Allard, Bennett Huffmann, Erin M. Murphy, Nihal Dhamani, Robert A. Hewitt, Scott Davidoff, Rashied Amini, Anthony Barrett, Julie Castillo-Rogez, Steve A. Chien, Mathieu Choukroun, Alain Dadaian, Raymond Francis, Benjamin Gorr, Mark Hofstadter, Mitch Ingham, Cristina Sorice, Iain Tierney

    Abstract: Onboard autonomy technologies such as planning and scheduling, identification of scientific targets, and content-based data summarization, will lead to exciting new space science missions. However, the challenge of operating missions with such onboard autonomous capabilities has not been studied to a level of detail sufficient for consideration in mission concepts. These autonomy capabilities will… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: 16 pages, 18 Figures, 1 Table, to be published in IEEE Aerospace 2022 (AeroConf 2022)

    Journal ref: Proceedings of the 2022 IEEE Aerospace Conference (IEEE AERO 2022), 1-20

  20. arXiv:2110.10741  [pdf, other

    cs.LG cs.AI cs.CV

    Class Incremental Online Streaming Learning

    Authors: Soumya Banerjee, Vinay Kumar Verma, Toufiq Parag, Maneesh Singh, Vinay P. Namboodiri

    Abstract: A wide variety of methods have been developed to enable lifelong learning in conventional deep neural networks. However, to succeed, these methods require a `batch' of samples to be available and visited multiple times during training. While this works well in a static setting, these methods continue to suffer in a more realistic situation where data arrives in \emph{online streaming manner}. We e… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

  21. arXiv:2110.01856  [pdf, other

    cs.LG cs.CV stat.ML

    Hypernetworks for Continual Semi-Supervised Learning

    Authors: Dhanajit Brahma, Vinay Kumar Verma, Piyush Rai

    Abstract: Learning from data sequentially arriving, possibly in a non i.i.d. way, with changing task distribution over time is called continual learning. Much of the work thus far in continual learning focuses on supervised learning and some recent works on unsupervised learning. In many domains, each task contains a mix of labelled (typically very few) and unlabelled (typically plenty) training examples, w… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted to CSSL workshop at IJCAI 2021 (Best Student Paper Award)

  22. arXiv:2106.06795  [pdf, other

    cs.LG cs.AI cs.CV

    Knowledge Consolidation based Class Incremental Online Learning with Limited Data

    Authors: Mohammed Asad Karim, Vinay Kumar Verma, Pravendra Singh, Vinay Namboodiri, Piyush Rai

    Abstract: We propose a novel approach for class incremental online learning in a limited data setting. This problem setting is challenging because of the following constraints: (1) Classes are given incrementally, which necessitates a class incremental learning approach; (2) Data for each class is given in an online fashion, i.e., each training example is seen only once during training; (3) Each class has v… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI-2021)

  23. arXiv:2104.04765  [pdf, other

    eess.IV cs.LG cs.MM

    Q-matrix Unaware Double JPEG Detection using DCT-Domain Deep BiLSTM Network

    Authors: Vinay Verma, Deepak Singh, Nitin Khanna

    Abstract: The double JPEG compression detection has received much attention in recent years due to its applicability as a forensic tool for the most widely used JPEG file format. Existing state-of-the-art CNN-based methods either use histograms of all the frequencies or rely on heuristics to select histograms of specific low frequencies to classify single and double compressed images. However, even amidst l… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

  24. arXiv:2103.13558  [pdf, other

    cs.LG cs.AI cs.CV

    Efficient Feature Transformations for Discriminative and Generative Continual Learning

    Authors: Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin

    Abstract: As neural networks are increasingly being applied to real-world applications, mechanisms to address distributional shift and sequential task learning without forgetting are critical. Methods incorporating network expansion have shown promise by naturally adding model capacity for learning new tasks while simultaneously avoiding catastrophic forgetting. However, the growth in the number of addition… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted in CVPR 2021

  25. arXiv:2103.04032  [pdf, other

    cs.LG cs.CV stat.ML

    CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks

    Authors: Sakshi Varshney, Vinay Kumar Verma, Srijith P K, Lawrence Carin, Piyush Rai

    Abstract: We present a continual learning approach for generative adversarial networks (GANs), by designing and leveraging parameter-efficient feature map transformations. Our approach is based on learning a set of global and task-specific parameters. The global parameters are fixed across tasks whereas the task-specific parameters act as local adapters for each task, and help in efficiently obtaining task-… ▽ More

    Submitted 30 July, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Under Submission

  26. arXiv:2102.11856  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Meta-Learned Attribute Self-Gating for Continual Generalized Zero-Shot Learning

    Authors: Vinay Kumar Verma, Kevin Liang, Nikhil Mehta, Lawrence Carin

    Abstract: Zero-shot learning (ZSL) has been shown to be a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges still remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed the state of the art of ZSL, but these generative models can be slow or computationally expensive to trai… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Under Review

  27. arXiv:2011.07279  [pdf, other

    cs.CV

    Towards Zero-Shot Learning with Fewer Seen Class Examples

    Authors: Vinay Kumar Verma, Ashish Mishra, Anubha Pandey, Hema A. Murthy, Piyush Rai

    Abstract: We present a meta-learning based generative model for zero-shot learning (ZSL) towards a challenging setting when the number of training examples from each \emph{seen} class is very few. This setup contrasts with the conventional ZSL approaches, where training typically assumes the availability of a sufficiently large number of training examples from each of the seen classes. The proposed approach… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Accepted in WACV 2021

  28. arXiv:2011.04419  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Domain-Agnostic Contrastive Learning

    Authors: Vikas Verma, Minh-Thang Luong, Kenji Kawaguchi, Hieu Pham, Quoc V. Le

    Abstract: Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image crop** and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, d… ▽ More

    Submitted 19 July, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Published in ICML 2021

  29. arXiv:2007.12212  [pdf, other

    cs.CV cs.CL cs.IR cs.LG

    ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions

    Authors: Anurag Roy, Vinay Kumar Verma, Kripabandhu Ghosh, Saptarshi Ghosh

    Abstract: Most existing algorithms for cross-modal Information Retrieval are based on a supervised train-test setup, where a model learns to align the mode of the query (e.g., text) to the mode of the documents (e.g., images) from a given training set. Such a setup assumes that the training set contains an exhaustive representation of all possible classes of queries. In reality, a retrieval model may need t… ▽ More

    Submitted 23 September, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: Accepted in CIKM-2020

  30. PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks

    Authors: Mojtaba Faramarzi, Mohammad Amini, Akilesh Badrinaaraayanan, Vikas Verma, Sarath Chandar

    Abstract: Large capacity deep learning models are often prone to a high generalization gap when trained with a limited amount of labeled training data. A recent class of methods to address this problem uses various ways to construct a new training sample by mixing a pair (or more) of training samples. We propose PatchUp, a hidden state block-level regularization technique for Convolutional Neural Networks (… ▽ More

    Submitted 7 January, 2023; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: AAAI - 2022

    Journal ref: AAAI, vol. 36, no. 1, pp. 589-597, Jun. 2022

  31. arXiv:2006.02158  [pdf, other

    cs.CV

    Interpolation-based semi-supervised learning for object detection

    Authors: Jisoo Jeong, Vikas Verma, Minsung Hyun, Juho Kannala, Nojun Kwak

    Abstract: Despite the data labeling cost for the object detection tasks being substantially more than that of the classification tasks, semi-supervised learning methods for object detection have not been studied much. In this paper, we propose an Interpolation-based Semi-supervised learning method for object Detection (ISD), which considers and solves the problems caused by applying conventional Interpolati… ▽ More

    Submitted 29 December, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

  32. arXiv:2005.09111  [pdf, other

    cs.CE math.OC

    Topology optimization of nonlinear periodically microstructured materials for tailored homogenized constitutive properties

    Authors: Reza Behrou, Maroun Abi Ghanem, Brianna C. Macnider, Vimarsh Verma, Ryan Alvey, **ho Hong, Ashley F. Emery, Hyunsun Alicia Kim, Nicholas Boechler

    Abstract: A topology optimization method is presented for the design of periodic microstructured materials with prescribed homogenized nonlinear constitutive properties over finite strain ranges. The mechanical model assumes linear elastic isotropic materials, geometric nonlinearity at finite strain, and a quasi-static response. The optimization problem is solved by a nonlinear programming method and the se… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

  33. arXiv:2004.10098  [pdf, other

    cs.LG stat.ML

    Continual Learning using a Bayesian Nonparametric Dictionary of Weight Factors

    Authors: Nikhil Mehta, Kevin J Liang, Vinay K Verma, Lawrence Carin

    Abstract: Naively trained neural networks tend to experience catastrophic forgetting in sequential task settings, where data from previous tasks are unavailable. A number of methods, using various model expansion strategies, have been proposed recently as possible solutions. However, determining how much to expand the model is left to the practitioner, and often a constant schedule is chosen for simplicity,… ▽ More

    Submitted 27 April, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021 Post-conference updates: Fixed typo in equation (11) and updated references

  34. arXiv:2001.06657  [pdf, other

    cs.CV cs.IR cs.LG stat.ML

    Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval

    Authors: Anubha Pandey, Ashish Mishra, Vinay Kumar Verma, Anurag Mittal, Hema A. Murthy

    Abstract: Conventional approaches to Sketch-Based Image Retrieval (SBIR) assume that the data of all the classes are available during training. The assumption may not always be practical since the data of a few classes may be unavailable, or the classes may not appear at the time of training. Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) relaxes this constraint and allows the algorithm to handle previous… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: Accepted in WACV'2020

  35. arXiv:2001.05545  [pdf, other

    cs.CV cs.LG stat.ML

    A "Network Pruning Network" Approach to Deep Model Compression

    Authors: Vinay Kumar Verma, Pravendra Singh, Vinay P. Namboodiri, Piyush Rai

    Abstract: We present a filter pruning approach for deep model compression, using a multitask network. Our approach is based on learning a a pruner network to prune a pre-trained target network. The pruner is essentially a multitask deep neural network with binary outputs that help identify the filters from each layer of the original network that do not have any significant contribution to the model and can… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted in WACV'20

  36. arXiv:1912.11570  [pdf, other

    cs.CV cs.LG stat.ML

    SketchTransfer: A Challenging New Task for Exploring Detail-Invariance and the Abstractions Learned by Deep Networks

    Authors: Alex Lamb, Sherjil Ozair, Vikas Verma, David Ha

    Abstract: Deep networks have achieved excellent results in perceptual tasks, yet their ability to generalize to variations not seen during training has come under increasing scrutiny. In this work we focus on their ability to have invariance towards the presence or absence of details. For example, humans are able to watch cartoons, which are missing many visual details, without being explicitly trained to d… ▽ More

    Submitted 24 December, 2019; originally announced December 2019.

    Comments: Accepted WACV 2020

  37. arXiv:1909.11715  [pdf, other

    cs.LG stat.ML

    GraphMix: Improved Training of GNNs for Semi-Supervised Learning

    Authors: Vikas Verma, Meng Qu, Kenji Kawaguchi, Alex Lamb, Yoshua Bengio, Juho Kannala, Jian Tang

    Abstract: We present GraphMix, a regularization method for Graph Neural Network based semi-supervised object classification, whereby we propose to train a fully-connected network jointly with the graph neural network via parameter sharing and interpolation-based regularization. Further, we provide a theoretical analysis of how GraphMix improves the generalization bounds of the underlying graph neural networ… ▽ More

    Submitted 8 October, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: https://github.com/vikasverma1077/GraphMix

  38. arXiv:1909.04344  [pdf, other

    stat.ML cs.CV cs.LG

    A Meta-Learning Framework for Generalized Zero-Shot Learning

    Authors: Vinay Kumar Verma, Dhanajit Brahma, Piyush Rai

    Abstract: Learning to classify unseen class samples at test time is popularly referred to as zero-shot learning (ZSL). If test samples can be from training (seen) as well as unseen classes, it is a more challenging problem due to the existence of strong bias towards seen classes. This problem is generally known as \emph{generalized} zero-shot learning (GZSL). Thanks to the recent advances in generative mode… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Under Submission

  39. arXiv:1908.01000  [pdf, other

    cs.LG cs.AI stat.ML

    InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization

    Authors: Fan-Yun Sun, Jordan Hoffmann, Vikas Verma, Jian Tang

    Abstract: This paper studies learning the representations of whole graphs in both unsupervised and semi-supervised scenarios. Graph-level representations are critical in a variety of real-world applications such as predicting the properties of molecules and community analysis in social networks. Traditional graph kernel based methods are simple, yet effective for obtaining fixed-length representations for g… ▽ More

    Submitted 17 January, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: ICLR 2020 (spotlight)

  40. arXiv:1907.07287  [pdf, other

    cs.LG cs.CV stat.ML

    Towards Understanding Generalization in Gradient-Based Meta-Learning

    Authors: Simon Guiroy, Vikas Verma, Christopher Pal

    Abstract: In this work we study generalization of neural networks in gradient-based meta-learning by analyzing various properties of the objective landscapes. We experimentally demonstrate that as meta-training progresses, the meta-test solutions, obtained after adapting the meta-train solution of the model, to new tasks via few steps of gradient-based fine-tuning, become flatter, lower in loss, and further… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

  41. Interpolated Adversarial Training: Achieving Robust Neural Networks without Sacrificing Too Much Accuracy

    Authors: Alex Lamb, Vikas Verma, Kenji Kawaguchi, Alexander Matyasko, Savya Khosla, Juho Kannala, Yoshua Bengio

    Abstract: Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance on the unperturbed data. This could have a major impact on how the adversarial robustness affects real world systems (i.e. many may opt to forego robustness if… ▽ More

    Submitted 19 October, 2022; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: This is the latest version, which is published in the Journal, "Neural Networks", in 2022. All the previous results are unchanged. First two authors contributed equally

    Journal ref: Neural Networks, volume 154, pages 218-233 (2022)

  42. arXiv:1906.03038  [pdf, other

    cs.LG cs.CV stat.ML

    A Generative Framework for Zero-Shot Learning with Adversarial Domain Adaptation

    Authors: Varun Khare, Divyat Mahajan, Homanga Bharadhwaj, Vinay Verma, Piyush Rai

    Abstract: We present a domain adaptation based generative framework for zero-shot learning. Our framework addresses the problem of domain shift between the seen and unseen class distributions in zero-shot learning and minimizes the shift by develo** a generative model trained via adversarial domain adaptation. Our approach is based on end-to-end learning of the class distributions of seen classes and unse… ▽ More

    Submitted 22 February, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: Proceedings of Winter Conference on Applications of Computer Vision (WACV) 2020

  43. arXiv:1905.04446  [pdf, other

    cs.CV cs.AI cs.LG

    Play and Prune: Adaptive Filter Pruning for Deep Model Compression

    Authors: Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

    Abstract: While convolutional neural networks (CNN) have achieved impressive performance on various classification/recognition tasks, they typically consist of a massive number of parameters. This results in significant memory requirement as well as computational overheads. Consequently, there is a growing need for filter-level pruning approaches for compressing CNN based models that not only reduce the tot… ▽ More

    Submitted 11 May, 2019; originally announced May 2019.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI-2019)

  44. arXiv:1904.08542  [pdf, other

    cs.CV cs.IR stat.ML

    Generative Model for Zero-Shot Sketch-Based Image Retrieval

    Authors: Vinay Kumar Verma, Aakansha Mishra, Ashish Mishra, Piyush Rai

    Abstract: We present a probabilistic model for Sketch-Based Image Retrieval (SBIR) where, at retrieval time, we are given sketches from novel classes, that were not present at training time. Existing SBIR methods, most of which rely on learning class-wise correspondences between sketches and images, typically work well only for previously seen sketch classes, and result in poor retrieval performance on nove… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: Accepted at CVPR-Workshop 2019

  45. arXiv:1903.04120  [pdf, other

    cs.CV cs.AI cs.LG

    HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

    Authors: Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

    Abstract: We present a novel deep learning architecture in which the convolution operation leverages heterogeneous kernels. The proposed HetConv (Heterogeneous Kernel-Based Convolution) reduces the computation (FLOPs) and the number of parameters as compared to standard convolution operation while still maintaining representational efficiency. To show the effectiveness of our proposed convolution, we presen… ▽ More

    Submitted 25 March, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: Accepted in CVPR 2019

  46. arXiv:1903.03825  [pdf

    stat.ML cs.AI cs.LG

    Interpolation Consistency Training for Semi-Supervised Learning

    Authors: Vikas Verma, Kenji Kawaguchi, Alex Lamb, Juho Kannala, Arno Solin, Yoshua Bengio, David Lopez-Paz

    Abstract: We introduce Interpolation Consistency Training (ICT), a simple and computation efficient algorithm for training Deep Neural Networks in the semi-supervised learning paradigm. ICT encourages the prediction at an interpolation of unlabeled points to be consistent with the interpolation of the predictions at those points. In classification problems, ICT moves the decision boundary to low-density reg… ▽ More

    Submitted 19 October, 2022; v1 submitted 9 March, 2019; originally announced March 2019.

    Comments: This is the latest version, which is published in the Journal, "Neural Networks", in 2022. All the previous results are unchanged. Keyword: Deep Learning, Semi-supervised Learning, Mixup

    Journal ref: Neural Networks, volume 145, pages 90-106 (2022)

  47. arXiv:1903.02709  [pdf, other

    stat.ML cs.LG

    On Adversarial Mixup Resynthesis

    Authors: Christopher Beckham, Sina Honari, Vikas Verma, Alex Lamb, Farnoosh Ghadiri, R Devon Hjelm, Yoshua Bengio, Christopher Pal

    Abstract: In this paper, we explore new approaches to combining information encoded within the learned representations of auto-encoders. We explore models that are capable of combining the attributes of multiple inputs such that a resynthesised output is trained to fool an adversarial discriminator for real versus synthesised data. Furthermore, we explore the use of such an architecture in the context of se… ▽ More

    Submitted 23 October, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 'Camera-ready draft'

  48. arXiv:1811.10559  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Leveraging Filter Correlations for Deep Model Compression

    Authors: Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

    Abstract: We present a filter correlation based model compression approach for deep convolutional neural networks. Our approach iteratively identifies pairs of filters with the largest pairwise correlations and drops one of the filters from each such pair. However, instead of discarding one of the filters from each such pair naïvely, the model is re-optimized to make the filters in these pairs maximally cor… ▽ More

    Submitted 15 January, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: IEEE Winter Conference on Applications of Computer Vision (WACV), 2020

  49. arXiv:1806.06765  [pdf, other

    cs.LG cs.NE q-bio.NC stat.ML

    Modularity Matters: Learning Invariant Relational Reasoning Tasks

    Authors: Jason Jo, Vikas Verma, Yoshua Bengio

    Abstract: We focus on two supervised visual reasoning tasks whose labels encode a semantic relational rule between two or more objects in an image: the MNIST Parity task and the colorized Pentomino task. The objects in the images undergo random translation, scaling, rotation and coloring transformations. Thus these tasks involve invariant relational reasoning. We report uneven performance of various deep CN… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: Modified abstract to fit arXiv character limit

  50. arXiv:1806.05236  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Manifold Mixup: Better Representations by Interpolating Hidden States

    Authors: Vikas Verma, Alex Lamb, Christopher Beckham, Amir Najafi, Ioannis Mitliagkas, Aaron Courville, David Lopez-Paz, Yoshua Bengio

    Abstract: Deep neural networks excel at learning the training data, but often provide incorrect and confident predictions when evaluated on slightly different test examples. This includes distribution shifts, outliers, and adversarial examples. To address these issues, we propose Manifold Mixup, a simple regularizer that encourages neural networks to predict less confidently on interpolations of hidden repr… ▽ More

    Submitted 11 May, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: To appear in ICML 2019