Skip to main content

Showing 51–85 of 85 results for author: Balasubramanian, V N

.
  1. arXiv:2006.07828  [pdf, other

    cs.CV cs.LG

    On Saliency Maps and Adversarial Robustness

    Authors: Puneet Mangla, Vedant Singh, Vineeth N Balasubramanian

    Abstract: A Very recent trend has emerged to couple the notion of interpretability and adversarial robustness, unlike earlier efforts which solely focused on good interpretations or robustness against adversaries. Works have shown that adversarially trained models exhibit more interpretable saliency maps than their non-robust counterparts, and that this behavior can be quantified by considering the alignmen… ▽ More

    Submitted 13 July, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Accepted at ECML-PKDD 2020, Acknowledgements added

  2. arXiv:2005.08632  [pdf, other

    cs.LG cs.CV stat.ML

    Universalization of any adversarial attack using very few test examples

    Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam, Vineeth N Balasubramanian

    Abstract: Deep learning models are known to be vulnerable not only to input-dependent adversarial attacks but also to input-agnostic or universal adversarial attacks. Dezfooli et al. \cite{Dezfooli17,Dezfooli17anal} construct universal adversarial attack on a given model by looking at a large number of training data points and the geometry of the decision boundary near them. Subsequent work \cite{Khrulkov18… ▽ More

    Submitted 28 October, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Appeared in ACM CODS-COMAD 2022 (Research Track)

  3. arXiv:2005.00364  [pdf, other

    cs.CV

    Generative Adversarial Data Programming

    Authors: Arghya Pal, Vineeth N Balasubramanian

    Abstract: The paucity of large curated hand-labeled training data forms a major bottleneck in the deployment of machine learning models in computer vision and other fields. Recent work (Data Programming) has shown how distant supervision signals in the form of labeling functions can be used to obtain labels for given data in near-constant time. In this work, we present Adversarial Data Programming (ADP), wh… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: arXiv admin note: text overlap with arXiv:1803.05137

  4. arXiv:2003.08798  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Incremental Object Detection via Meta-Learning

    Authors: K J Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting. We note that alt… ▽ More

    Submitted 15 December, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis & Machine Intelligence, Nov 2021. Code is available in https://github.com/JosephKJ/iOD

    Journal ref: TPAMI, Nov 2021

  5. arXiv:2003.06566  [pdf, other

    cs.LG cs.CV stat.ML

    On the benefits of defining vicinal distributions in latent space

    Authors: Puneet Mangla, Vedant Singh, Shreyas Jayant Havaldar, Vineeth N Balasubramanian

    Abstract: The vicinal risk minimization (VRM) principle is an empirical risk minimization (ERM) variant that replaces Dirac masses with vicinal functions. There is strong numerical and theoretical evidence showing that VRM outperforms ERM in terms of generalization if appropriate vicinal functions are chosen. Mixup Training (MT), a popular choice of vicinal distribution, improves the generalization performa… ▽ More

    Submitted 18 October, 2021; v1 submitted 14 March, 2020; originally announced March 2020.

    Comments: Accepted at Elsevier Pattern Recognition Letters (2021), Best Paper Award at CVPR 2021 Workshop on Adversarial Machine Learning in Real-World Computer Vision (AML-CV), Also accepted at ICLR 2021 Workshops on Robust-Reliable Machine Learning (Oral) and Generalization beyond the training distribution (Abstract)

  6. arXiv:2002.11318  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Can we have it all? On the Trade-off between Spatial and Adversarial Robustness of Neural Networks

    Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam, Vineeth N Balasubramanian

    Abstract: (Non-)robustness of neural networks to small, adversarial pixel-wise perturbations, and as more recently shown, to even random spatial transformations (e.g., translations, rotations) entreats both theoretical and empirical understanding. Spatial robustness to random translations and rotations is commonly attained via equivariant models (e.g., StdCNNs, GCNNs) and training augmentation, whereas adve… ▽ More

    Submitted 10 November, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Accepted NeurIPS 2021. Preliminary version consisting early experimental results was presented in ICML 2018 Workshop on "Towards learning with limited labels: Equivariance, Invariance,and Beyond" as "Understanding Adversarial Robustness of Symmetric Networks"

  7. arXiv:2001.05873  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    A Little Fog for a Large Turn

    Authors: Harshitha Machiraju, Vineeth N Balasubramanian

    Abstract: Small, carefully crafted perturbations called adversarial perturbations can easily fool neural networks. However, these perturbations are largely additive and not naturally found. We turn our attention to the field of Autonomous navigation wherein adverse weather conditions such as fog have a drastic effect on the predictions of these systems. These weather conditions are capable of acting like na… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: Accepted to WACV 2020

  8. arXiv:1911.13073  [pdf, other

    cs.CV cs.LG eess.IV

    Attributional Robustness Training using Input-Gradient Spatial Alignment

    Authors: Mayank Singh, Nupur Kumari, Puneet Mangla, Abhishek Sinha, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: Interpretability is an emerging area of research in trustworthy machine learning. Safe deployment of machine learning system mandates that the prediction and its explanation be reliable and robust. Recently, it has been shown that the explanations could be manipulated easily by adding visually imperceptible perturbations to the input while kee** the model's prediction intact. In this work, we st… ▽ More

    Submitted 18 July, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: ECCV 2020, Code at https://github.com/nupurkmr9/Attributional-Robustness

  9. Active Learning with Point Supervision for Cost-Effective Panicle Detection in Cereal Crops

    Authors: Akshay L Chandra, Sai Vikas Desai, Vineeth N Balasubramanian, Seishi Ninomiya, Wei Guo

    Abstract: Panicle density of cereal crops such as wheat and sorghum is one of the main components for plant breeders and agronomists in understanding the yield of their crops. To phenotype the panicle density effectively, researchers agree there is a significant need for computer vision-based object detection techniques. Especially in recent times, research in deep learning-based object detection shows prom… ▽ More

    Submitted 17 April, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: Accepted as a journal paper at BMC Plant Methods (February 2020)

  10. arXiv:1908.02454  [pdf, other

    cs.CV

    An Adaptive Supervision Framework for Active Learning in Object Detection

    Authors: Sai Vikas Desai, Akshay L Chandra, Wei Guo, Seishi Ninomiya, Vineeth N Balasubramanian

    Abstract: Active learning approaches in computer vision generally involve querying strong labels for data. However, previous works have shown that weak supervision can be effective in training models for vision tasks while greatly reducing annotation costs. Using this knowledge, we propose an adaptive supervision framework for active learning and demonstrate its effectiveness on the task of object detection… ▽ More

    Submitted 15 October, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: Accepted in BMVC 2019

  11. arXiv:1908.00706  [pdf, other

    cs.CV cs.LG

    AdvGAN++ : Harnessing latent layers for adversary generation

    Authors: Puneet Mangla, Surgan Jandial, Sakshi Varshney, Vineeth N Balasubramanian

    Abstract: Adversarial examples are fabricated examples, indistinguishable from the original image that mislead neural networks and drastically lower their performance. Recently proposed AdvGAN, a GAN based approach, takes input image as a prior for generating adversaries to target a model. In this work, we show how latent features can serve as better priors than input images for adversary generation by prop… ▽ More

    Submitted 23 December, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: Accepted at Neural Architects Workshop, ICCV 2019

  12. arXiv:1907.12087  [pdf, other

    cs.LG cs.CV stat.ML

    Charting the Right Manifold: Manifold Mixup for Few-shot Learning

    Authors: Puneet Mangla, Mayank Singh, Abhishek Sinha, Nupur Kumari, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: Few-shot learning algorithms aim to learn model parameters capable of adapting to unseen classes with the help of only a few labeled examples. A recent regularization technique - Manifold Mixup focuses on learning a general-purpose representation, robust to small changes in the data distribution. Since the goal of few-shot learning is closely linked to robust representation learning, we study Mani… ▽ More

    Submitted 18 January, 2020; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: WACV 2020, Code: https://github.com/nupurkmr9/S2M2_fewshot

  13. arXiv:1906.08771  [pdf, other

    cs.LG stat.ML

    Submodular Batch Selection for Training Deep Neural Networks

    Authors: K J Joseph, Vamshi Teja R, Krishnakant Singh, Vineeth N Balasubramanian

    Abstract: Mini-batch gradient descent based methods are the de facto algorithms for training neural network architectures today. We introduce a mini-batch selection strategy based on submodular function maximization. Our novel submodular formulation captures the informativeness of each sample and diversity of the whole subset. We design an efficient, greedy algorithm which can give high-quality solutions to… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: IJCAI 2019

  14. Automatic estimation of heading date of paddy rice using deep learning

    Authors: Sai Vikas Desai, Vineeth N Balasubramanian, Tokihiro Fukatsu, Seishi Ninomiya, Wei Guo

    Abstract: Accurate estimation of heading date of paddy rice greatly helps the breeders to understand the adaptability of different crop varieties in a given location. The heading date also plays a vital role in determining grain yield for research experiments. Visual examination of the crop is laborious and time consuming. Therefore, quick and precise estimation of heading date of paddy rice is highly essen… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  15. Borrow from Anywhere: Pseudo Multi-modal Object Detection in Thermal Imagery

    Authors: Chaitanya Devaguptapu, Ninad Akolekar, Manuj M Sharma, Vineeth N Balasubramanian

    Abstract: Can we improve detection in the thermal domain by borrowing features from rich domains like visual RGB? In this paper, we propose a pseudo-multimodal object detector trained on natural image domain data to help improve the performance of object detection in thermal images. We assume access to a large-scale dataset in the visual RGB domain and relatively smaller dataset (in terms of instances) in t… ▽ More

    Submitted 15 July, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Accepted at Perception Beyond Visible Spectrum Workshop, CVPR 2019

  16. arXiv:1905.05186  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models

    Authors: Mayank Singh, Abhishek Sinha, Nupur Kumari, Harshitha Machiraju, Balaji Krishnamurthy, Vineeth N Balasubramanian

    Abstract: Neural networks are vulnerable to adversarial attacks -- small visually imperceptible crafted noise which when added to the input drastically changes the output. The most effective method of defending against these adversarial attacks is to use the methodology of adversarial training. We analyze the adversarially trained robust models to study their vulnerability against adversarial attacks at the… ▽ More

    Submitted 25 June, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: Accepted at IJCAI 2019

  17. arXiv:1904.03620  [pdf, other

    cs.GR cs.LG stat.ML

    Teaching GANs to Sketch in Vector Format

    Authors: Varshaneya V, S Balasubramanian, Vineeth N Balasubramanian

    Abstract: Sketching is more fundamental to human cognition than speech. Deep Neural Networks (DNNs) have achieved the state-of-the-art in speech-related tasks but have not made significant development in generating stroke-based sketches a.k.a sketches in vector format. Though there are Variational Auto Encoders (VAEs) for generating sketches in vector format, there is no Generative Adversarial Network (GAN)… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

  18. arXiv:1903.01092  [pdf, other

    cs.CV

    Zero-Shot Task Transfer

    Authors: Arghya Pal, Vineeth N Balasubramanian

    Abstract: In this work, we present a novel meta-learning algorithm, i.e. TTNet, that regresses model parameters for novel tasks for which no ground truth is available (zero-shot tasks). In order to adapt to novel zero-shot tasks, our meta-learner learns from the model parameters of known tasks (with ground truth) and the correlation of known tasks to zero-shot tasks. Such intuition finds its foothold in cog… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

  19. arXiv:1902.02302  [pdf, other

    cs.LG stat.ML

    Neural Network Attributions: A Causal Perspective

    Authors: Aditya Chattopadhyay, Piyushi Manupriya, Anirban Sarkar, Vineeth N Balasubramanian

    Abstract: We propose a new attribution method for neural networks developed using first principles of causality (to the best of our knowledge, the first such). The neural network architecture is viewed as a Structural Causal Model, and a methodology to compute the causal effect of each feature on the output is presented. With reasonable assumptions on the causal structure of the input data, we propose algor… ▽ More

    Submitted 3 July, 2019; v1 submitted 6 February, 2019; originally announced February 2019.

    Comments: 17 pages, 10 Figures. Accepted in the Proceedings of the 36th International Conference on Machine Learning (ICML2019). Modifications: Added github link to code and fixed a typo in Fig. 3

    Journal ref: Proceedings of the 36th International Conference on Machine Learning 97 (2019) 981-990

  20. DANTE: Deep AlterNations for Training nEural networks

    Authors: Vaibhav B Sinha, Sneha Kudugunta, Adepu Ravi Sankar, Surya Teja Chavali, Purushottam Kar, Vineeth N Balasubramanian

    Abstract: We present DANTE, a novel method for training neural networks using the alternating minimization principle. DANTE provides an alternate perspective to traditional gradient-based backpropagation techniques commonly used to train deep networks. It utilizes an adaptation of quasi-convexity to cast training a neural network as a bi-quasi-convex optimization problem. We show that for neural network con… ▽ More

    Submitted 9 August, 2020; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 19 pages

    Journal ref: Neural Networks 131 (2020) 127-143

  21. arXiv:1901.02675  [pdf, other

    cs.CV

    Low-Cost Transfer Learning of Face Tasks

    Authors: Thrupthi Ann John, Isha Dua, Vineeth N Balasubramanian, C. V. Jawahar

    Abstract: Do we know what the different filters of a face network represent? Can we use this filter information to train other tasks without transfer learning? For instance, can age, head pose, emotion and other face related tasks be learned from face recognition network without transfer learning? Understanding the role of these filters allows us to transfer knowledge across tasks and take advantage of larg… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.

  22. arXiv:1809.10238  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

    Authors: K J Joseph, Arghya Pal, Sailaja Rajanala, Vineeth N Balasubramanian

    Abstract: Generating an image from its description is a challenging task worth solving because of its numerous practical applications ranging from image editing to virtual reality. All existing methods use one single caption to generate a plausible image. A single caption by itself, can be limited, and may not be able to capture the variety of concepts and behavior that may be present in the image. We propo… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: To appear in the proceedings of IEEE Winter Conference on Applications of Computer Vision, WACV-2019

  23. arXiv:1809.07499  [pdf, other

    cs.CV cs.AI cs.LG

    MASON: A Model AgnoStic ObjectNess Framework

    Authors: K J Joseph, Vineeth N Balasubramanian

    Abstract: This paper proposes a simple, yet very effective method to localize dominant foreground objects in an image, to pixel-level precision. The proposed method 'MASON' (Model-AgnoStic ObjectNess) uses a deep convolutional network to generate category-independent and model-agnostic heat maps for any image. The network is not explicitly trained for the task, and hence, can be used off-the-shelf in tandem… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Accepted at AutoNUE Workshop, 15th European Conference on Computer Vision (ECCV), September 2018, Munich, Germany

  24. arXiv:1807.08140  [pdf, other

    cs.LG math.OC stat.ML

    On the Analysis of Trajectories of Gradient Descent in the Optimization of Deep Neural Networks

    Authors: Adepu Ravi Sankar, Vishwak Srinivasan, Vineeth N Balasubramanian

    Abstract: Theoretical analysis of the error landscape of deep neural networks has garnered significant interest in recent years. In this work, we theoretically study the importance of noise in the trajectories of gradient descent towards optimal solutions in multi-layer neural networks. We show that adding noise (in different ways) to a neural network while training increases the rank of the product of weig… ▽ More

    Submitted 21 July, 2018; originally announced July 2018.

    Comments: 4 pages + 1 figure (main, excluding references), 5 pages + 4 figures (appendix)

  25. arXiv:1803.05137  [pdf, other

    cs.CV

    Adversarial Data Programming: Using GANs to Relax the Bottleneck of Curated Labeled Data

    Authors: Arghya Pal, Vineeth N Balasubramanian

    Abstract: Paucity of large curated hand-labeled training data for every domain-of-interest forms a major bottleneck in the deployment of machine learning models in computer vision and other fields. Recent work (Data Programming) has shown how distant supervision signals in the form of labeling functions can be used to obtain labels for given data in near-constant time. In this work, we present Adversarial D… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.

    Comments: CVPR 2018 main conference paper

  26. arXiv:1803.02781  [pdf, other

    stat.ML cs.LG

    Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification

    Authors: Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian

    Abstract: Many real world problems can now be effectively solved using supervised machine learning. A major roadblock is often the lack of an adequate quantity of labeled data for training. A possible solution is to assign the task of labeling data to a crowd, and then infer the true label using aggregation methods. A well-known approach for aggregation is the Dawid-Skene (DS) algorithm, which is based on t… ▽ More

    Submitted 7 September, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

    Comments: 8 pages, 5 tables, 1 figure, KDD Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM) 2018

  27. arXiv:1712.07424  [pdf, ps, other

    stat.ML cs.LG

    ADINE: An Adaptive Momentum Method for Stochastic Gradient Descent

    Authors: Vishwak Srinivasan, Adepu Ravi Sankar, Vineeth N Balasubramanian

    Abstract: Two major momentum-based techniques that have achieved tremendous success in optimization are Polyak's heavy ball method and Nesterov's accelerated gradient. A crucial step in all momentum-based methods is the choice of the momentum parameter $m$ which is always suggested to be set to less than $1$. Although the choice of $m < 1$ is justified only under very strong theoretical assumptions, it work… ▽ More

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: 8 + 1 pages, 12 figures, accepted at CoDS-COMAD 2018

  28. arXiv:1711.04150  [pdf, other

    cs.SI cs.LG stat.ML

    STWalk: Learning Trajectory Representations in Temporal Graphs

    Authors: Supriya Pandhre, Himangi Mittal, Manish Gupta, Vineeth N Balasubramanian

    Abstract: Analyzing the temporal behavior of nodes in time-varying graphs is useful for many applications such as targeted advertising, community evolution and outlier detection. In this paper, we present a novel approach, STWalk, for learning trajectory representations of nodes in temporal graphs. The proposed framework makes use of structural properties of graphs at current and previous time-steps to lear… ▽ More

    Submitted 11 November, 2017; originally announced November 2017.

    Comments: 10 pages, 5 figures, 2 tables

  29. Grad-CAM++: Improved Visual Explanations for Deep Convolutional Networks

    Authors: Aditya Chattopadhyay, Anirban Sarkar, Prantik Howlader, Vineeth N Balasubramanian

    Abstract: Over the last decade, Convolutional Neural Network (CNN) models have been highly successful in solving complex vision problems. However, these deep models are perceived as "black box" methods considering the lack of understanding of their internal functioning. There has been a significant recent interest in develo** explainable deep learning models, and this paper is an effort in this direction.… ▽ More

    Submitted 9 November, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: 17 Pages, 15 Figures, 11 Tables. Accepted in the proceedings of IEEE Winter Conf. on Applications of Computer Vision (WACV2018). Extended version is under review at IEEE Transactions on Pattern Analysis and Machine Intelligence

  30. arXiv:1708.05980  [pdf, other

    cs.CV

    Attentive Semantic Video Generation using Captions

    Authors: Tanya Marwah, Gaurav Mittal, Vineeth N. Balasubramanian

    Abstract: This paper proposes a network architecture to perform variable length semantic video generation using captions. We adopt a new perspective towards video generation where we allow the captions to be combined with the long-term and short-term dependencies between video frames and thus generate a video in an incremental manner. Our experiments demonstrate our network architecture's ability to disting… ▽ More

    Submitted 21 October, 2017; v1 submitted 20 August, 2017; originally announced August 2017.

    Journal ref: Presented at ICCV 2017 (International Conference on Computer Vision)

  31. arXiv:1706.07530  [pdf, ps, other

    cs.CV

    Multiresolution Match Kernels for Gesture Video Classification

    Authors: Hemanth Venkateswara, Vineeth N. Balasubramanian, Prasanth Lade, Sethuraman Panchanathan

    Abstract: The emergence of depth imaging technologies like the Microsoft Kinect has renewed interest in computational methods for gesture classification based on videos. For several years now, researchers have used the Bag-of-Features (BoF) as a primary method for generation of feature vectors from video data for recognition of gestures. However, the BoF method is a coarse representation of the information… ▽ More

    Submitted 22 June, 2017; originally announced June 2017.

    Comments: ICME 2013 Conference

  32. arXiv:1706.02052  [pdf, other

    stat.ML cs.LG cs.NE

    Are Saddles Good Enough for Deep Learning?

    Authors: Adepu Ravi Sankar, Vineeth N Balasubramanian

    Abstract: Recent years have seen a growing interest in understanding deep neural networks from an optimization perspective. It is understood now that converging to low-cost local minima is sufficient for such models to become effective in practice. However, in this work, we propose a new hypothesis based on recent theoretical findings and empirical studies that deep neural network models actually converge t… ▽ More

    Submitted 7 June, 2017; originally announced June 2017.

  33. arXiv:1612.09435  [pdf, other

    cs.SI

    Community-based Outlier Detection for Edge-attributed Graphs

    Authors: Supriya Pandhre, Manish Gupta, Vineeth N Balasubramanian

    Abstract: The study of networks has emerged in diverse disciplines as a means of analyzing complex relationship data. Beyond graph analysis tasks like graph query processing, link analysis, influence propagation, there has recently been some work in the area of outlier detection for information network data. Although various kinds of outliers have been studied for graph data, there is not much work on anoma… ▽ More

    Submitted 11 November, 2017; v1 submitted 30 December, 2016; originally announced December 2016.

    Comments: 9 pages, 5 figures, 1 table

    ACM Class: G.2; G.3; H.2.8

  34. Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures

    Authors: Gaurav Mittal, Tanya Marwah, Vineeth N. Balasubramanian

    Abstract: This paper introduces a novel approach for generating videos called Synchronized Deep Recurrent Attentive Writer (Sync-DRAW). Sync-DRAW can also perform text-to-video generation which, to the best of our knowledge, makes it the first approach of its kind. It combines a Variational Autoencoder~(VAE) with a Recurrent Attention Mechanism in a novel manner to create a temporally dependent sequence of… ▽ More

    Submitted 21 October, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

  35. arXiv:1610.09650  [pdf, other

    cs.LG

    Deep Model Compression: Distilling Knowledge from Noisy Teachers

    Authors: Bharat Bhusan Sau, Vineeth N. Balasubramanian

    Abstract: The remarkable successes of deep learning models across various applications have resulted in the design of deeper networks that can solve complex problems. However, the increasing depth of such models also results in a higher storage and runtime complexity, which restricts the deployability of such very deep models on mobile and portable devices, which have limited storage and battery capacity. W… ▽ More

    Submitted 2 November, 2016; v1 submitted 30 October, 2016; originally announced October 2016.

    Comments: 9 pages, 3 figures