Showing 1–12 of 12 results for author: Konda, K

Searching in archive cs.
  1. arXiv:2310.04181

    cs.CV cs.RO

    DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

    Authors: Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda…

    Submitted 26 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  2. arXiv:2209.14922

    cs.CV cs.RO

    GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions

    Authors: Sanket Kalwar, Dhruv Patel, Aakash Aanegola, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Submitted to ICRA2023. More information at https://gatedip.github.io

  3. arXiv:2201.07120

    cs.CV cs.RO

    Contextual road lane and symbol generation for autonomous driving

    Authors: Ajay Soni, Pratik Padamwar, Krishna Reddy Konda

    Abstract: In this paper we present a novel approach for lane detection and segmentation using generative models. Traditionally discriminative models have been employed to classify pixels semantically on a road. We model the probability distribution of lanes and road symbols by training a generative adversarial network. Based on the learned probability distribution, context-aware lanes and road signs are gen…

    Submitted 18 January, 2022; originally announced January 2022.

  4. arXiv:1903.02893

    cs.LG stat.ML

    Only sparsity based loss function for learning representations

    Authors: Vivek Bakaraju, Kishore Reddy Konda

    Abstract: We study the emergence of sparse representations in neural networks. We show that in unsupervised models with regularization, the emergence of sparsity is the result of the input data samples being distributed along highly non-linear or discontinuous manifold. We also derive a similar argument for discriminatively trained networks and present experiments to support this hypothesis. Based on our st…

    Submitted 7 March, 2019; originally announced March 2019.

  5. arXiv:1705.06778

    cs.CV cs.NE

    Building effective deep neural network architectures one feature at a time

    Authors: Martin Mundt, Tobias Weis, Kishore Konda, Visvanathan Ramesh

    Abstract: Successful training of convolutional neural networks is often associated with sufficiently deep architectures composed of high amounts of features. These networks typically rely on a variety of regularization and pruning techniques to converge to less redundant states. We introduce a novel bottom-up approach to expand representations in fixed-depth architectures. These architectures start from jus…

    Submitted 19 October, 2017; v1 submitted 18 May, 2017; originally announced May 2017.

  6. arXiv:1511.02580

    cs.LG cs.NE

    How far can we go without convolution: Improving fully-connected networks

    Authors: Zhouhan Lin, Roland Memisevic, Kishore Konda

    Abstract: We propose ways to improve the performance of fully connected networks. We found that two approaches in particular have a strong effect on performance: linear bottleneck layers and unsupervised pre-training using autoencoders without hidden unit biases. We show how both approaches can be related to improving gradient flow and reducing sparsity in the network. We show that a fully connected network…

    Submitted 9 November, 2015; originally announced November 2015.

    Comments: 10 pages, 11 figures, submitted for ICLR 2016

  7. arXiv:1506.08700

    stat.ML cs.LG

    Dropout as data augmentation

    Authors: Xavier Bouthillier, Kishore Konda, Pascal Vincent, Roland Memisevic

    Abstract: Dropout is typically interpreted as bagging a large number of models sharing parameters. We show that using dropout in a network can also be interpreted as a kind of data augmentation in the input space without domain knowledge. We present an approach to projecting the dropout noise within a network back into the input space, thereby generating augmented versions of the training data, and we show…

    Submitted 7 January, 2016; v1 submitted 29 June, 2015; originally announced June 2015.

  8. arXiv:1503.01800

    cs.LG cs.CV

    EmoNets: Multimodal deep learning approaches for emotion recognition in video

    Authors: Samira Ebrahimi Kahou, Xavier Bouthillier, Pascal Lamblin, Caglar Gulcehre, Vincent Michalski, Kishore Konda, Sébastien Jean, Pierre Froumenty, Yann Dauphin, Nicolas Boulanger-Lewandowski, Raul Chandias Ferrari, Mehdi Mirza, David Warde-Farley, Aaron Courville, Pascal Vincent, Roland Memisevic, Christopher Pal, Yoshua Bengio

    Abstract: The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple…

    Submitted 29 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.

  9. arXiv:1402.3337

    stat.ML cs.CV cs.LG cs.NE

    Zero-bias autoencoders and the benefits of co-adapting features

    Authors: Kishore Konda, Roland Memisevic, David Krueger

    Abstract: Regularized training of an autoencoder typically results in hidden unit biases that take on large negative values. We show that negative biases are a natural result of using a hidden layer whose responsibility is to both represent the input data and act as a selection mechanism that ensures sparsity of the representation. We then show that negative biases impede the learning of data distributions…

    Submitted 8 April, 2015; v1 submitted 13 February, 2014; originally announced February 2014.

  10. arXiv:1402.2333

    cs.LG cs.CV stat.ML

    Modeling sequential data using higher-order relational features and predictive training

    Authors: Vincent Michalski, Roland Memisevic, Kishore Konda

    Abstract: Bi-linear feature learning models, like the gated autoencoder, were proposed as a way to model relationships between frames in a video. By minimizing reconstruction error of one frame, given the previous frame, these models learn "mapping units" that encode the transformations inherent in a sequence, and thereby learn to encode motion. In this work we extend bi-linear models by introducing "higher…

    Submitted 10 February, 2014; originally announced February 2014.

  11. arXiv:1312.3429

    cs.CV cs.LG stat.ML

    Unsupervised learning of depth and motion

    Authors: Kishore Konda, Roland Memisevic

    Abstract: We present a model for the joint estimation of disparity and motion. The model is based on learning about the interrelations between images from multiple cameras, multiple frames in a video, or the combination of both. We show that learning depth and motion cues, as well as their combinations, from data is possible within a single type of architecture and a single type of learning algorithm, by us…

    Submitted 16 December, 2013; v1 submitted 12 December, 2013; originally announced December 2013.

  12. arXiv:1306.3162

    cs.CV cs.LG stat.ML

    Learning to encode motion using spatio-temporal synchrony

    Authors: Kishore Reddy Konda, Roland Memisevic, Vincent Michalski

    Abstract: We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing the motion we wish to detect. We show that learning about synchrony is possible using very fast, local learning rules, by introducing multiplicative "gating" in…

    Submitted 10 February, 2014; v1 submitted 13 June, 2013; originally announced June 2013.