Skip to main content

Showing 1–27 of 27 results for author: Sindagi, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.10970  [pdf, other

    cs.CV

    Unsupervised Restoration of Weather-affected Images using Deep Gaussian Process-based CycleGAN

    Authors: Rajeev Yasarla, Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Existing approaches for restoring weather-degraded images follow a fully-supervised paradigm and they require paired data for training. However, collecting paired data for weather degradations is extremely challenging, and existing methods end up training on synthetic data. To overcome this issue, we describe an approach for supervising deep networks that are based on CycleGAN, thereby enabling th… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted at ICPR 2022

  2. arXiv:2109.14651  [pdf, other

    cs.CV

    Uncertainty-aware Mean Teacher for Source-free Unsupervised Domain Adaptive 3D Object Detection

    Authors: Deepti Hegde, Vishwanath Sindagi, Velat Kilic, A. Brinton Cooper, Mark Foster, Vishal Patel

    Abstract: Pseudo-label based self training approaches are a popular method for source-free unsupervised domain adaptation. However, their efficacy depends on the quality of the labels generated by the source trained model. These labels may be incorrect with high confidence, rendering thresholding methods ineffective. In order to avoid reinforcing errors caused by label noise, we propose an uncertainty-aware… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

  3. arXiv:2107.07004  [pdf, other

    cs.CV physics.optics

    Lidar Light Scattering Augmentation (LISA): Physics-based Simulation of Adverse Weather Conditions for 3D Object Detection

    Authors: Velat Kilic, Deepti Hegde, Vishwanath Sindagi, A. Brinton Cooper, Mark A. Foster, Vishal M. Patel

    Abstract: Lidar-based object detectors are critical parts of the 3D perception pipeline in autonomous navigation systems such as self-driving cars. However, they are known to be sensitive to adverse weather conditions such as rain, snow and fog due to reduced signal-to-noise ratio (SNR) and signal-to-background ratio (SBR). As a result, lidar-based object detectors trained on data captured in normal weather… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

  4. arXiv:2105.13502  [pdf, other

    cs.CV cs.LG

    Unsupervised Domain Adaptation of Object Detectors: A Survey

    Authors: Poojan Oza, Vishwanath A. Sindagi, Vibashan VS, Vishal M. Patel

    Abstract: Recent advances in deep learning have led to the development of accurate and efficient models for various computer vision applications such as classification, segmentation, and detection. However, learning highly accurate models relies on the availability of large-scale annotated datasets. Due to this, model performance drops drastically when evaluated on label-scarce datasets having visually dist… ▽ More

    Submitted 4 July, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

  5. arXiv:2103.04224  [pdf, other

    cs.CV

    MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection

    Authors: Vibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Existing approaches for unsupervised domain adaptive object detection perform feature alignment via adversarial training. While these methods achieve reasonable improvements in performance, they typically perform category-agnostic domain alignment, thereby resulting in negative transfer of features. To overcome this issue, in this work, we attempt to incorporate category information into the domai… ▽ More

    Submitted 3 April, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  6. arXiv:2010.01663  [pdf, other

    eess.IV cs.CV

    KiU-Net: Overcomplete Convolutional Architectures for Biomedical Image and Volumetric Segmentation

    Authors: Jeya Maria Jose Valanarasu, Vishwanath A. Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

    Abstract: Most methods for medical image segmentation use U-Net or its variants as they have been successful in most of the applications. After a detailed analysis of these "traditional" encoder-decoder based approaches, we observed that they perform poorly in detecting smaller structures and are unable to segment boundary regions precisely. This issue can be attributed to the increase in receptive field si… ▽ More

    Submitted 14 October, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: Journal Extension of KiU-Net (MICCAI-2020)

  7. arXiv:2009.13075  [pdf, other

    cs.CV

    Semi-Supervised Image Deraining using Gaussian Processes

    Authors: Rajeev Yasarla, V. A. Sindagi, V. M. Patel

    Abstract: Recent CNN-based methods for image deraining have achieved excellent performance in terms of reconstruction error as well as visual quality. However, these methods are limited in the sense that they can be trained only on fully labeled data. Due to various challenges in obtaining real world fully-labeled image deraining datasets, existing methods are trained only on synthetically generated data an… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with 2006.05580

  8. arXiv:2009.06420  [pdf, other

    cs.CV

    Completely Self-Supervised Crowd Counting via Distribution Matching

    Authors: Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel

    Abstract: Dense crowd counting is a challenging task that demands millions of head annotations for training models. Though existing self-supervised approaches could learn good representations, they require some labeled data to map these features to the end task of density estimation. We mitigate this issue with the proposed paradigm of complete self-supervision, which does not need even a single labeled ima… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  9. arXiv:2007.03195  [pdf, other

    cs.CV

    Learning to Count in the Crowd from Limited Labeled Data

    Authors: Vishwanath A. Sindagi, Rajeev Yasarla, Deepak Sam Babu, R. Venkatesh Babu, Vishal M. Patel

    Abstract: Recent crowd counting approaches have achieved excellent performance. However, they are essentially based on fully supervised paradigm and require large number of annotated samples. Obtaining annotations is an expensive and labour-intensive process. In this work, we focus on reducing the annotation efforts by learning to count in the crowd from limited number of labeled samples while leveraging a… ▽ More

    Submitted 8 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted at ECCV 2020

  10. arXiv:2006.05580  [pdf, other

    cs.CV

    Syn2Real Transfer Learning for Image Deraining using Gaussian Processes

    Authors: Rajeev Yasarla, Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Recent CNN-based methods for image deraining have achieved excellent performance in terms of reconstruction error as well as visual quality. However, these methods are limited in the sense that they can be trained only on fully labeled data. Due to various challenges in obtaining real world fully-labeled image deraining datasets, existing methods are trained only on synthetically generated data an… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted at CVPR 2020

  11. arXiv:2006.04878  [pdf, other

    eess.IV cs.CV

    KiU-Net: Towards Accurate Segmentation of Biomedical Images using Over-complete Representations

    Authors: Jeya Maria Jose, Vishwanath Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

    Abstract: Due to its excellent performance, U-Net is the most widely used backbone architecture for biomedical image segmentation in the recent years. However, in our studies, we observe that there is a considerable performance drop in the case of detecting smaller anatomical landmarks with blurred noisy boundaries. We analyze this issue in detail, and address it by proposing an over-complete architecture (… ▽ More

    Submitted 8 July, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at MICCAI 2020

  12. arXiv:2004.03597  [pdf, other

    cs.CV

    JHU-CROWD++: Large-Scale Crowd Counting Dataset and A Benchmark Method

    Authors: Vishwanath A. Sindagi, Rajeev Yasarla, Vishal M. Patel

    Abstract: Due to its variety of applications in the real-world, the task of single image-based crowd counting has received a lot of interest in the recent years. Recently, several approaches have been proposed to address various problems encountered in crowd counting. These approaches are essentially based on convolutional neural networks that require large amounts of data to train the network parameters. C… ▽ More

    Submitted 2 November, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted at T-PAMI 2020. The dataset can be downloaded from http://www.crowd-counting.com. arXiv admin note: substantial text overlap with arXiv:1910.12384

  13. arXiv:1912.00070  [pdf, other

    cs.CV

    Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions

    Authors: Vishwanath A. Sindagi, Poojan Oza, Rajeev Yasarla, Vishal M. Patel

    Abstract: Adverse weather conditions such as haze and rain corrupt the quality of captured images, which cause detection networks trained on clean images to perform poorly on these images. To address this issue, we propose an unsupervised prior-based domain adversarial object detection framework for adapting the detectors to hazy and rainy conditions. In particular, we use weather-specific prior knowledge o… ▽ More

    Submitted 15 July, 2020; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: Accepted at ECCV 2020

  14. arXiv:1910.12384  [pdf, other

    cs.CV

    Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method

    Authors: Vishwanath A. Sindagi, Rajeev Yasarla, Vishal M. Patel

    Abstract: In this work, we propose a novel crowd counting network that progressively generates crowd density maps via residual error estimation. The proposed method uses VGG16 as the backbone network and employs density map generated by the final layer as a coarse prediction to refine and generate finer density maps in a progressive fashion using residual learning. Additionally, the residual learning is gui… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Comments: Accepted at ICCV 2019

  15. arXiv:1908.10937  [pdf, other

    cs.CV

    Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting

    Authors: Vishwanath A Sindagi, Vishal M. Patel

    Abstract: Crowd counting presents enormous challenges in the form of large variation in scales within images and across the dataset. These issues are further exacerbated in highly congested scenes. Approaches based on straightforward fusion of multi-scale features from a deep network seem to be obvious solutions to this problem. However, these fusion approaches do not yield significant improvements in the c… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: Accepted at ICCV 2019

  16. HA-CCN: Hierarchical Attention-based Crowd Counting Network

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Single image-based crowd counting has recently witnessed increased focus, but many leading methods are far from optimal, especially in highly congested scenes. In this paper, we present Hierarchical Attention-based Crowd Counting Network (HA-CCN) that employs attention mechanisms at various levels to selectively enhance the features of the network. The proposed method, which is based on the VGG16… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at IEEE Transactions on Image Processing (TIP) 2019

  17. arXiv:1907.01193  [pdf, other

    cs.CV

    Inverse Attention Guided Deep Crowd Counting Network

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: In this paper, we address the challenging problem of crowd counting in congested scenes. Specifically, we present Inverse Attention Guided Deep Crowd Counting Network (IA-DCCN) that efficiently infuses segmentation information through an inverse attention mechanism into the counting network, resulting in significant improvements. The proposed method, which is based on VGG-16, is a single-step trai… ▽ More

    Submitted 21 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted at 16th IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2019

  18. arXiv:1904.01649  [pdf, other

    cs.CV

    MVX-Net: Multimodal VoxelNet for 3D Object Detection

    Authors: Vishwanath A. Sindagi, Yin Zhou, Oncel Tuzel

    Abstract: Many recent works on 3D object detection have focused on designing neural network architectures that can consume point cloud data. While these approaches demonstrate encouraging performance, they are typically based on a single modality and are unable to leverage information from other modalities, such as a camera. Although a few approaches fuse data from different modalities, these methods either… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: 7 pages

    Journal ref: International Conference on Robotics and Automation (ICRA), 2019

  19. arXiv:1901.05375  [pdf, other

    cs.CV

    DAFE-FD: Density Aware Feature Enrichment for Face Detection

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Recent research on face detection, which is focused primarily on improving accuracy of detecting smaller faces, attempt to develop new anchor design strategies to facilitate increased overlap between anchor boxes and ground truth faces of smaller sizes. In this work, we approach the problem of small face detection with the motivation of enriching the feature maps using a density map estimation mod… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

  20. arXiv:1804.10275  [pdf, other

    cs.CV

    Pushing the Limits of Unconstrained Face Detection: a Challenge Dataset and Baseline Results

    Authors: Hajime Nada, Vishwanath A. Sindagi, He Zhang, Vishal M. Patel

    Abstract: Face detection has witnessed immense progress in the last few years, with new milestones being surpassed every year. While many challenges such as large variations in scale, pose, appearance are successfully addressed, there still exist several issues which are not specifically captured by existing methods or datasets. In this work, we identify the next set of challenges that requires attention fr… ▽ More

    Submitted 8 August, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: Accepted in BTAS'2018

  21. arXiv:1710.10182  [pdf, other

    cs.CV

    High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks

    Authors: Lidan Wang, Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Synthesizing face sketches from real photos and its inverse have many applications. However, photo/sketch synthesis remains a challenging problem due to the fact that photo and sketch have different characteristics. In this work, we consider this task as an image-to-image translation problem and explore the recently popular generative models (GANs) to generate high-quality realistic photos from sk… ▽ More

    Submitted 2 March, 2018; v1 submitted 27 October, 2017; originally announced October 2017.

    Comments: Accepted by 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018)(Oral)

  22. GP-GAN: Gender Preserving GAN for Synthesizing Faces from Landmarks

    Authors: Xing Di, Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Facial landmarks constitute the most compressed representation of faces and are known to preserve information such as pose, gender and facial structure present in the faces. Several works exist that attempt to perform high-level face-related analysis tasks based on landmarks. In contrast, in this work, an attempt is made to tackle the inverse problem of synthesizing faces from their respective lan… ▽ More

    Submitted 25 April, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

    Comments: 6 pages, 5 figures, this paper is accepted as 2018 24th International Conference on Pattern Recognition (ICPR2018)

  23. arXiv:1708.00953  [pdf, other

    cs.CV

    Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: We present a novel method called Contextual Pyramid CNN (CP-CNN) for generating high-quality crowd density and count estimation by explicitly incorporating global and local contextual information of crowd images. The proposed CP-CNN consists of four modules: Global Context Estimator (GCE), Local Context Estimator (LCE), Density Map Estimator (DME) and a Fusion-CNN (F-CNN). GCE is a VGG-16 based CN… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: Accepted at ICCV 2017

  24. arXiv:1708.00581  [pdf, other

    cs.CV

    Joint Transmission Map Estimation and Dehazing using Deep Networks

    Authors: He Zhang, Vishwanath Sindagi, Vishal M. Patel

    Abstract: Single image haze removal is an extremely challenging problem due to its inherent ill-posed nature. Several prior-based and learning-based methods have been proposed in the literature to solve this problem and they have achieved superior results. However, most of the existing methods assume constant atmospheric light model and tend to follow a two-step procedure involving prior-based methods for e… ▽ More

    Submitted 20 April, 2019; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: This paper has been accepted in IEEE-TCSVT

  25. arXiv:1707.09605  [pdf, other

    cs.CV

    CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Estimating crowd count in densely crowded scenes is an extremely challenging task due to non-uniform scale variations. In this paper, we propose a novel end-to-end cascaded network of CNNs to jointly learn crowd count classification and density map estimation. Classifying crowd count into various groups is tantamount to coarsely estimating the total count in the image thereby incorporating a high-… ▽ More

    Submitted 16 August, 2017; v1 submitted 30 July, 2017; originally announced July 2017.

    Comments: Accepted at AVSS 2017 (14th International Conference on Advanced Video and Signal Based Surveillance)

  26. A Survey of Recent Advances in CNN-based Single Image Crowd Counting and Density Estimation

    Authors: Vishwanath A. Sindagi, Vishal M. Patel

    Abstract: Estimating count and density maps from crowd images has a wide range of applications such as video surveillance, traffic monitoring, public safety and urban planning. In addition, techniques developed for crowd counting can be applied to related tasks in other fields of study such as cell microscopy, vehicle counting and environmental survey. The task of crowd counting and density map estimation i… ▽ More

    Submitted 4 July, 2017; originally announced July 2017.

    Comments: 16 pages, 17 figures

  27. arXiv:1701.05957  [pdf, other

    cs.CV

    Image De-raining Using a Conditional Generative Adversarial Network

    Authors: He Zhang, Vishwanath Sindagi, Vishal M. Patel

    Abstract: Severe weather conditions such as rain and snow adversely affect the visual quality of images captured under such conditions thus rendering them useless for further usage and sharing. In addition, such degraded images drastically affect performance of vision systems. Hence, it is important to solve the problem of single image de-raining/de-snowing. However, this is a difficult problem to solve due… ▽ More

    Submitted 2 June, 2019; v1 submitted 20 January, 2017; originally announced January 2017.