Skip to main content

Showing 1–49 of 49 results for author: Turaga, P

.
  1. arXiv:2403.14140  [pdf, other

    cs.CV cs.LG

    Learning Decomposable and Debiased Representations via Attribute-Centric Information Bottlenecks

    Authors: **yung Hong, Eun Som Jeon, Changhoon Kim, Keun Hee Park, Utkarsh Nath, Yezhou Yang, Pavan Turaga, Theodore P. Pavlic

    Abstract: Biased attributes, spuriously correlated with target labels in a dataset, can problematically lead to neural networks that learn improper shortcuts for classifications and limit their capabilities for out-of-distribution (OOD) generalization. Although many debiasing approaches have been proposed to ensure correct predictions from biased datasets, few studies have considered learning latent embeddi… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 24 pages, 16 figures, 3 tables

  2. arXiv:2305.13284  [pdf, other

    cs.CV cs.AI

    Target-Aware Generative Augmentations for Single-Shot Adaptation

    Authors: Kowshik Thopalli, Rakshith Subramanyam, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: In this paper, we address the problem of adapting models from a source domain to a target domain, a task that has become increasingly important due to the brittle generalization of deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic toolbox data augmentations in cases of limited target data availability. We consider the challenging set… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at International Conference Machine Learning (ICML) 2023

  3. arXiv:2305.10513  [pdf, other

    cs.CV stat.ML

    Learning Pose Image Manifolds Using Geometry-Preserving GANs and Elasticae

    Authors: Shenyuan Liang, Pavan Turaga, Anuj Srivastava

    Abstract: This paper investigates the challenge of learning image manifolds, specifically pose manifolds, of 3D objects using limited training data. It proposes a DNN approach to manifold learning and for predicting images of objects for novel, continuous 3D rotations. The approach uses two distinct concepts: (1) Geometric Style-GAN (Geom-SGAN), which maps images to low-dimensional latent representations an… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  4. arXiv:2303.11424  [pdf, other

    cs.CV

    Polynomial Implicit Neural Representations For Large Diverse Datasets

    Authors: Rajhans Singh, Ankita Shukla, Pavan Turaga

    Abstract: Implicit neural representations (INR) have gained significant popularity for signal and image representation for many end-tasks, such as superresolution, 3D modeling, and more. Most INR architectures rely on sinusoidal positional encoding, which accounts for high-frequency information in data. However, the finite encoding size restricts the model's representational power. Higher representational p… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  5. Leveraging Angular Distributions for Improved Knowledge Distillation

    Authors: Eun Som Jeon, Hongjun Choi, Ankita Shukla, Pavan Turaga

    Abstract: Knowledge distillation as a broad class of methods has led to the development of lightweight and memory efficient models, using a pre-trained model with a large capacity (teacher network) to train a smaller model (student network). Recently, additional variations for knowledge distillation, utilizing activation maps of intermediate layers as the source of knowledge, have been studied. Generally, i… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Neurocomputing, Volume 518, 21 January 2023, Pages 466-481

    Journal ref: Neurocomputing, Volume 518, 2023, Pages 466-481

  6. arXiv:2211.03946  [pdf, other

    cs.CV

    Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study

    Authors: Hongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan Turaga

    Abstract: Mixup is a popular data augmentation technique based on creating new samples by linear interpolation between two given data samples, to improve both the generalization and robustness of the trained model. Knowledge distillation (KD), on the other hand, is widely used for model compression and transfer learning, which involves using a larger network's implicit knowledge to guide the learning of a s… ▽ More

    Submitted 8 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: To be presented at WACV 2023

  7. arXiv:2210.16692  [pdf, other

    cs.CV cs.LG stat.ML

    Single-Shot Domain Adaptation via Target-Aware Generative Augmentation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Spring Berman, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  8. arXiv:2207.04185  [pdf, other

    cs.CV cs.LG

    Domain Alignment Meets Fully Test-Time Adaptation

    Authors: Kowshik Thopalli, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: A foundational requirement of a deployed ML model is to generalize to data drawn from a testing distribution that is different from training. A popular solution to this problem is to adapt a pre-trained model to novel domains using only unlabeled data. In this paper, we focus on a challenging variant of this problem, where access to the original source data is restricted. While fully test-time ada… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 16 Pages including references, 5 figures

  9. arXiv:2205.11722  [pdf, other

    cs.CV

    Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments

    Authors: Rajhans Singh, Ankita Shukla, Pavan Turaga

    Abstract: Deep networks for image classification often rely more on texture information than object shape. While efforts have been made to make deep-models shape-aware, it is often difficult to make such models simple, interpretable, or rooted in known mathematical definitions of shape. This paper presents a deep-learning model inspired by geometric moments, a classically well understood approach to measure… ▽ More

    Submitted 22 May, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at CVPR 2023 Workshop: Deep Learning for Geometric Computing

  10. arXiv:2201.01806  [pdf, other

    cs.LG cs.CV

    Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J Thiagarajan, Rushil Anirudh, Pavan K Turaga

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer and adapt knowledge from a labeled source domain to an unlabeled target domain. Traditionally, subspace-based methods form an important class of solutions to this problem. Despite their mathematical elegance and tractability, these methods are often found to be ineffective at producing domain-invariant features with complex, real-world datasets… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:1906.04338

  11. arXiv:2201.00111  [pdf, other

    cs.LG cs.HC eess.SP

    Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data

    Authors: Eun Som Jeon, Anirudh Som, Ankita Shukla, Kristina Hasanaj, Matthew P. Buman, Pavan Turaga

    Abstract: Deep neural networks are parametrized by several thousands or millions of parameters, and have shown tremendous success in many classification problems. However, the large number of parameters makes it difficult to integrate these models into edge devices such as smartphones and wearable devices. To address this problem, knowledge distillation (KD) has been widely employed, that uses a pre-trained… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

  12. arXiv:2112.09802  [pdf, other

    cs.LG cs.CV

    Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization

    Authors: Kowshik Thopalli, Sameeksha Katoch, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: Domain generalization (DG) methods aim to develop models that generalize to settings where the test distribution is different from the training data. In this paper, we focus on the challenging problem of multi-source zero shot DG (MDG), where labeled training data from multiple source domains is available but with no access to data from the target domain. A wide range of solutions have been propos… ▽ More

    Submitted 3 November, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  13. arXiv:2111.14053  [pdf, other

    q-bio.BM cs.AI cs.LG physics.bio-ph

    Towards Conditional Generation of Minimal Action Potential Pathways for Molecular Dynamics

    Authors: John Kevin Cava, John Vant, Nicholas Ho, Ankita Shukla, Pavan Turaga, Ross Maciejewski, Abhishek Singharoy

    Abstract: In this paper, we utilized generative models, and reformulate it for problems in molecular dynamics (MD) simulation, by introducing an MD potential energy component to our generative model. By incorporating potential energy as calculated from TorchMD into a conditional generative framework, we attempt to construct a low-potential energy route of transformation between the helix~$\rightarrow$~coil… ▽ More

    Submitted 5 January, 2022; v1 submitted 28 November, 2021; originally announced November 2021.

    Comments: Accepted to ELLIS ML4Molecules Workshop 2021

  14. arXiv:2111.12798  [pdf, other

    cs.LG cs.CV

    Geometric Priors for Scientific Generative Models in Inertial Confinement Fusion

    Authors: Ankita Shukla, Rushil Anirudh, Eugene Kur, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears, Tammy Ma, Pavan Turaga

    Abstract: In this paper, we develop a Wasserstein autoencoder (WAE) with a hyperspherical prior for multimodal data in the application of inertial confinement fusion. Unlike a typical hyperspherical generative model that requires computationally inefficient sampling from distributions like the von Mis Fisher, we sample from a normal distribution followed by a projection layer before the generator. Finally,… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures, Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021

  15. arXiv:2102.08360  [pdf, other

    cs.LG cs.CV

    Interpretable COVID-19 Chest X-Ray Classification via Orthogonality Constraint

    Authors: Ella Y. Wang, Anirudh Som, Ankita Shukla, Hongjun Choi, Pavan Turaga

    Abstract: Deep neural networks have increasingly been used as an auxiliary tool in healthcare applications, due to their ability to improve performance of several diagnosis tasks. However, these methods are not widely adopted in clinical settings due to the practical limitations in the reliability, generalizability, and interpretability of deep learning based systems. As a result, methods have been develope… ▽ More

    Submitted 21 December, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted in the 2021 ACM CHIL Workshop track. An extended version of this work is under consideration at Pattern Recognition Letters

  16. arXiv:2012.02043  [pdf, other

    cs.CV cs.LG

    Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization

    Authors: Suhas Lohit, Rushil Anirudh, Pavan Turaga

    Abstract: Motion capture (mocap) and time-of-flight based sensing of human actions are becoming increasingly popular modalities to perform robust activity analysis. Applications range from action recognition to quantifying movement quality for health applications. While marker-less motion capture has made great progress, in critical applications such as healthcare, marker-based systems, especially active ma… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted at WACV 2021

  17. arXiv:2009.10762  [pdf, other

    cs.CV cs.LG

    Role of Orthogonality Constraints in Improving Properties of Deep Networks for Image Classification

    Authors: Hongjun Choi, Anirudh Som, Pavan Turaga

    Abstract: Standard deep learning models that employ the categorical cross-entropy loss are known to perform well at image classification tasks. However, many standard models thus obtained often exhibit issues like feature redundancy, low interpretability, and poor calibration. A body of recent work has emerged that has tried addressing some of these challenges by proposing the use of new regularization func… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 8 figures, 4 tables, 1 pseudo-code

  18. arXiv:2006.10873  [pdf, other

    cs.CV cs.LG

    Generative Patch Priors for Practical Compressive Image Recovery

    Authors: Rushil Anirudh, Suhas Lohit, Pavan Turaga

    Abstract: In this paper, we propose the generative patch prior (GPP) that defines a generative prior for compressive image recovery, based on patch-manifold models. Unlike learned, image-level priors that are restricted to the range space of a pre-trained generator, GPP can recover a wide variety of natural images using a pre-trained patch generator. Additionally, GPP retains the benefits of generative prio… ▽ More

    Submitted 5 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  19. arXiv:2005.02936  [pdf, other

    cs.CV cs.CR cs.LG

    GraCIAS: Grassmannian of Corrupted Images for Adversarial Security

    Authors: Ankita Shukla, Pavan Turaga, Saket Anand

    Abstract: Input transformation based defense strategies fall short in defending against strong adversarial attacks. Some successful defenses adopt approaches that either increase the randomness within the applied transformations, or make the defense computationally intensive, making it substantially more challenging for the attacker. However, it limits the applicability of such defenses as a pre-processing… ▽ More

    Submitted 7 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 16 pages

  20. arXiv:2005.02589  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns

    Authors: Anirudh Som, Narayanan Krishnamurthi, Matthew Buman, Pavan Turaga

    Abstract: Application and use of deep learning algorithms for different healthcare applications is gaining interest at a steady pace. However, use of such algorithms can prove to be challenging as they require large amounts of training data that capture different possible variations. This makes it difficult to use them in a clinical setting since in most health applications researchers often have to work wi… ▽ More

    Submitted 6 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: Accepted in the 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society (EMBC 2020)

  21. arXiv:2004.09805  [pdf, other

    cs.CV

    AMC-Loss: Angular Margin Contrastive Loss for Improved Explainability in Image Classification

    Authors: Hongjun Choi, Anirudh Som, Pavan Turaga

    Abstract: Deep-learning architectures for classification problems involve the cross-entropy loss sometimes assisted with auxiliary loss functions like center loss, contrastive loss and triplet loss. These auxiliary loss functions facilitate better discrimination between the different classes of interest. However, recent studies hint at the fact that these loss functions do not take into account the intrinsi… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  22. arXiv:2004.08614  [pdf, other

    cs.CV cs.LG

    Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships

    Authors: Kuldeep Kulkarni, Tejas Gokhale, Rajhans Singh, Pavan Turaga, Aswin Sankaranarayanan

    Abstract: Recently, there has been substantial progress in image synthesis from semantic labelmaps. However, methods used for this task assume the availability of complete and unambiguous labelmaps, with instance boundaries of objects, and class labels for each pixel. This reliance on heavily annotated inputs restricts the application of image synthesis techniques to real-world applications, especially unde… ▽ More

    Submitted 20 May, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: Accepted to AI for Content Creation Workshop @CVPR 2021

  23. arXiv:2004.07384  [pdf, other

    cs.LG eess.IV q-bio.QM stat.ML

    Topological Descriptors for Parkinson's Disease Classification and Regression Analysis

    Authors: Afra Nawar, Farhan Rahman, Narayanan Krishnamurthi, Anirudh Som, Pavan Turaga

    Abstract: At present, the vast majority of human subjects with neurological disease are still diagnosed through in-person assessments and qualitative analysis of patient data. In this paper, we propose to use Topological Data Analysis (TDA) together with machine learning tools to automate the process of Parkinson's disease classification and severity assessment. An automated, stable, and accurate method to… ▽ More

    Submitted 6 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Accepted in the 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society (EMBC 2020)

  24. arXiv:1911.10600  [pdf, other

    cs.CV

    Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning

    Authors: Sameeksha Katoch, Kowshik Thopalli, Jayaraman J. Thiagarajan, Pavan Turaga, Andreas Spanias

    Abstract: Exploiting known semantic relationships between fine-grained tasks is critical to the success of recent model agnostic approaches. These approaches often rely on meta-optimization to make a model robust to systematic task or domain shifts. However, in practice, the performance of these methods can suffer, when there are no coherent semantic relationships between the tasks (or domains). We present… ▽ More

    Submitted 1 February, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

    Comments: Semantic structure development for tasks/domains essential for efficient knowledge transfer

  25. arXiv:1907.09554  [pdf, other

    cs.CV cs.LG

    Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning

    Authors: Ankita Shukla, Sarthak Bhagat, Shagun Uppal, Saket Anand, Pavan Turaga

    Abstract: Learning representations that can disentangle explanatory attributes underlying the data improves interpretabilty as well as provides control on data generation. Various learning frameworks such as VAEs, GANs and auto-encoders have been used in the literature to learn such representations. Most often, the latent space is constrained to a partitioned representation or structured by a prior to impos… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2019

  26. arXiv:1906.05947  [pdf, other

    cs.CV

    Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time War**

    Authors: Suhas Lohit, Qiao Wang, Pavan Turaga

    Abstract: Many time-series classification problems involve develo** metrics that are invariant to temporal misalignment. In human activity analysis, temporal misalignment arises due to various reasons including differing initial phase, sensor sampling rates, and elastic time-warps due to subject-specific biomechanics. Past work in this area has only looked at reducing intra-class variability by elastic te… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Published in CVPR 2019, Codes available at https://github.com/suhaslohit/TTN

  27. arXiv:1906.04338  [pdf, other

    stat.ML cs.CV cs.LG

    SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

    Abstract: Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and (b) aligning the source and target domains. Traditionally, these tasks have either been considered as separate, or assumed to be implicitly addressed together w… ▽ More

    Submitted 18 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  28. arXiv:1906.01769  [pdf, other

    cs.CV cs.LG math.AT

    PI-Net: A Deep Learning Approach to Extract Topological Persistence Images

    Authors: Anirudh Som, Hongjun Choi, Karthikeyan Natesan Ramamurthy, Matthew Buman, Pavan Turaga

    Abstract: Topological features such as persistence diagrams and their functional approximations like persistence images (PIs) have been showing substantial promise for machine learning and computer vision applications. This is greatly attributed to the robustness topological representations provide against different types of physical nuisance variables seen in real-world data, such as view-point, illuminati… ▽ More

    Submitted 23 May, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: 10 pages, 8 figures, 4 tables

  29. arXiv:1905.07061  [pdf, other

    cs.CV cs.LG

    Non-Parametric Priors For Generative Adversarial Networks

    Authors: Rajhans Singh, Pavan Turaga, Suren Jayasuriya, Ravi Garg, Martin W. Braun

    Abstract: The advent of generative adversarial networks (GAN) has enabled new capabilities in synthesis, interpolation, and data augmentation heretofore considered very challenging. However, one of the common assumptions in most GAN architectures is the assumption of simple parametric latent-space distributions. While easy to implement, a simple latent-space distribution can be problematic for uses such as… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Journal ref: International Conference on Machine Learning (2019)

  30. arXiv:1902.06964  [pdf, other

    cs.CV

    Geometry of Deep Generative Models for Disentangled Representations

    Authors: Ankita Shukla, Shagun Uppal, Sarthak Bhagat, Saket Anand, Pavan Turaga

    Abstract: Deep generative models like variational autoencoders approximate the intrinsic geometry of high dimensional data manifolds by learning low-dimensional latent-space variables and an embedding function. The geometric properties of these latent spaces has been studied under the lens of Riemannian geometry; via analysis of the non-linearity of the generator function. In new developments, deep generati… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted at ICVGIP, 2018

  31. arXiv:1812.08882  [pdf, other

    cs.CV

    An Optical Flow-Based Approach for Minimally-Divergent Velocimetry Data Interpolation

    Authors: Berkay Kanberoglu, Dhritiman Das, Priya Nair, Pavan Turaga, David Frakes

    Abstract: Three-dimensional (3D) biomedical image sets are often acquired with in-plane pixel spacings that are far less than the out-of-plane spacings between images. The resultant anisotropy, which can be detrimental in many applications, can be decreased using image interpolation. Optical flow and/or other registration-based interpolators have proven useful in such interpolation roles in the past. When a… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 24 pages, 10 figures, International Journal of Biomedical Imaging, accepted manuscript

  32. arXiv:1811.04491  [pdf, other

    cs.CV

    Multiple Subspace Alignment Improves Domain Adaptation

    Authors: Kowshik Thopalli, Rushil Anirudh, Jayaraman J. Thiagarajan, Pavan Turaga

    Abstract: We present a novel unsupervised domain adaptation (DA) method for cross-domain visual recognition. Though subspace methods have found success in DA, their performance is often limited due to the assumption of approximating an entire dataset using a single low-dimensional subspace. Instead, we develop a method to effectively represent the source and target datasets via a collection of low-dimension… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: under review in ICASSP 2019

  33. arXiv:1809.02850  [pdf, other

    cs.CV

    Rate-Adaptive Neural Networks for Spatial Multiplexers

    Authors: Suhas Lohit, Rajhans Singh, Kuldeep Kulkarni, Pavan Turaga

    Abstract: In resource-constrained environments, one can employ spatial multiplexing cameras to acquire a small number of measurements of a scene, and perform effective reconstruction or high-level inference using purely data-driven neural networks. However, once trained, the measurement matrix and the network are valid only for a single measurement rate (MR) chosen at training time. To overcome this drawbac… ▽ More

    Submitted 8 September, 2018; originally announced September 2018.

  34. arXiv:1807.10400  [pdf, other

    cs.CV

    Perturbation Robust Representations of Topological Persistence Diagrams

    Authors: Anirudh Som, Kowshik Thopalli, Karthikeyan Natesan Ramamurthy, Vinay Venkataraman, Ankita Shukla, Pavan Turaga

    Abstract: Topological methods for data analysis present opportunities for enforcing certain invariances of broad interest in computer vision, including view-point in activity analysis, articulation in shape analysis, and measurement invariance in non-linear dynamical modeling. The increasing success of these methods is attributed to the complementary information that topology provides, as well as availabili… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 19 pages, 4 figures, 6 tables

  35. arXiv:1806.03379  [pdf, other

    cs.CV cs.AI

    CS-VQA: Visual Question Answering with Compressively Sensed Images

    Authors: Li-Chi Huang, Kuldeep Kulkarni, Anik Jha, Suhas Lohit, Suren Jayasuriya, Pavan Turaga

    Abstract: Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition. In this paper, we explore whether VQA is solvable when images are captured in a sub-Nyquist compressive paradigm. We develop a series of deep-network architectures that exploit available compressive data to increasing degrees of accuracy, and show that VQA is indeed solvabl… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: 5 pages, 2 figures, accepted to ICIP 2018

    MSC Class: 68

  36. arXiv:1802.01722  [pdf, other

    cs.CV

    Compressive Light Field Reconstructions using Deep Learning

    Authors: Mayank Gupta, Arjun Jauhari, Kuldeep Kulkarni, Suren Jayasuriya, Alyosha Molnar, Pavan Turaga

    Abstract: Light field imaging is limited in its computational processing demands of high sampling for both spatial and angular dimensions. Single-shot light field cameras sacrifice spatial resolution to sample angular viewpoints, typically by multiplexing incoming rays onto a 2D sensor array. While this resolution can be recovered using compressive sensing, these iterative solutions are slow in processing a… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: Published at CCD 2017 workshop held in conjunction with CVPR 2017

  37. arXiv:1708.09485  [pdf, other

    cs.CV

    Learning Invariant Riemannian Geometric Representations Using Deep Nets

    Authors: Suhas Lohit, Pavan Turaga

    Abstract: Non-Euclidean constraints are inherent in many kinds of data in computer vision and machine learning, typically as a result of specific invariance requirements that need to be respected during high-level inference. Often, these geometric constraints can be expressed in the language of Riemannian geometry, where conventional vector space machine learning does not apply directly. The central questio… ▽ More

    Submitted 22 September, 2017; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: Accepted at International Conference on Computer Vision Workshop (ICCVW), 2017 on Manifold Learning: from Euclid to Riemann

  38. arXiv:1708.04669  [pdf, other

    cs.CV

    Convolutional Neural Networks for Non-iterative Reconstruction of Compressively Sensed Images

    Authors: Suhas Lohit, Kuldeep Kulkarni, Ronan Kerviche, Pavan Turaga, Amit Ashok

    Abstract: Traditional algorithms for compressive sensing recovery are computationally expensive and are ineffective at low measurement rates. In this work, we propose a data driven non-iterative algorithm to overcome the shortcomings of earlier iterative algorithms. Our solution, ReconNet, is a deep neural network, whose parameters are learned end-to-end to map block-wise compressive measurements of the sce… ▽ More

    Submitted 16 August, 2017; v1 submitted 15 August, 2017; originally announced August 2017.

  39. arXiv:1610.09582  [pdf, other

    cs.CV

    Diversity Promoting Online Sampling for Streaming Video Summarization

    Authors: Rushil Anirudh, Ahnaf Masroor, Pavan Turaga

    Abstract: Many applications benefit from sampling algorithms where a small number of well chosen samples are used to generalize different properties of a large dataset. In this paper, we use diverse sampling for streaming video summarization. Several emerging applications support streaming video, but existing summarization algorithms need access to the entire video which requires a lot of memory and computa… ▽ More

    Submitted 29 October, 2016; originally announced October 2016.

    Comments: Published at ICIP 2016

  40. arXiv:1605.08912  [pdf, other

    math.AT cs.CG cs.CV math.DG math.ST

    A Riemannian Framework for Statistical Analysis of Topological Persistence Diagrams

    Authors: Rushil Anirudh, Vinay Venkataraman, Karthikeyan Natesan Ramamurthy, Pavan Turaga

    Abstract: Topological data analysis is becoming a popular way to study high dimensional feature spaces without any contextual clues or assumptions. This paper concerns itself with one popular topological feature, which is the number of $d-$dimensional holes in the dataset, also known as the Betti$-d$ number. The persistence of the Betti numbers over various scales is encoded into a persistence diagram (PD),… ▽ More

    Submitted 28 May, 2016; originally announced May 2016.

    Comments: Accepted at DiffCVML 2016 (CVPR 2016 Workshops)

  41. arXiv:1603.05310  [pdf, other

    cs.CG cs.CV

    Persistent Homology of Attractors For Action Recognition

    Authors: Vinay Venkataraman, Karthikeyan Natesan Ramamurthy, Pavan Turaga

    Abstract: In this paper, we propose a novel framework for dynamical analysis of human actions from 3D motion capture data using topological data analysis. We model human actions using the topological features of the attractor of the dynamical system. We reconstruct the phase-space of time series corresponding to actions using time-delay embedding, and compute the persistent homology of the phase-space recon… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.

    Comments: 5 pages, Under review in International Conference on Image Processing

  42. Elastic Functional Coding of Riemannian Trajectories

    Authors: Rushil Anirudh, Pavan Turaga, **gyong Su, Anuj Srivastava

    Abstract: Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features . In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these trajectories is challenging due to non-linearity of underlying spaces and high-dimensionality of trajector… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

    Comments: Under major revision at IEEE T-PAMI, 2016

  43. Shape Distributions of Nonlinear Dynamical Systems for Video-based Inference

    Authors: Vinay Venkataraman, Pavan Turaga

    Abstract: This paper presents a shape-theoretic framework for dynamical analysis of nonlinear dynamical systems which appear frequently in several video-based inference tasks. Traditional approaches to dynamical modeling have included linear and nonlinear methods with their respective drawbacks. A novel approach we propose is the use of descriptors of the shape of the dynamical attractor as a feature repres… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  44. arXiv:1601.07258  [pdf, other

    cs.CV math.OC

    Fast Integral Image Estimation at 1% measurement rate

    Authors: Kuldeep Kulkarni, Pavan Turaga

    Abstract: We propose a framework called ReFInE to directly obtain integral image estimates from a very small number of spatially multiplexed measurements of the scene without iterative reconstruction of any auxiliary image, and demonstrate their practical utility in visual object tracking. Specifically, we design measurement matrices which are tailored to facilitate extremely fast estimation of the integral… ▽ More

    Submitted 26 January, 2016; originally announced January 2016.

    Comments: Submitted to TPAMI

  45. arXiv:1601.06892  [pdf, other

    cs.CV

    ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Random Measurements

    Authors: Kuldeep Kulkarni, Suhas Lohit, Pavan Turaga, Ronan Kerviche, Amit Ashok

    Abstract: The goal of this paper is to present a non-iterative and more importantly an extremely fast algorithm to reconstruct images from compressively sensed (CS) random measurements. To this end, we propose a novel convolutional neural network (CNN) architecture which takes in CS measurements of an image as input and outputs an intermediate reconstruction. We call this network, ReconNet. The intermediate… ▽ More

    Submitted 7 March, 2016; v1 submitted 26 January, 2016; originally announced January 2016.

    Comments: Accepted at IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016

  46. arXiv:1501.04367  [pdf, other

    cs.CV

    Reconstruction-free action inference from compressive imagers

    Authors: Kuldeep Kulkarni, Pavan Turaga

    Abstract: Persistent surveillance from camera networks, such as at parking lots, UAVs, etc., often results in large amounts of video data, resulting in significant challenges for inference in terms of storage, communication and computation. Compressive cameras have emerged as a potential solution to deal with the data deluge issues in such applications. However, inference tasks such as action recognition re… ▽ More

    Submitted 18 January, 2015; originally announced January 2015.

  47. arXiv:1406.5653  [pdf, other

    cs.CV

    Interactively Test Driving an Object Detector: Estimating Performance on Unlabeled Data

    Authors: Rushil Anirudh, Pavan Turaga

    Abstract: In this paper, we study the problem of `test-driving' a detector, i.e. allowing a human user to get a quick sense of how well the detector generalizes to their specific requirement. To this end, we present the first system that estimates detector performance interactively without extensive ground truthing using a human in the loop. We approach this as a problem of estimating proportions and show t… ▽ More

    Submitted 21 June, 2014; originally announced June 2014.

    Comments: Published at Winter Conference on Applications of Computer Vision, 2014

  48. arXiv:1403.0820  [pdf, other

    cs.CV math.DG

    Geometry-based Adaptive Symbolic Approximation for Fast Sequence Matching on Manifolds

    Authors: Rushil Anirudh, Pavan Turaga

    Abstract: In this paper, we consider the problem of fast and efficient indexing techniques for sequences evolving in non-Euclidean spaces. This problem has several applications in the areas of human activity analysis, where there is a need to perform fast search, and recognition in very high dimensional spaces. The problem is made more challenging when representations such as landmarks, contours, and human… ▽ More

    Submitted 13 February, 2015; v1 submitted 4 March, 2014; originally announced March 2014.

    Comments: Under major revision at IJCV

  49. arXiv:1201.4895  [pdf, other

    cs.CV

    Compressive Acquisition of Dynamic Scenes

    Authors: Aswin C Sankaranarayanan, Pavan K Turaga, Rama Chellappa, Richard G Baraniuk

    Abstract: Compressive sensing (CS) is a new approach for the acquisition and recovery of sparse signals and images that enables sampling rates significantly below the classical Nyquist rate. Despite significant progress in the theory and methods of CS, little headway has been made in compressive video acquisition and recovery. Video CS is complicated by the ephemeral nature of dynamic events, which makes di… ▽ More

    Submitted 26 June, 2013; v1 submitted 23 January, 2012; originally announced January 2012.