Skip to main content

Showing 1–47 of 47 results for author: Veeraraghavan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10212  [pdf, other

    cs.CV cs.GR

    NeST: Neural Stress Tensor Tomography by leveraging 3D Photoelasticity

    Authors: Akshat Dave, Tianyi Zhang, Aaron Young, Ramesh Raskar, Wolfgang Heidrich, Ashok Veeraraghavan

    Abstract: Photoelasticity enables full-field stress analysis in transparent objects through stress-induced birefringence. Existing techniques are limited to 2D slices and require destructively slicing the object. Recovering the internal 3D stress distribution of the entire object is challenging as it involves solving a tensor tomography problem and handling phase wrap** ambiguities. We introduce NeST, an… ▽ More

    Submitted 24 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Project webpage: https://akshatdave.github.io/nest

  2. arXiv:2406.00859  [pdf, other

    eess.IV cs.CV

    Streaming quanta sensors for online, high-performance imaging and vision

    Authors: Tianyi Zhang, Matthew Dutson, Vivek Boominathan, Mohit Gupta, Ashok Veeraraghavan

    Abstract: Recently quanta image sensors (QIS) -- ultra-fast, zero-read-noise binary image sensors -- have demonstrated remarkable imaging capabilities in many challenging scenarios. Despite their potential, the adoption of these sensors is severely hampered by (a) high data rates and (b) the need for new computational pipelines to handle the unconventional raw data. We introduce a simple, low-bandwidth comp… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  3. arXiv:2404.15274  [pdf, other

    cs.LG cs.CV eess.IV physics.med-ph

    Metric-guided Image Reconstruction Bounds via Conformal Prediction

    Authors: Matt Y Cheung, Tucker J Netherton, Laurence E Court, Ashok Veeraraghavan, Guha Balakrishnan

    Abstract: Recent advancements in machine learning have led to novel imaging systems and algorithms that address ill-posed problems. Assessing their trustworthiness and understanding how to deploy them safely at test time remains an important and open problem. We propose a method that leverages conformal prediction to retrieve upper/lower bounds and statistical inliers/outliers of reconstructions based on th… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2404.07985  [pdf, other

    cs.CV eess.IV

    WaveMo: Learning Wavefront Modulations to See Through Scattering

    Authors: Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo **, Ashok Veeraraghavan, Christopher A. Metzler

    Abstract: Imaging through scattering media is a fundamental and pervasive challenge in fields ranging from medical diagnostics to astronomy. A promising strategy to overcome this challenge is wavefront modulation, which induces measurement diversity during image acquisition. Despite its importance, designing optimal wavefront modulations to image through scattering remains under-explored. This paper introdu… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2403.13199  [pdf, other

    cs.CV cs.DC

    DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images

    Authors: Zaid Tasneem, Akshat Dave, Abhishek Singh, Kushagra Tiwary, Praneeth Vepakomma, Ashok Veeraraghavan, Ramesh Raskar

    Abstract: Neural radiance fields (NeRFs) show potential for transforming images captured worldwide into immersive 3D visual experiences. However, most of this captured visual data remains siloed in our camera rolls as these images contain personal details. Even if made public, the problem of learning 3D representations of billions of scenes captured daily in a centralized manner is computationally intractab… ▽ More

    Submitted 28 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  6. arXiv:2402.18102  [pdf, other

    eess.IV cs.CV

    Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging

    Authors: Bhargav Ghanekar, Salman Siddique Khan, Pranav Sharma, Shreyas Singh, Vivek Boominathan, Kaushik Mitra, Ashok Veeraraghavan

    Abstract: Passive, compact, single-shot 3D sensing is useful in many application areas such as microscopy, medical imaging, surgical navigation, and autonomous driving where form factor, time, and power constraints can exist. Obtaining RGB-D scene information over a short imaging distance, in an ultra-compact form factor, and in a passive, snapshot manner is challenging. Dual-pixel (DP) sensors are a potent… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2312.04679  [pdf, other

    eess.IV cs.CV

    ConVRT: Consistent Video Restoration Through Turbulence with Test-time Optimization of Neural Video Representations

    Authors: Haoming Cai, **gxi Chen, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang, Ashok Veeraraghavan, Christopher Metzler

    Abstract: tmospheric turbulence presents a significant challenge in long-range imaging. Current restoration algorithms often struggle with temporal inconsistency, as well as limited generalization ability across varying turbulence levels and scene content different than the training data. To tackle these issues, we introduce a self-supervised method, Consistent Video Restoration through Turbulence (ConVRT)… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: https://convrt-2024.github.io/

  8. arXiv:2311.09652  [pdf, other

    cs.CV cs.GR

    Event-based Motion-Robust Accurate Shape Estimation for Mixed Reflectance Scenes

    Authors: Aniket Dashpute, Jiazhang Wang, James Taylor, Oliver Cossairt, Ashok Veeraraghavan, Florian Willomitzer

    Abstract: Event-based structured light systems have recently been introduced as an exciting alternative to conventional frame-based triangulation systems for the 3D measurements of diffuse surfaces. Important benefits include the fast capture speed and the high dynamic range provided by the event camera - albeit at the cost of lower data quality. So far, both low-accuracy event-based as well as high-accurac… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  9. arXiv:2308.02100  [pdf, other

    eess.IV cs.CV

    CT Reconstruction from Few Planar X-rays with Application towards Low-resource Radiotherapy

    Authors: Yiran Sun, Tucker Netherton, Laurence Court, Ashok Veeraraghavan, Guha Balakrishnan

    Abstract: CT scans are the standard-of-care for many clinical ailments, and are needed for treatments like external beam radiotherapy. Unfortunately, CT scanners are rare in low and mid-resource settings due to their costs. Planar X-ray radiography units, in comparison, are far more prevalent, but can only provide limited 2D observations of the 3D anatomy. In this work, we propose a method to generate CT vo… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures

  10. arXiv:2308.00622  [pdf, other

    cs.CV

    NeRT: Implicit Neural Representations for General Unsupervised Turbulence Mitigation

    Authors: Weiyun Jiang, Yuhao Liu, Vivek Boominathan, Ashok Veeraraghavan

    Abstract: The atmospheric and water turbulence mitigation problems have emerged as challenging inverse problems in computer vision and optics communities over the years. However, current methods either rely heavily on the quality of the training dataset or fail to generalize over various scenarios, such as static scenes, dynamic scenes, and text reconstructions. We propose a general implicit neural represen… ▽ More

    Submitted 1 April, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

  11. arXiv:2304.01308  [pdf, other

    eess.IV cs.CV

    Role of Transients in Two-Bounce Non-Line-of-Sight Imaging

    Authors: Siddharth Somasundaram, Akshat Dave, Connor Henley, Ashok Veeraraghavan, Ramesh Raskar

    Abstract: The goal of non-line-of-sight (NLOS) imaging is to image objects occluded from the camera's field of view using multiply scattered light. Recent works have demonstrated the feasibility of two-bounce (2B) NLOS imaging by scanning a laser and measuring cast shadows of occluded objects in scenes with two relay surfaces. In this work, we study the role of time-of-flight (ToF) measurements, \ie transie… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  12. arXiv:2304.00696  [pdf, other

    cs.CV

    Thermal Spread Functions (TSF): Physics-guided Material Classification

    Authors: Aniket Dashpute, Vishwanath Saragadam, Emma Alexander, Florian Willomitzer, Aggelos Katsaggelos, Ashok Veeraraghavan, Oliver Cossairt

    Abstract: Robust and non-destructive material classification is a challenging but crucial first-step in numerous vision applications. We propose a physics-guided material classification framework that relies on thermal properties of the object. Our key observation is that the rate of heating and cooling of an object depends on the unique intrinsic properties of the material, namely the emissivity and diffus… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

  13. arXiv:2301.05187  [pdf, other

    cs.CV cs.GR eess.IV

    WIRE: Wavelet Implicit Neural Representations

    Authors: Vishwanath Saragadam, Daniel LeJeune, Jasper Tan, Guha Balakrishnan, Ashok Veeraraghavan, Richard G. Baraniuk

    Abstract: Implicit neural representations (INRs) have recently advanced numerous vision-related areas. INR performance depends strongly on the choice of the nonlinear activation function employed in its multilayer perceptron (MLP) network. A wide range of nonlinearities have been explored, but, unfortunately, current INRs designed to have high accuracy also suffer from poor robustness (to signal noise, para… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  14. arXiv:2212.06345  [pdf, other

    physics.optics cs.CV

    Foveated Thermal Computational Imaging in the Wild Using All-Silicon Meta-Optics

    Authors: Vishwanath Saragadam, Zheyi Han, Vivek Boominathan, Luocheng Huang, Shiyu Tan, Johannes E. Fröch, Karl F. Böhringer, Richard G. Baraniuk, Arka Majumdar, Ashok Veeraraghavan

    Abstract: Foveated imaging provides a better tradeoff between situational awareness (field of view) and resolution and is critical in long-wavelength infrared regimes because of the size, weight, power, and cost of thermal sensors. We demonstrate computational foveated imaging by exploiting the ability of a meta-optical frontend to discriminate between different polarization states and a computational backe… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

  15. arXiv:2212.04531  [pdf, other

    cs.CV cs.AI

    ORCa: Glossy Objects as Radiance Field Cameras

    Authors: Kushagra Tiwary, Akshat Dave, Nikhil Behari, Tzofi Klinghoffer, Ashok Veeraraghavan, Ramesh Raskar

    Abstract: Reflections on glossy objects contain valuable and hidden information about the surrounding environment. By converting these objects into cameras, we can unlock exciting applications, including imaging beyond the camera's field-of-view and from seemingly impossible vantage points, e.g. from reflections on the human eye. However, this task is challenging because reflections depend jointly on object… ▽ More

    Submitted 12 December, 2022; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: for more information, see https://ktiwary2.github.io/objectsascam/

  16. arXiv:2207.00945  [pdf, other

    eess.IV cs.CV

    PS$^2$F: Polarized Spiral Point Spread Function for Single-Shot 3D Sensing

    Authors: Bhargav Ghanekar, Vishwanath Saragadam, Dushyant Mehra, Anna-Karin Gustavsson, Aswin Sankaranarayanan, Ashok Veeraraghavan

    Abstract: We propose a compact snapshot monocular depth estimation technique that relies on an engineered point spread function (PSF). Traditional approaches used in microscopic super-resolution imaging such as the Double-Helix PSF (DHPSF) are ill-suited for scenes that are more complex than a sparse set of point light sources. We show, using the Cramér-Rao lower bound, that separating the two lobes of the… ▽ More

    Submitted 4 August, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: 12 pages, 12 figures

  17. arXiv:2206.08141  [pdf

    cs.AR

    i-FlatCam: A 253 FPS, 91.49 $μ$J/Frame Ultra-Compact Intelligent Lensless Camera for Real-Time and Efficient Eye Tracking in VR/AR

    Authors: Yang Zhao, Ziyun Li, Yonggan Fu, Yongan Zhang, Chaojian Li, Cheng Wan, Haoran You, Shang Wu, Xu Ouyang, Vivek Boominathan, Ashok Veeraraghavan, Yingyan Lin

    Abstract: We present a first-of-its-kind ultra-compact intelligent camera system, dubbed i-FlatCam, including a lensless camera with a computational (Comp.) chip. It highlights (1) a predict-then-focus eye tracking pipeline for boosted efficiency without compromising the accuracy, (2) a unified compression scheme for single-chip processing and improved frame rate per second (FPS), and (3) dedicated intra-ch… ▽ More

    Submitted 22 February, 2024; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted by VLSI 2022

  18. EyeCoD: Eye Tracking System Acceleration via FlatCam-based Algorithm & Accelerator Co-Design

    Authors: Haoran You, Cheng Wan, Yang Zhao, Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, Shang Wu, Shunyao Zhang, Yongan Zhang, Chaojian Li, Vivek Boominathan, Ashok Veeraraghavan, Ziyun Li, Yingyan Lin

    Abstract: Eye tracking has become an essential human-machine interaction modality for providing immersive experience in numerous virtual and augmented reality (VR/AR) applications desiring high throughput (e.g., 240 FPS), small-form, and enhanced visual privacy. However, existing eye tracking systems are still limited by their: (1) large form-factor largely due to the adopted bulky lens-based cameras; and (… ▽ More

    Submitted 5 February, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Accepted by ISCA 2022; Also selected as an IEEE Micro's Top Pick of 2023

  19. arXiv:2204.03145  [pdf, other

    stat.AP cs.LG stat.ML

    DeepTensor: Low-Rank Tensor Decomposition with Deep Network Priors

    Authors: Vishwanath Saragadam, Randall Balestriero, Ashok Veeraraghavan, Richard G. Baraniuk

    Abstract: DeepTensor is a computationally efficient framework for low-rank decomposition of matrices and tensors using deep generative networks. We decompose a tensor as the product of low-rank tensor factors (e.g., a matrix as the outer product of two vectors), where each low-rank tensor is generated by a deep network (DN) that is trained in a self-supervised manner to minimize the mean-squared approximati… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 14 pages

  20. arXiv:2203.13458  [pdf, other

    cs.CV cs.GR

    PANDORA: Polarization-Aided Neural Decomposition Of Radiance

    Authors: Akshat Dave, Yongyi Zhao, Ashok Veeraraghavan

    Abstract: Reconstructing an object's geometry and appearance from multiple images, also known as inverse rendering, is a fundamental problem in computer graphics and vision. Inverse rendering is inherently ill-posed because the captured image is an intricate function of unknown lighting conditions, material properties and scene geometry. Recent progress in representing scene properties as coordinate-based n… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Project webpage: https://akshatdave.github.io/pandora

  21. arXiv:2202.03532  [pdf, other

    cs.CV

    MINER: Multiscale Implicit Neural Representations

    Authors: Vishwanath Saragadam, Jasper Tan, Guha Balakrishnan, Richard G. Baraniuk, Ashok Veeraraghavan

    Abstract: We introduce a new neural signal model designed for efficient high-resolution representation of large-scale signals. The key innovation in our multiscale implicit neural representation (MINER) is an internal representation via a Laplacian pyramid, which provides a sparse multiscale decomposition of the signal that captures orthogonal parts of the signal across scales. We leverage the advantages of… ▽ More

    Submitted 17 July, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 14 pages, accepted to ECCV 2022

  22. arXiv:2108.07973  [pdf, other

    eess.IV cs.CV

    Thermal Image Processing via Physics-Inspired Deep Networks

    Authors: Vishwanath Saragadam, Akshat Dave, Ashok Veeraraghavan, Richard Baraniuk

    Abstract: We introduce DeepIR, a new thermal image processing framework that combines physically accurate sensor modeling with deep network-based image representation. Our key enabling observations are that the images captured by thermal sensors can be factored into slowly changing, scene-independent sensor non-uniformities (that can be accurately modeled using physics) and a scene-specific radiance flux (t… ▽ More

    Submitted 25 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted to 2nd ICCV workshop on Learning for Computational Imaging (LCI)

  23. arXiv:2104.04641  [pdf, other

    cs.CV eess.IV physics.optics

    CodedStereo: Learned Phase Masks for Large Depth-of-field Stereo

    Authors: Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan

    Abstract: Conventional stereo suffers from a fundamental trade-off between imaging volume and signal-to-noise ratio (SNR) -- due to the conflicting impact of aperture size on both these variables. Inspired by the extended depth of field cameras, we propose a novel end-to-end learning-based technique to overcome this limitation, by introducing a phase mask at the aperture plane of the cameras in a stereo ima… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021 as an oral presentation

  24. arXiv:2012.14495  [pdf, other

    eess.IV cs.CV cs.GR cs.LG

    SASSI -- Super-Pixelated Adaptive Spatio-Spectral Imaging

    Authors: Vishwanath Saragadam, Michael DeZeeuw, Richard Baraniuk, Ashok Veeraraghavan, Aswin Sankaranarayanan

    Abstract: We introduce a novel video-rate hyperspectral imager with high spatial, and temporal resolutions. Our key hypothesis is that spectral profiles of pixels in a super-pixel of an oversegmented image tend to be very similar. Hence, a scene-adaptive spatial sampling of an hyperspectral scene, guided by its super-pixel segmented image, is capable of obtaining high-quality reconstructions. To achieve thi… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

  25. arXiv:2011.12485  [pdf, other

    eess.IV cs.CV

    How to Train Neural Networks for Flare Removal

    Authors: Yicheng Wu, Qiurui He, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron

    Abstract: When a camera is pointed at a strong light source, the resulting photograph may contain lens flare artifacts. Flares appear in a wide variety of patterns (halos, streaks, color bleeding, haze, etc.) and this diversity in appearance makes flare removal challenging. Existing analytical solutions make strong assumptions about the artifact's geometry or brightness, and therefore only work well on a sm… ▽ More

    Submitted 7 October, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: A new version paper is uploaded

  26. arXiv:2010.15440  [pdf, other

    eess.IV cs.CV cs.LG

    FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

    Authors: Salman S. Khan, Varun Sundar, Vivek Boominathan, Ashok Veeraraghavan, Kaushik Mitra

    Abstract: Lensless imaging has emerged as a potential solution towards realizing ultra-miniature cameras by eschewing the bulky lens in a traditional camera. Without a focusing lens, the lensless cameras rely on computational algorithms to recover the scenes from multiplexed measurements. However, the current iterative-optimization-based reconstruction algorithms produce noisier and perceptually poorer imag… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2020. Supplementary material attached. For project website, see https://siddiquesalman.github.io/flatnet/

  27. arXiv:2010.07770  [pdf, other

    eess.IV cs.LG

    The Benefit of Distraction: Denoising Remote Vitals Measurements using Inverse Attention

    Authors: Ewa Nowara, Daniel McDuff, Ashok Veeraraghavan

    Abstract: Attention is a powerful concept in computer vision. End-to-end networks that learn to focus selectively on regions of an image or video often perform strongly. However, other image regions, while not necessarily containing the signal of interest, may contain useful context. We present an approach that exploits the idea that statistics of noise may be shared between the regions that contain the sig… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  28. arXiv:1811.07567  [pdf, other

    cs.CV

    Fine-grained Classification using Heterogeneous Web Data and Auxiliary Categories

    Authors: Li Niu, Ashok Veeraraghavan, Ashu Sabharwal

    Abstract: Fine-grained classification remains a very challenging problem, because of the absence of well-labeled training data caused by the high cost of annotating a large number of fine-grained categories. In the extreme case, given a set of test categories without any well-labeled training data, the majority of existing works can be grouped into the following two research directions: 1) crawl noisy label… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  29. arXiv:1806.09228  [pdf, other

    cs.LG cs.CV stat.ML

    Deep $k$-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions

    Authors: Junru Wu, Yue Wang, Zhenyu Wu, Zhangyang Wang, Ashok Veeraraghavan, Yingyan Lin

    Abstract: The current trend of pushing CNNs deeper with convolutions has created a pressing demand to achieve higher compression gains on CNNs where convolutions dominate the computation and parameter amount (e.g., GoogLeNet, ResNet and Wide ResNet). Further, the high energy consumption of convolutions limits its deployment on mobile devices. To this end, we proposed a simple yet effective scheme for compre… ▽ More

    Submitted 24 June, 2018; originally announced June 2018.

    Comments: Accepted by ICML 2018

  30. arXiv:1805.06374  [pdf, other

    cs.CV

    Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning

    Authors: Wanjia Liu, Huai** Chen, Rishab Goel, Yuzhong Huang, Ashok Veeraraghavan, Ankit Patel

    Abstract: Good temporal representations are crucial for video understanding, and the state-of-the-art video recognition framework is based on two-stream networks. In such framework, besides the regular ConvNets responsible for RGB frame inputs, a second network is introduced to handle the temporal representation, usually the optical flow (OF). However, OF or other task-oriented flow is computationally costl… ▽ More

    Submitted 19 May, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

  31. arXiv:1803.03857  [pdf, other

    cs.CV

    Learning from Noisy Web Data with Category-level Supervision

    Authors: Li Niu, Qingtao Tang, Ashok Veeraraghavan, Ashu Sabharwal

    Abstract: As tons of photos are being uploaded to public websites (e.g., Flickr, Bing, and Google) every day, learning from web data has become an increasingly popular research direction because of freely available web resources, which is also referred to as webly supervised learning. Nevertheless, the performance gap between webly supervised learning and traditional supervised learning is still very large,… ▽ More

    Submitted 24 May, 2018; v1 submitted 10 March, 2018; originally announced March 2018.

  32. arXiv:1803.00212  [pdf, other

    stat.ML cs.LG

    prDeep: Robust Phase Retrieval with a Flexible Deep Network

    Authors: Christopher A. Metzler, Philip Schniter, Ashok Veeraraghavan, Richard G. Baraniuk

    Abstract: Phase retrieval algorithms have become an important component in many modern computational imaging systems. For instance, in the context of ptychography and speckle correlation imaging, they enable imaging past the diffraction limit and through scattering media, respectively. Unfortunately, traditional phase retrieval algorithms struggle in the presence of noise. Progress has been made recently on… ▽ More

    Submitted 29 June, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

  33. arXiv:1801.05117  [pdf, other

    cs.CV

    Reblur2Deblur: Deblurring Videos via Self-Supervised Learning

    Authors: Huai** Chen, **wei Gu, Orazio Gallo, Ming-Yu Liu, Ashok Veeraraghavan, Jan Kautz

    Abstract: Motion blur is a fundamental problem in computer vision as it impacts image quality and hinders inference. Traditional deblurring algorithms leverage the physics of the image formation model and use hand-crafted priors: they usually produce results that better reflect the underlying scene, but present artifacts. Recent learning-based methods implicitly extract the distribution of natural images di… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  34. arXiv:1711.06167  [pdf, ps, other

    cs.CV

    Zero-Shot Learning via Category-Specific Visual-Semantic Map**

    Authors: Li Niu, Jianfei Cai, Ashok Veeraraghavan

    Abstract: Zero-Shot Learning (ZSL) aims to classify a test instance from an unseen category based on the training instances from seen categories, in which the gap between seen categories and unseen categories is generally bridged via visual-semantic map** between the low-level visual feature space and the intermediate semantic space. However, the visual-semantic map** learnt based on seen categories may… ▽ More

    Submitted 12 December, 2017; v1 submitted 16 November, 2017; originally announced November 2017.

  35. arXiv:1706.09585  [pdf, other

    cs.LG

    Online Reweighted Least Squares Algorithm for Sparse Recovery and Application to Short-Wave Infrared Imaging

    Authors: Subhadip Mukherjee, Deepak R., Huai** Chen, Ashok Veeraraghavan, Chandra Sekhar Seelamantula

    Abstract: We address the problem of sparse recovery in an online setting, where random linear measurements of a sparse signal are revealed sequentially and the objective is to recover the underlying signal. We propose a reweighted least squares (RLS) algorithm to solve the problem of online sparse reconstruction, wherein a system of linear equations is solved using conjugate gradient with the arrival of eve… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

  36. arXiv:1605.03621  [pdf, other

    cs.CV

    ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels

    Authors: Huai** Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha Molnar

    Abstract: Deep learning using convolutional neural networks (CNNs) is quickly becoming the state-of-the-art for challenging computer vision applications. However, deep learning's power consumption and bandwidth requirements currently limit its application in embedded and mobile systems with tight energy budgets. In this paper, we explore the energy savings of optically computing the first layer of CNNs. To… ▽ More

    Submitted 16 November, 2016; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: Presented in CVPR 2016 (oral), 10 pages, 12 figures. This new version corrects the comparison between imaging power for ASPs and a regular image sensor

  37. arXiv:1512.06539  [pdf, ps, other

    cs.CV

    Spatial Phase-Sweep: Increasing temporal resolution of transient imaging using a light source array

    Authors: Ryuichi Tadano, Adithya Kumar Pediredla, Kaushik Mitra, Ashok Veeraraghavan

    Abstract: Transient imaging or light-in-flight techniques capture the propagation of an ultra-short pulse of light through a scene, which in effect captures the optical impulse response of the scene. Recently, it has been shown that we can capture transient images using commercially available Time-of-Flight (ToF) systems such as Photonic Mixer Devices (PMD). In this paper, we propose `spatial phase-sweep',… ▽ More

    Submitted 21 December, 2015; originally announced December 2015.

  38. arXiv:1510.08470  [pdf, other

    cs.CV physics.optics

    Toward Long Distance, Sub-diffraction Imaging Using Coherent Camera Arrays

    Authors: Jason Holloway, M. Salman Asif, Manoj Kumar Sharma, Nathan Matsuda, Roarke Horstmeyer, Oliver Cossairt, Ashok Veeraraghavan

    Abstract: In this work, we propose using camera arrays coupled with coherent illumination as an effective method of improving spatial resolution in long distance images by a factor of ten and beyond. Recent advances in ptychography have demonstrated that one can image beyond the diffraction limit of the objective lens in a microscope. We demonstrate a similar imaging system to image beyond the diffraction l… ▽ More

    Submitted 28 October, 2015; originally announced October 2015.

    Comments: 13 pages, 16 figures, submitted to IEEE Transactions on Computational Imaging

  39. arXiv:1509.00816  [pdf, other

    cs.CV

    Depth Fields: Extending Light Field Techniques to Time-of-Flight Imaging

    Authors: Suren Jayasuriya, Adithya Pediredla, Sriram Sivaramakrishnan, Alyosha Molnar, Ashok Veeraraghavan

    Abstract: A variety of techniques such as light field, structured illumination, and time-of-flight (TOF) are commonly used for depth acquisition in consumer imaging, robotics and many other applications. Unfortunately, each technique suffers from its individual limitations preventing robust depth sensing. In this paper, we explore the strengths and weaknesses of combining light field and time-of-flight imag… ▽ More

    Submitted 2 September, 2015; originally announced September 2015.

    Comments: 9 pages, 8 figures, Accepted to 3DV 2015

  40. arXiv:1509.00116  [pdf, other

    cs.CV

    FlatCam: Thin, Bare-Sensor Cameras using Coded Aperture and Computation

    Authors: M. Salman Asif, Ali Ayremlou, Aswin Sankaranarayanan, Ashok Veeraraghavan, Richard Baraniuk

    Abstract: FlatCam is a thin form-factor lensless camera that consists of a coded mask placed on top of a bare, conventional sensor array. Unlike a traditional, lens-based camera where an image of the scene is directly recorded on the sensor pixels, each pixel in FlatCam records a linear combination of light from multiple scene elements. A computational algorithm is then used to demultiplex the recorded meas… ▽ More

    Submitted 27 January, 2016; v1 submitted 31 August, 2015; originally announced September 2015.

    Comments: 12 pages, 10 figures

  41. arXiv:1508.01244  [pdf, other

    cs.CV

    TabletGaze: Unconstrained Appearance-based Gaze Estimation in Mobile Tablets

    Authors: Qiong Huang, Ashok Veeraraghavan, Ashutosh Sabharwal

    Abstract: We study gaze estimation on tablets, our key design goal is uncalibrated gaze estimation using the front-facing camera during natural use of tablets, where the posture and method of holding the tablet is not constrained. We collected the first large unconstrained gaze dataset of tablet users, labeled Rice TabletGaze dataset. The dataset consists of 51 subjects, each with 4 different postures and 3… ▽ More

    Submitted 16 July, 2016; v1 submitted 5 August, 2015; originally announced August 2015.

    Comments: 18 pages, 17 figures, submitted to journal, website hosting the dataset: http://sh.rice.edu/tablet_gaze.html

  42. arXiv:1504.04085  [pdf, other

    cs.CV

    FPA-CS: Focal Plane Array-based Compressive Imaging in Short-wave Infrared

    Authors: Huai** Chen, M. Salman Asif, Aswin C. Sankaranarayanan, Ashok Veeraraghavan

    Abstract: Cameras for imaging in short and mid-wave infrared spectra are significantly more expensive than their counterparts in visible imaging. As a result, high-resolution imaging in those spectrum remains beyond the reach of most consumers. Over the last decade, compressive sensing (CS) has emerged as a potential means to realize inexpensive short-wave infrared cameras. One approach for doing this is th… ▽ More

    Submitted 15 April, 2015; originally announced April 2015.

    Comments: appears in IEEE Conf. Computer Vision and Pattern Recognition, 2015

  43. arXiv:1502.08040  [pdf, other

    cs.CV

    DistancePPG: Robust non-contact vital signs monitoring using a camera

    Authors: Mayank Kumar, Ashok Veeraraghavan, Ashutosh Sabharval

    Abstract: Vital signs such as pulse rate and breathing rate are currently measured using contact probes. But, non-contact methods for measuring vital signs are desirable both in hospital settings (e.g. in NICU) and for ubiquitous in-situ health tracking (e.g. on mobile phone and computers with webcams). Recently, camera-based non-contact vital sign monitoring have been shown to be feasible. However, camera-… ▽ More

    Submitted 23 March, 2015; v1 submitted 27 February, 2015; originally announced February 2015.

    Comments: 24 pages, 11 figures

  44. arXiv:1412.0680  [pdf, other

    cs.CV

    Fast Sublinear Sparse Representation using Shallow Tree Matching Pursuit

    Authors: Ali Ayremlou, Thomas Goldstein, Ashok Veeraraghavan, Richard Baraniuk

    Abstract: Sparse approximations using highly over-complete dictionaries is a state-of-the-art tool for many imaging applications including denoising, super-resolution, compressive sensing, light-field analysis, and object recognition. Unfortunately, the applicability of such methods is severely hampered by the computational burden of sparse approximation: these algorithms are linear or super-linear in both… ▽ More

    Submitted 1 December, 2014; originally announced December 2014.

  45. arXiv:1308.1981  [pdf, other

    cs.CV

    A Framework for the Analysis of Computational Imaging Systems with Practical Applications

    Authors: Kaushik Mitra, Oliver Cossairt, Ashok Veeraraghavan

    Abstract: Over the last decade, a number of Computational Imaging (CI) systems have been proposed for tasks such as motion deblurring, defocus deblurring and multispectral imaging. These techniques increase the amount of light reaching the sensor via multiplexing and then undo the deleterious effects of multiplexing by appropriate reconstruction algorithms. Given the widespread appeal and the considerable e… ▽ More

    Submitted 13 March, 2014; v1 submitted 8 August, 2013; originally announced August 2013.

    ACM Class: I.4

  46. arXiv:1203.4280  [pdf, other

    physics.optics cs.CV

    Reconstruction of hidden 3D shapes using diffuse reflections

    Authors: Otkrist Gupta, Andreas Velten, Thomas Willwacher, Ashok Veeraraghavan, Ramesh Raskar

    Abstract: We analyze multi-bounce propagation of light in an unknown hidden volume and demonstrate that the reflected light contains sufficient information to recover the 3D structure of the hidden scene. We formulate the forward and inverse theory of secondary and tertiary scattering reflection using ideas from energy front propagation and tomography. We show that using careful choice of approximations, su… ▽ More

    Submitted 19 March, 2012; originally announced March 2012.

  47. arXiv:1109.1865  [pdf, other

    cs.CV

    Progressive versus Random Projections for Compressive Capture of Images, Lightfields and Higher Dimensional Visual Signals

    Authors: Rohit Pandharkar, Ashok Veeraraghavan, Ramesh Raskar

    Abstract: Computational photography involves sophisticated capture methods. A new trend is to capture projection of higher dimensional visual signals such as videos, multi-spectral data and lightfields on lower dimensional sensors. Carefully designed capture methods exploit the sparsity of the underlying signal in a transformed domain to reduce the number of measurements and use an appropriate reconstructio… ▽ More

    Submitted 8 September, 2011; originally announced September 2011.

    Comments: Draft of working paper