Skip to main content

Showing 1–34 of 34 results for author: Srinivasan, P P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06527  [pdf, other

    cs.CV cs.AI cs.GR

    IllumiNeRF: 3D Relighting without Inverse Rendering

    Authors: Xiaoming Zhao, Pratul P. Srinivasan, Dor Verbin, Keunhong Park, Ricardo Martin Brualla, Philipp Henzler

    Abstract: Existing methods for relightable view synthesis -- using a set of images of an object under unknown lighting to recover a 3D representation that can be rendered from novel viewpoints under a target illumination -- are based on inverse rendering, and attempt to disentangle the object geometry, materials, and lighting that explain the input images. Furthermore, this typically involves optimization t… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://illuminerf.github.io/

  2. arXiv:2405.14871  [pdf, other

    cs.CV cs.GR

    NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections

    Authors: Dor Verbin, Pratul P. Srinivasan, Peter Hedman, Ben Mildenhall, Benjamin Attal, Richard Szeliski, Jonathan T. Barron

    Abstract: Neural Radiance Fields (NeRFs) typically struggle to reconstruct and render highly specular objects, whose appearance varies quickly with changes in viewpoint. Recent works have improved NeRF's ability to render detailed specular appearance of distant environment illumination, but are unable to synthesize consistent reflections of closer content. Moreover, these techniques rely on large computatio… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Project page: http://nerf-casting.github.io

  3. arXiv:2402.12377  [pdf, other

    cs.CV

    Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis

    Authors: Christian Reiser, Stephan Garbin, Pratul P. Srinivasan, Dor Verbin, Richard Szeliski, Ben Mildenhall, Jonathan T. Barron, Peter Hedman, Andreas Geiger

    Abstract: While surface-based view synthesis algorithms are appealing due to their low computational requirements, they often struggle to reproduce thin structures. In contrast, more expensive methods that model the scene's geometry as a volumetric density field (e.g. NeRF) excel at reconstructing fine geometric detail. However, density fields often represent geometry in a "fuzzy" manner, which hinders exac… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Project page at https://binary-opacity-grid.github.io

  4. arXiv:2312.05283  [pdf, other

    cs.CV cs.GR

    Nuvo: Neural UV Map** for Unruly 3D Representations

    Authors: Pratul P. Srinivasan, Stephan J. Garbin, Dor Verbin, Jonathan T. Barron, Ben Mildenhall

    Abstract: Existing UV map** algorithms are designed to operate on well-behaved meshes, instead of the geometry representations produced by state-of-the-art 3D reconstruction and generation techniques. As such, applying these methods to the volume densities recovered by neural radiance fields and related techniques (or meshes triangulated from such fields) results in texture atlases that are too fragmented… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project page at https://pratulsrinivasan.github.io/nuvo

  5. arXiv:2312.02981  [pdf, other

    cs.CV

    ReconFusion: 3D Reconstruction with Diffusion Priors

    Authors: Rundi Wu, Ben Mildenhall, Philipp Henzler, Keunhong Park, Ruiqi Gao, Daniel Watson, Pratul P. Srinivasan, Dor Verbin, Jonathan T. Barron, Ben Poole, Aleksander Holynski

    Abstract: 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for nove… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://reconfusion.github.io/

  6. arXiv:2310.07687  [pdf, other

    astro-ph.HE astro-ph.IM cs.CV

    Orbital Polarimetric Tomography of a Flare Near the Sagittarius A* Supermassive Black Hole

    Authors: Aviad Levis, Andrew A. Chael, Katherine L. Bouman, Maciek Wielgus, Pratul P. Srinivasan

    Abstract: The interaction between the supermassive black hole at the center of the Milky Way, Sagittarius A*, and its accretion disk occasionally produces high-energy flares seen in X-ray, infrared, and radio. One proposed mechanism that produces flares is the formation of compact, bright regions that appear within the accretion disk and close to the event horizon. Understanding these flares provides a wind… ▽ More

    Submitted 16 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  7. arXiv:2309.04437  [pdf, other

    cs.CV astro-ph.CO

    Single View Refractive Index Tomography with Neural Fields

    Authors: Brandon Zhao, Aviad Levis, Liam Connor, Pratul P. Srinivasan, Katherine L. Bouman

    Abstract: Refractive Index Tomography is the inverse problem of reconstructing the continuously-varying 3D refractive index in a scene using 2D projected image measurements. Although a purely refractive field is not directly visible, it bends light rays as they travel through space, thus providing a signal for reconstruction. The effects of such fields appear in many scientific computer vision settings, ran… ▽ More

    Submitted 1 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  8. arXiv:2305.16321  [pdf, other

    cs.CV cs.GR

    Eclipse: Disambiguating Illumination and Materials using Unintended Shadows

    Authors: Dor Verbin, Ben Mildenhall, Peter Hedman, Jonathan T. Barron, Todd Zickler, Pratul P. Srinivasan

    Abstract: Decomposing an object's appearance into representations of its materials and the surrounding illumination is difficult, even when the object's 3D shape is known beforehand. This problem is especially challenging for diffuse objects: it is ill-conditioned because diffuse materials severely blur incoming light, and it is ill-posed because diffuse materials under high-frequency lighting can be indist… ▽ More

    Submitted 13 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Project page: https://dorverbin.github.io/eclipse/

  9. arXiv:2304.06706  [pdf, other

    cs.CV cs.GR cs.LG

    Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman

    Abstract: Neural Radiance Field training can be accelerated through the use of grid-based representations in NeRF's learned map** from spatial coordinates to colors and volumetric density. However, these grid-based approaches lack an explicit understanding of scale and therefore often introduce aliasing, usually in the form of jaggies or missing scene content. Anti-aliasing has previously been addressed b… ▽ More

    Submitted 26 October, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Project page: https://jonbarron.info/zipnerf/

  10. arXiv:2302.14859  [pdf, other

    cs.CV

    BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis

    Authors: Lior Yariv, Peter Hedman, Christian Reiser, Dor Verbin, Pratul P. Srinivasan, Richard Szeliski, Jonathan T. Barron, Ben Mildenhall

    Abstract: We present a method for reconstructing high-quality meshes of large unbounded real-world scenes suitable for photorealistic novel view synthesis. We first optimize a hybrid neural volume-surface scene representation designed to have well-behaved level sets that correspond to surfaces in the scene. We then bake this representation into a high-quality triangle mesh, which we equip with a simple and… ▽ More

    Submitted 16 May, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Video and interactive web demo available at https://bakedsdf.github.io/

  11. arXiv:2302.12249  [pdf, other

    cs.CV cs.GR

    MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes

    Authors: Christian Reiser, Richard Szeliski, Dor Verbin, Pratul P. Srinivasan, Ben Mildenhall, Andreas Geiger, Jonathan T. Barron, Peter Hedman

    Abstract: Neural radiance fields enable state-of-the-art photorealistic view synthesis. However, existing radiance field representations are either too compute-intensive for real-time rendering or require too much memory to scale to large scenes. We present a Memory-Efficient Radiance Field (MERF) representation that achieves real-time rendering of large-scale scenes in a browser. MERF reduces the memory co… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Video and interactive web demo available at https://merf42.github.io

  12. arXiv:2302.08504  [pdf, other

    cs.CV cs.GR

    PersonNeRF: Personalized Reconstruction from Photo Collections

    Authors: Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman

    Abstract: We present PersonNeRF, a method that takes a collection of photos of a subject (e.g. Roger Federer) captured across multiple years with arbitrary body poses and appearances, and enables rendering the subject with arbitrary novel combinations of viewpoint, body pose, and appearance. PersonNeRF builds a customized neural volumetric 3D model of the subject that is able to render an entire space spann… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: Project Page: https://grail.cs.washington.edu/projects/personnerf/

  13. arXiv:2204.03715  [pdf, other

    cs.CV astro-ph.IM

    Gravitationally Lensed Black Hole Emission Tomography

    Authors: Aviad Levis, Pratul P. Srinivasan, Andrew A. Chael, Ren Ng, Katherine L. Bouman

    Abstract: Measurements from the Event Horizon Telescope enabled the visualization of light emission around a black hole for the first time. So far, these measurements have been used to recover a 2D image under the assumption that the emission field is static over the period of acquisition. In this work, we propose BH-NeRF, a novel tomography approach that leverages gravitational lensing to recover the conti… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in the IEEE Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2022. Supplemental material including accompanying pdf, code, and video highlight can be found in the project page: http://imaging.cms.caltech.edu/bhnerf/

  14. arXiv:2202.05263  [pdf, other

    cs.CV cs.GR

    Block-NeRF: Scalable Large Scene Neural View Synthesis

    Authors: Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jonathan T. Barron, Henrik Kretzschmar

    Abstract: We present Block-NeRF, a variant of Neural Radiance Fields that can represent large-scale environments. Specifically, we demonstrate that when scaling NeRF to render city-scale scenes spanning multiple blocks, it is vital to decompose the scene into individually trained NeRFs. This decomposition decouples rendering time from scene size, enables rendering to scale to arbitrarily large environments,… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Project page: https://waymo.com/research/block-nerf/

  15. arXiv:2201.04127  [pdf, other

    cs.CV cs.GR

    HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video

    Authors: Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman

    Abstract: We introduce a free-viewpoint rendering method -- HumanNeRF -- that works on a given monocular video of a human performing complex body motions, e.g. a video from YouTube. Our method enables pausing the video at any frame and rendering the subject from arbitrary new camera viewpoints or even a full 360-degree camera path for that particular frame and body pose. This task is particularly challengin… ▽ More

    Submitted 14 June, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: CVPR 2022 (oral). Project page with videos: https://grail.cs.washington.edu/projects/humannerf/

  16. arXiv:2112.03907  [pdf, other

    cs.CV cs.GR

    Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields

    Authors: Dor Verbin, Peter Hedman, Ben Mildenhall, Todd Zickler, Jonathan T. Barron, Pratul P. Srinivasan

    Abstract: Neural Radiance Fields (NeRF) is a popular view synthesis technique that represents a scene as a continuous volumetric function, parameterized by multilayer perceptrons that provide the volume density and view-dependent emitted radiance at each location. While NeRF-based techniques excel at representing fine geometric structures with smoothly varying view-dependent appearance, they often fail to a… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Project page: https://dorverbin.github.io/refnerf/

  17. arXiv:2112.03288  [pdf, other

    cs.CV

    Dense Depth Priors for Neural Radiance Fields from Sparse Input Views

    Authors: Barbara Roessle, Jonathan T. Barron, Ben Mildenhall, Pratul P. Srinivasan, Matthias Nießner

    Abstract: Neural radiance fields (NeRF) encode a scene into a neural representation that enables photo-realistic rendering of novel views. However, a successful reconstruction from RGB images requires a large number of input views taken under static conditions - typically up to a few hundred images for room-size scenes. Our method aims to synthesize novel views of whole rooms from an order of magnitude fewe… ▽ More

    Submitted 7 April, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: CVPR 2022, project page: https://barbararoessle.github.io/dense_depth_priors_nerf/ , video: https://youtu.be/zzkvvdcvksc

  18. arXiv:2111.14643  [pdf, other

    cs.CV cs.GR

    Urban Radiance Fields

    Authors: Konstantinos Rematas, Andrew Liu, Pratul P. Srinivasan, Jonathan T. Barron, Andrea Tagliasacchi, Thomas Funkhouser, Vittorio Ferrari

    Abstract: The goal of this work is to perform 3D reconstruction and novel view synthesis from data captured by scanning platforms commonly deployed for world map** in urban outdoor environments (e.g., Street View). Given a sequence of posed RGB images and lidar sweeps acquired by cameras and scanners moving through an outdoor scene, we produce a model from which 3D surfaces can be extracted and novel RGB… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Project: https://urban-radiance-fields.github.io/

  19. arXiv:2111.12077  [pdf, other

    cs.CV cs.GR

    Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman

    Abstract: Though neural radiance fields (NeRF) have demonstrated impressive view synthesis results on objects and small bounded regions of space, they struggle on "unbounded" scenes, where the camera may point in any direction and content may exist at any distance. In this setting, existing NeRF-like models often produce blurry or low-resolution renderings (due to the unbalanced detail and scale of nearby a… ▽ More

    Submitted 25 March, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: https://jonbarron.info/mipnerf360/

  20. arXiv:2110.05655  [pdf, other

    cs.CV

    Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

    Authors: Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg

    Abstract: We present a method that takes as input a single dual-pixel image, and simultaneously estimates the image's defocus map -- the amount of defocus blur at each pixel -- and recovers an all-in-focus image. Our method is inspired from recent works that leverage the dual-pixel sensors available in many consumer cameras to assist with autofocus, and use them for recovery of defocus maps or all-in-focus… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: ICCV 2021 (Oral)

  21. NeRFactor: Neural Factorization of Shape and Reflectance Under an Unknown Illumination

    Authors: Xiuming Zhang, Pratul P. Srinivasan, Boyang Deng, Paul Debevec, William T. Freeman, Jonathan T. Barron

    Abstract: We address the problem of recovering the shape and spatially-varying reflectance of an object from multi-view images (and their camera poses) of an object illuminated by one unknown lighting condition. This enables the rendering of novel views of the object under arbitrary environment lighting and editing of the object's material properties. The key to our approach, which we call Neural Radiance F… ▽ More

    Submitted 21 December, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Camera-ready version for SIGGRAPH Asia 2021. Project Page: https://people.csail.mit.edu/xiuming/projects/nerfactor/

  22. arXiv:2103.14645  [pdf, other

    cs.CV cs.GR

    Baking Neural Radiance Fields for Real-Time View Synthesis

    Authors: Peter Hedman, Pratul P. Srinivasan, Ben Mildenhall, Jonathan T. Barron, Paul Debevec

    Abstract: Neural volumetric representations such as Neural Radiance Fields (NeRF) have emerged as a compelling technique for learning to represent 3D scenes from images with the goal of rendering photorealistic images of the scene from unobserved viewpoints. However, NeRF's computational requirements are prohibitive for real-time applications: rendering views from a trained NeRF requires querying a multilay… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Project page: https://nerf.live

  23. arXiv:2103.13415  [pdf, other

    cs.CV cs.GR

    Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan

    Abstract: The rendering procedure used by neural radiance fields (NeRF) samples a scene with a single ray per pixel and may therefore produce renderings that are excessively blurred or aliased when training or testing images observe scene content at different resolutions. The straightforward solution of supersampling by rendering with multiple rays per pixel is impractical for NeRF, because rendering each r… ▽ More

    Submitted 13 August, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  24. arXiv:2012.03927  [pdf, other

    cs.CV cs.GR

    NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis

    Authors: Pratul P. Srinivasan, Boyang Deng, Xiuming Zhang, Matthew Tancik, Ben Mildenhall, Jonathan T. Barron

    Abstract: We present a method that takes as input a set of images of a scene illuminated by unconstrained known lighting, and produces as output a 3D representation that can be rendered from novel viewpoints under arbitrary lighting conditions. Our method represents the scene as a continuous volumetric function parameterized as MLPs whose inputs are a 3D location and whose outputs are the following scene pr… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Project page: https://people.eecs.berkeley.edu/~pratul/nerv

  25. arXiv:2012.02189  [pdf, other

    cs.CV

    Learned Initializations for Optimizing Coordinate-Based Neural Representations

    Authors: Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng

    Abstract: Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized weights for each new signal is inefficient. We propose applying standard meta-learning algorithms to learn the initial weight parameters for these fully-connect… ▽ More

    Submitted 23 March, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Project page: https://www.matthewtancik.com/learnit

  26. arXiv:2008.01815  [pdf, other

    cs.CV cs.GR

    Deep Multi Depth Panoramas for View Synthesis

    Authors: Kai-En Lin, Zexiang Xu, Ben Mildenhall, Pratul P. Srinivasan, Yannick Hold-Geoffroy, Stephen DiVerdi, Qi Sun, Kalyan Sunkavalli, Ravi Ramamoorthi

    Abstract: We propose a learning-based approach for novel view synthesis for multi-camera 360$^{\circ}$ panorama capture rigs. Previous work constructs RGBD panoramas from such data, allowing for view synthesis with small amounts of translation, but cannot handle the disocclusions and view-dependent effects that are caused by large translations. To address this issue, we present a novel scene representation… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: Published at the European Conference on Computer Vision, 2020

  27. arXiv:2006.10739  [pdf, other

    cs.CV cs.LG

    Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains

    Authors: Matthew Tancik, Pratul P. Srinivasan, Ben Mildenhall, Sara Fridovich-Keil, Nithin Raghavan, Utkarsh Singhal, Ravi Ramamoorthi, Jonathan T. Barron, Ren Ng

    Abstract: We show that passing input points through a simple Fourier feature map** enables a multilayer perceptron (MLP) to learn high-frequency functions in low-dimensional problem domains. These results shed light on recent advances in computer vision and graphics that achieve state-of-the-art results by using MLPs to represent complex 3D objects and scenes. Using tools from the neural tangent kernel (N… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: Project page: https://people.eecs.berkeley.edu/~bmild/fourfeat/

  28. arXiv:2003.08934  [pdf, other

    cs.CV cs.GR

    NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

    Authors: Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng

    Abstract: We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location $(x,y,z)$ and viewing direction… ▽ More

    Submitted 3 August, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: ECCV 2020 (oral). Project page with videos and code: http://tancik.com/nerf

  29. arXiv:2003.08367  [pdf, other

    cs.CV cs.GR

    Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

    Authors: Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik, Jonathan T. Barron, Richard Tucker, Noah Snavely

    Abstract: We present a deep learning solution for estimating the incident illumination at any 3D location within a scene from an input narrow-baseline stereo image pair. Previous approaches for predicting global illumination from images either predict just a single illumination for the entire scene, or separately estimate the illumination at each 3D location without enforcing that the predictions are consis… ▽ More

    Submitted 13 May, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: CVPR 2020. Project page: https://people.eecs.berkeley.edu/~pratul/lighthouse/ [Updates: typos corrected]

  30. arXiv:1905.00889  [pdf, other

    cs.CV cs.GR

    Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines

    Authors: Ben Mildenhall, Pratul P. Srinivasan, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari, Ravi Ramamoorthi, Ren Ng, Abhishek Kar

    Abstract: We present a practical and robust deep learning solution for capturing and rendering novel views of complex real world scenes for virtual exploration. Previous approaches either require intractably dense view sampling or provide little to no guidance for how users should sample views of a scene to reliably render high-quality novel views. Instead, we propose an algorithm for view synthesis from an… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: SIGGRAPH 2019. Project page with video and code: http://people.eecs.berkeley.edu/~bmild/llff/

  31. arXiv:1905.00413  [pdf, other

    cs.CV

    Pushing the Boundaries of View Extrapolation with Multiplane Images

    Authors: Pratul P. Srinivasan, Richard Tucker, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng, Noah Snavely

    Abstract: We explore the problem of view synthesis from a narrow baseline pair of images, and focus on generating high-quality view extrapolations with plausible disocclusions. Our method builds upon prior work in predicting a multiplane image (MPI), which represents scene content as a set of RGB$α$ planes within a reference view frustum and renders novel views by projecting this content into the target vie… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: Oral presentation at CVPR 2019

  32. arXiv:1711.07933  [pdf, other

    cs.CV

    Aperture Supervision for Monocular Depth Estimation

    Authors: Pratul P. Srinivasan, Rahul Garg, Neal Wadhwa, Ren Ng, Jonathan T. Barron

    Abstract: We present a novel method to train machine learning algorithms to estimate scene depths from a single image, by using the information provided by a camera's aperture as supervision. Prior works use a depth sensor's outputs or images of the same scene from alternate viewpoints as supervision, while our method instead uses images from the same viewpoint taken with a varying camera aperture. To enabl… ▽ More

    Submitted 29 March, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: To appear at CVPR 2018 (updated to camera ready version)

  33. arXiv:1708.03292  [pdf, other

    cs.CV cs.GR

    Learning to Synthesize a 4D RGBD Light Field from a Single Image

    Authors: Pratul P. Srinivasan, Tongzhou Wang, Ashwin Sreelal, Ravi Ramamoorthi, Ren Ng

    Abstract: We present a machine learning algorithm that takes as input a 2D RGB image and synthesizes a 4D RGBD light field (color and depth of the scene in each ray direction). For training, we introduce the largest public light field dataset, consisting of over 3300 plenoptic camera light fields of scenes containing flowers and plants. Our synthesis pipeline consists of a convolutional neural network (CNN)… ▽ More

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: International Conference on Computer Vision (ICCV) 2017

  34. arXiv:1704.05416  [pdf, other

    cs.CV

    Light Field Blind Motion Deblurring

    Authors: Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

    Abstract: We study the problem of deblurring light fields of general 3D scenes captured under 3D camera motion and present both theoretical and practical contributions. By analyzing the motion-blurred light field in the primal and Fourier domains, we develop intuition into the effects of camera motion on the light field, show the advantages of capturing a 4D light field instead of a conventional 2D image fo… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

    Comments: To be presented at CVPR 2017