Skip to main content

Showing 1–7 of 7 results for author: Wimbauer, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07933  [pdf, other

    cs.CV

    Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation

    Authors: Keonhee Han, Dominik Muhle, Felix Wimbauer, Daniel Cremers

    Abstract: Inferring scene geometry from images via Structure from Motion is a long-standing and fundamental problem in computer vision. While classical approaches and, more recently, depth map predictions only focus on the visible parts of a scene, the task of scene completion aims to reason about geometry even in occluded regions. With the popularity of neural radiance fields (NeRFs), implicit representati… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2312.05208  [pdf, other

    cs.CV

    ControlRoom3D: Room Generation using Semantic Proxy Rooms

    Authors: Jonas Schult, Sam Tsai, Lukas Höllein, Bichen Wu, Jialiang Wang, Chih-Yao Ma, Kunpeng Li, Xiaofang Wang, Felix Wimbauer, Zijian He, Peizhao Zhang, Bastian Leibe, Peter Vajda, Ji Hou

    Abstract: Manually creating 3D environments for AR/VR applications is a complex process requiring expert knowledge in 3D modeling software. Pioneering works facilitate this process by generating room meshes conditioned on textual style descriptions. Yet, many of these automatically generated 3D meshes do not adhere to typical room layouts, compromising their plausibility, e.g., by placing several beds in on… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Project Page: https://jonasschult.github.io/ControlRoom3D/

  3. arXiv:2312.03209  [pdf, other

    cs.CV

    Cache Me if You Can: Accelerating Diffusion Models through Block Caching

    Authors: Felix Wimbauer, Bichen Wu, Edgar Schoenfeld, Xiaoliang Dai, Ji Hou, Zijian He, Artsiom Sanakoyeu, Peizhao Zhang, Sam Tsai, Jonas Kohler, Christian Rupprecht, Daniel Cremers, Peter Vajda, Jialiang Wang

    Abstract: Diffusion models have recently revolutionized the field of image synthesis due to their ability to generate photorealistic images. However, one of the major drawbacks of diffusion models is that the image generation process is costly. A large image-to-image network has to be applied many times to iteratively refine an image from random noise. While many recent works propose techniques to reduce th… ▽ More

    Submitted 12 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://fwmb.github.io/blockcaching/

  4. arXiv:2310.07522  [pdf, other

    cs.CV

    S4C: Self-Supervised Semantic Scene Completion with Neural Fields

    Authors: Adrian Hayler, Felix Wimbauer, Dominik Muhle, Christian Rupprecht, Daniel Cremers

    Abstract: 3D semantic scene understanding is a fundamental challenge in computer vision. It enables mobile agents to autonomously plan and navigate arbitrary environments. SSC formalizes this challenge as jointly estimating dense geometry and semantic information from sparse observations of a scene. Current methods for SSC are generally trained on 3D ground truth based on aggregated LiDAR scans. This proces… ▽ More

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  5. arXiv:2301.07668  [pdf, other

    cs.CV

    Behind the Scenes: Density Fields for Single View Reconstruction

    Authors: Felix Wimbauer, Nan Yang, Christian Rupprecht, Daniel Cremers

    Abstract: Inferring a meaningful geometric scene representation from a single image is a fundamental problem in computer vision. Approaches based on traditional depth map prediction can only reason about areas that are visible in the image. Currently, neural radiance fields (NeRFs) can capture true 3D including color, but are too complex to be generated from a single image. As an alternative, we propose to… ▽ More

    Submitted 19 April, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Project Page: https://fwmb.github.io/bts/

  6. arXiv:2201.02279  [pdf, other

    cs.CV

    De-rendering 3D Objects in the Wild

    Authors: Felix Wimbauer, Shangzhe Wu, Christian Rupprecht

    Abstract: With increasing focus on augmented and virtual reality applications (XR) comes the demand for algorithms that can lift objects from images and videos into representations that are suitable for a wide variety of related 3D tasks. Large-scale deployment of XR devices and applications means that we cannot solely rely on supervised learning, as collecting and annotating data for the unlimited variety… ▽ More

    Submitted 27 September, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 18490-18499

  7. arXiv:2011.11814  [pdf, other

    cs.CV

    MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

    Authors: Felix Wimbauer, Nan Yang, Lukas von Stumberg, Niclas Zeller, Daniel Cremers

    Abstract: In this paper, we propose MonoRec, a semi-supervised monocular dense reconstruction architecture that predicts depth maps from a single moving camera in dynamic environments. MonoRec is based on a multi-view stereo setting which encodes the information of multiple consecutive images in a cost volume. To deal with dynamic objects in the scene, we introduce a MaskModule that predicts moving object m… ▽ More

    Submitted 6 May, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: CVPR 2021, Project page with video can be found under https://vision.in.tum.de/research/monorec. 14 pages, 10 figures, 5 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 6112-6122