Showing 1–20 of 20 results for author: Haene, C

  1. arXiv:2404.02225

    cs.CV cs.AI

    CHOSEN: Contrastive Hypothesis Selection for Multi-View Depth Refinement

    Authors: Di Qiu, Yinda Zhang, Thabo Beeler, Vladimir Tankovich, Christian Häne, Sean Fanello, Christoph Rhemann, Sergio Orts Escolano

    Abstract: We propose CHOSEN, a simple yet flexible, robust and effective multi-view depth refinement framework. It can be employed in any existing multi-view stereo pipeline, with straightforward generalization capability for different multi-view capture systems such as camera relative positioning and lenses. Given an initial depth estimation, CHOSEN iteratively re-samples and selects the best hypotheses, a…

    Submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2312.03556

    cs.CV cs.LG

    Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention

    Authors: Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Haene, Jean-Charles Bazin, Fernando de la Torre

    Abstract: Face inpainting is important in various applications, such as photo restoration, image editing, and virtual reality. Despite the significant advances in face generative models, ensuring that a person's unique facial identity is maintained during the inpainting process is still an elusive goal. Current state-of-the-art techniques, exemplified by MyStyle, necessitate resource-intensive fine-tuning a…

    Submitted 6 December, 2023; originally announced December 2023.

  3. arXiv:2212.03406

    cs.CV

    SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields

    Authors: Siddhant Ranade, Christoph Lassner, Kai Li, Christian Haene, Shen-Chi Chen, Jean-Charles Bazin, Sofien Bouaziz

    Abstract: Neural Radiance Fields (NeRFs) encode the radiance in a scene parameterized by the scene's plenoptic function. This is achieved by using an MLP together with a mapping to a higher-dimensional space, and has been proven to capture scenes with a great level of detail. Naturally, the same parameterization can be used to encode additional properties of the scene, beyond just its radiance. A particular…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Project page: https://www.siddhantranade.com/research/2022/12/06/SSDNeRF-Semantic-Soft-Decomposition-of-Neural-Radiance-Fields.html

  4. arXiv:2202.08752

    cs.CV cs.GR

    OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas

    Authors: David Li, Yinda Zhang, Christian Häne, Danhang Tang, Amitabh Varshney, Ruofei Du

    Abstract: Immersive maps such as Google Street View and Bing Streetside provide true-to-life views with a massive collection of panoramas. However, these panoramas are only available at sparse intervals along the path they are taken, resulting in visual discontinuities during navigation. Prior art in view synthesis is usually built upon a set of perspective images, a pair of stereoscopic images, or a monocu…

    Submitted 22 February, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: Updated related works

  5. arXiv:2109.05591

    cs.CV

    Multiresolution Deep Implicit Functions for 3D Shape Representation

    Authors: Zhang Chen, Yinda Zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Haene, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang

    Abstract: We introduce Multiresolution Deep Implicit Functions (MDIF), a hierarchical representation that can recover fine geometry detail, while being able to perform global operations such as shape completion. Our model represents a complex 3D shape with a hierarchy of latent grids, which can be decoded into different levels of detail and also achieve better accuracy. For shape completion, we propose late…

    Submitted 16 September, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 8 pages of main paper, 10 pages of supplementary. Accepted by ICCV'21

  6. arXiv:2007.12140

    cs.CV

    HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching

    Authors: Vladimir Tankovich, Christian Häne, Yinda Zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz

    Abstract: This paper presents HITNet, a novel neural network architecture for real-time stereo matching. Contrary to many recent neural network approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume and instead relies on a fast multi-resolution initialization step, differentiable 2D geometric propagation and warping mechanisms to infer disp…

    Submitted 19 January, 2023; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: The pretrained models used for submission to benchmarks and sample evaluation scripts can be found at https://github.com/google-research/google-research/tree/master/hitnet

  7. arXiv:2005.08877

    eess.IV cs.CV cs.LG

    Deep Implicit Volume Compression

    Authors: Danhang Tang, Saurabh Singh, Philip A. Chou, Christian Haene, Mingsong Dou, Sean Fanello, Jonathan Taylor, Philip Davidson, Onur G. Guleryuz, Yinda Zhang, Shahram Izadi, Andrea Tagliasacchi, Sofien Bouaziz, Cem Keskin

    Abstract: We describe a novel approach for compressing truncated signed distance fields (TSDF) stored in 3D voxel grids, and their corresponding textures. To compress the TSDF, our method relies on a block-based neural network architecture trained end-to-end, achieving state-of-the-art rate-distortion trade-off. To prevent topological errors, we losslessly compress the signs of the TSDF, which also upper bo…

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Danhang Tang and Saurabh Singh have equal contribution

  8. arXiv:2003.14299

    cs.CV

    Du$^2$Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels

    Authors: Yinda Zhang, Neal Wadhwa, Sergio Orts-Escolano, Christian Häne, Sean Fanello, Rahul Garg

    Abstract: Computational stereo has reached a high level of accuracy, but degrades in the presence of occlusions, repeated textures, and correspondence errors along edges. We present a novel approach based on neural networks for depth estimation that combines stereo from dual cameras with stereo from a dual-pixel sensor, which is increasingly common on consumer cameras. Our network uses a novel architecture…

    Submitted 31 March, 2020; originally announced March 2020.

  9. arXiv:2002.03933

    cs.CV

    RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

    Authors: Hossam Isack, Christian Haene, Cem Keskin, Sofien Bouaziz, Yuri Boykov, Shahram Izadi, Sameh Khamis

    Abstract: We propose a novel efficient and lightweight model for human pose estimation from a single image. Our model is designed to achieve competitive results at a fraction of the number of parameters and computational cost of various state-of-the-art methods. To this end, we explicitly incorporate part-based structural and geometric priors in a hierarchical prediction framework. At the coarsest resolutio…

    Submitted 10 February, 2020; originally announced February 2020.

  10. arXiv:1906.10491

    cs.CV

    Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

    Authors: Nikolay Savinov, Lubor Ladicky, Christian Haene, Marc Pollefeys

    Abstract: Dense semantic 3D reconstruction is typically formulated as a discrete or continuous problem over label assignments in a voxel grid, combining semantic and depth likelihoods in a Markov Random Field framework. The depth and semantic information is incorporated as a unary potential, smoothed by a pairwise regularizer. However, modelling likelihoods as a unary potential does not model the problem co…

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: Published at CVPR 2015

  11. arXiv:1901.01971

    cs.CV

    Learning Independent Object Motion from Unlabelled Stereoscopic Videos

    Authors: Zhe Cao, Abhishek Kar, Christian Haene, Jitendra Malik

    Abstract: We present a system for learning motion of independently moving objects from stereo videos. The only human annotation used in our system are 2D object bounding boxes which introduce the notion of objects to our system. Unlike prior learning based work which has focused on predicting dense pixel-wise optical flow field and/or a depth map for each image, we propose to predict object instance specifi…

    Submitted 8 January, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

  12. arXiv:1809.02882

    cs.CV cs.LG

    Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection

    Authors: Weicheng Kuo, Christian Häne, Esther Yuh, Pratik Mukherjee, Jitendra Malik

    Abstract: Deep learning for clinical applications is subject to stringent performance requirements, which raises a need for large labeled datasets. However, the enormous cost of labeling medical data makes this challenging. In this paper, we build a cost-sensitive active learning system for the problem of intracranial hemorrhage detection and segmentation on head computed tomography (CT). We show that our e…

    Submitted 8 September, 2018; originally announced September 2018.

  13. arXiv:1806.03265

    cs.CV

    PatchFCN for Intracranial Hemorrhage Detection

    Authors: Weicheng Kuo, Christian Häne, Esther Yuh, Pratik Mukherjee, Jitendra Malik

    Abstract: This paper studies the problem of detecting and segmenting acute intracranial hemorrhage on head computed tomography (CT) scans. We propose to solve both tasks as a semantic segmentation problem using a patch-based fully convolutional network (PatchFCN). This formulation allows us to accurately localize hemorrhages while bypassing the complexity of object detection. Our system demonstrates competi…

    Submitted 14 April, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

  14. arXiv:1710.06104

    cs.CV

    Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55

    Authors: Li Yi, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Qirui Wang, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, Yuan Gan, Pengyu Wang, Kun Liu, Fenggen Yu, Panpan Shui, Bingyang Hu, Yan Zhang, Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Minki Jeong, Jaehoon Choi, Changick Kim, Angom Geetchandra, et al. (25 additional authors not shown)

    Abstract: We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database. The benchmark consists of two tasks: part-level segmentation of 3D shapes and 3D reconstruction from single view images. Ten teams have participated in the challenge and the best performing teams have outperformed state-of-the-art approaches on both tasks. A few novel deep learni…

    Submitted 27 October, 2017; v1 submitted 17 October, 2017; originally announced October 2017.

  15. arXiv:1708.09839

    cs.CV

    3D Visual Perception for Self-Driving Cars using a Multi-Camera System: Calibration, Mapping, Localization, and Obstacle Detection

    Authors: Christian Häne, Lionel Heng, Gim Hee Lee, Friedrich Fraundorfer, Paul Furgale, Torsten Sattler, Marc Pollefeys

    Abstract: Cameras are a crucial exteroceptive sensor for self-driving cars as they are low-cost and small, provide appearance information about the environment, and work in various weather conditions. They can be used for multiple purposes such as visual navigation and obstacle detection. We can use a surround multi-camera system to cover the full 360-degree field-of-view around the car. In this way, we avo…

    Submitted 31 August, 2017; originally announced August 2017.

  16. arXiv:1708.05375

    cs.CV

    Learning a Multi-View Stereo Machine

    Authors: Abhishek Kar, Christian Häne, Jitendra Malik

    Abstract: We present a learnt system for multi-view stereopsis. In contrast to recent learning based methods for 3D reconstruction, we leverage the underlying 3D geometry of the problem through feature projection and unprojection along viewing rays. By formulating these operations in a differentiable manner, we are able to learn the system end-to-end for the task of metric 3D reconstruction. End-to-end lear…

    Submitted 17 August, 2017; originally announced August 2017.

  17. arXiv:1704.00710

    cs.CV

    Hierarchical Surface Prediction for 3D Object Reconstruction

    Authors: Christian Häne, Shubham Tulsiani, Jitendra Malik

    Abstract: Recently, Convolutional Neural Networks have shown promising results for 3D geometry prediction. They can make predictions from very little input data such as a single color image. A major limitation of such approaches is that they only predict a coarse resolution voxel grid, which does not capture the surface of the objects well. We propose a general framework, called hierarchical surface predict…

    Submitted 6 November, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: 3DV 2017

  18. arXiv:1604.02885

    cs.CV

    Semantic 3D Reconstruction with Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint

    Authors: Nikolay Savinov, Christian Haene, Lubor Ladicky, Marc Pollefeys

    Abstract: We propose an approach for dense semantic 3D reconstruction which uses a data term that is defined as potentials over viewing rays, combined with continuous surface area penalization. Our formulation is a convex relaxation which we augment with a crucial non-convex constraint that ensures exact handling of visibility. To tackle the non-convex minimization problem, we propose a majorize-minimize ty…

    Submitted 26 August, 2019; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: Accepted as a spotlight oral paper by CVPR 2016. Code at https://github.com/nsavinov/ray_potentials/

  19. arXiv:1502.00652

    cs.CV

    Learning the Matching Function

    Authors: Ľubor Ladický, Christian Häne, Marc Pollefeys

    Abstract: The matching function for the problem of stereo reconstruction or optical flow has been traditionally designed as a function of the distance between the features describing matched pixels. This approach works under assumption, that the appearance of pixels in two stereo cameras or in two consecutive video frames does not change dramatically. However, this might not be the case, if we try to match…

    Submitted 2 February, 2015; originally announced February 2015.

    Comments: rejected from ACCV 2014 and probably from CVPR 2015

  20. arXiv:1308.3101

    cs.CV cs.LG stat.ML

    Compact Relaxations for MAP Inference in Pairwise MRFs with Piecewise Linear Priors

    Authors: Christopher Zach, Christian Häne

    Abstract: Label assignment problems with large state spaces are important tasks especially in computer vision. Often the pairwise interaction (or smoothness prior) between labels assigned at adjacent nodes (or pixels) can be described as a function of the label difference. Exact inference in such labeling tasks is still difficult, and therefore approximate inference methods based on a linear programming (LP…

    Submitted 11 April, 2017; v1 submitted 14 August, 2013; originally announced August 2013.