Skip to main content

Showing 1–9 of 9 results for author: Goesele, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.13561  [pdf, other

    cs.HC cs.CV

    Project Aria: A New Tool for Egocentric Multi-Modal AI Research

    Authors: Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira , et al. (49 additional authors not shown)

    Abstract: Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul… ▽ More

    Submitted 1 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  2. arXiv:2203.00051  [pdf, other

    cs.CV cs.GR

    ERF: Explicit Radiance Field Reconstruction From Scratch

    Authors: Samir Aroudj, Steven Lovegrove, Eddy Ilg, Tanner Schmidt, Michael Goesele, Richard Newcombe

    Abstract: We propose a novel explicit dense 3D reconstruction approach that processes a set of images of a scene with sensor poses and calibrations and estimates a photo-real digital model. One of the key innovations is that the underlying volumetric representation is completely explicit in contrast to neural network-based (implicit) alternatives. We encode scenes explicitly using clear and understandable m… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: 23 pages, 18 figures

    ACM Class: I.3.3; I.4.5

  3. arXiv:2103.02597  [pdf, other

    cs.CV cs.GR

    Neural 3D Video Synthesis from Multi-view Video

    Authors: Tianye Li, Mira Slavcheva, Michael Zollhoefer, Simon Green, Christoph Lassner, Changil Kim, Tanner Schmidt, Steven Lovegrove, Michael Goesele, Richard Newcombe, Zhaoyang Lv

    Abstract: We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation. Our approach takes the high quality and compactness of static neural radiance fields in a new direction: to a model-free, dynamic setting. At the core of… ▽ More

    Submitted 2 May, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted as an oral presentation for CVPR 2022. Project website: https://neural-3d-video.github.io/

  4. arXiv:2005.14264  [pdf, other

    cs.CV

    LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery

    Authors: Wentong Liao, Xiang Chen, **gfeng Yang, Stefan Roth, Michael Goesele, Michael Ying Yang, Bodo Rosenhahn

    Abstract: State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a nov… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 8 pages

  5. arXiv:1906.05797  [pdf, other

    cs.CV cs.GR eess.IV

    The Replica Dataset: A Digital Replica of Indoor Spaces

    Authors: Julian Straub, Thomas Whelan, Lingni Ma, Yufan Chen, Erik Wijmans, Simon Green, Jakob J. Engel, Raul Mur-Artal, Carl Ren, Shobhit Verma, Anton Clarkson, Mingfei Yan, Brian Budge, Yajie Yan, Xiaqing Pan, June Yon, Yuyang Zou, Kimberly Leon, Nigel Carter, Jesus Briales, Tyler Gillingham, Elias Mueggler, Luis Pesqueira, Manolis Savva, Dhruv Batra , et al. (5 additional authors not shown)

    Abstract: We introduce Replica, a dataset of 18 highly photo-realistic 3D indoor scene reconstructions at room and building scale. Each scene consists of a dense mesh, high-resolution high-dynamic-range (HDR) textures, per-primitive semantic class and instance information, and planar mirror and glass reflectors. The goal of Replica is to enable machine learning (ML) research that relies on visually, geometr… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  6. arXiv:1811.10020  [pdf, other

    cs.CV

    Background Subtraction with Real-time Semantic Segmentation

    Authors: Dongdong Zeng, Xiang Chen, Ming Zhu, Michael Goesele, Arjan Kuijper

    Abstract: Accurate and fast foreground object extraction is very important for object tracking and recognition in video surveillance. Although many background subtraction (BGS) methods have been proposed in the recent past, it is still regarded as a tough problem due to the variety of challenging situations that occur in real-world scenarios. In this paper, we explore this problem from a new perspective and… ▽ More

    Submitted 12 December, 2018; v1 submitted 25 November, 2018; originally announced November 2018.

  7. arXiv:1804.04076  [pdf, other

    cs.CV cs.LG

    Detail-Preserving Pooling in Deep Networks

    Authors: Faraz Saeedan, Nicolas Weber, Michael Goesele, Stefan Roth

    Abstract: Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain distortions, and increase the receptive field size. Since pooling by nature is a lossy process, it is crucial that each such layer maintains the portion of the activation… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: To appear at CVPR 2018

  8. arXiv:1610.07368  [pdf, other

    cs.GR

    Simplification of Multi-Scale Geometry using Adaptive Curvature Fields

    Authors: Patrick Seemann, Simon Fuhrmann, Stefan Guthe, Fabian Langguth, Michael Goesele

    Abstract: We present a novel algorithm to compute multi-scale curvature fields on triangle meshes. Our algorithm is based on finding robust mean curvatures using the ball neighborhood, where the radius of a ball corresponds to the scale of the features. The essential problem is to find a good radius for each ball to obtain a reliable curvature estimation. We propose an algorithm that finds suitable radii in… ▽ More

    Submitted 31 October, 2016; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: 8 pages

    MSC Class: 65D18 ACM Class: I.3.5

  9. arXiv:1601.06950  [pdf, other

    cs.CV cs.GR

    Virtual Rephotography: Novel View Prediction Error for 3D Reconstruction

    Authors: Michael Waechter, Mate Beljan, Simon Fuhrmann, Nils Moehrle, Johannes Kopf, Michael Goesele

    Abstract: The ultimate goal of many image-based modeling systems is to render photo-realistic novel views of a scene without visible artifacts. Existing evaluation metrics and benchmarks focus mainly on the geometric accuracy of the reconstructed model, which is, however, a poor predictor of visual accuracy. Furthermore, using only geometric accuracy by itself does not allow evaluating systems that either l… ▽ More

    Submitted 26 January, 2016; originally announced January 2016.

    Comments: 10 pages, 12 figures, paper was submitted to ACM Transactions on Graphics for review

    ACM Class: I.3.7