Skip to main content

Showing 1–10 of 10 results for author: Radwan, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.03570  [pdf, other

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi , et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  2. arXiv:2112.00724  [pdf, other

    cs.CV cs.AI cs.GR

    RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs

    Authors: Michael Niemeyer, Jonathan T. Barron, Ben Mildenhall, Mehdi S. M. Sajjadi, Andreas Geiger, Noha Radwan

    Abstract: Neural Radiance Fields (NeRF) have emerged as a powerful representation for the task of novel view synthesis due to their simplicity and state-of-the-art performance. Though NeRF can produce photorealistic renderings of unseen viewpoints when many input views are available, its performance drops significantly when this number is reduced. We observe that the majority of artifacts in sparse input sc… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Project page available at https://m-niemeyer.github.io/regnerf/index.html

  3. arXiv:2111.13260  [pdf, other

    cs.CV cs.RO

    NeSF: Neural Semantic Fields for Generalizable Semantic Segmentation of 3D Scenes

    Authors: Suhani Vora, Noha Radwan, Klaus Greff, Henning Meyer, Kyle Genova, Mehdi S. M. Sajjadi, Etienne Pot, Andrea Tagliasacchi, Daniel Duckworth

    Abstract: We present NeSF, a method for producing 3D semantic fields from posed RGB images alone. In place of classical 3D representations, our method builds on recent work in implicit neural scene representations wherein 3D structure is captured by point-wise functions. We leverage this methodology to recover 3D density fields upon which we then train a 3D semantic segmentation model supervised by posed 2D… ▽ More

    Submitted 2 December, 2021; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: Project website: https://nesf3d.github.io/. Updated with minor edits to text

  4. arXiv:2111.13152  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations

    Authors: Mehdi S. M. Sajjadi, Henning Meyer, Etienne Pot, Urs Bergmann, Klaus Greff, Noha Radwan, Suhani Vora, Mario Lucic, Daniel Duckworth, Alexey Dosovitskiy, Jakob Uszkoreit, Thomas Funkhouser, Andrea Tagliasacchi

    Abstract: A classical problem in computer vision is to infer a 3D scene representation from few images that can be used to render novel views at interactive rates. Previous work focuses on reconstructing pre-defined 3D representations, e.g. textured meshes, or implicit representations, e.g. radiance fields, and often requires input images with precise camera poses and long processing times for each novel sc… ▽ More

    Submitted 29 March, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR 2022, Project website: https://srt-paper.github.io/

    Journal ref: CVPR 2022

  5. arXiv:2008.02268  [pdf, other

    cs.CV cs.GR cs.LG

    NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections

    Authors: Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth

    Abstract: We present a learning-based method for synthesizing novel views of complex scenes using only unstructured collections of in-the-wild photographs. We build on Neural Radiance Fields (NeRF), which uses the weights of a multilayer perceptron to model the density and color of a scene as a function of 3D coordinates. While NeRF works well on images of static subjects captured under controlled settings,… ▽ More

    Submitted 6 January, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: Project website: https://nerf-w.github.io. Ricardo Martin-Brualla, Noha Radwan, and Mehdi S. M. Sajjadi contributed equally to this work. Updated with results for three additional scenes

  6. arXiv:1808.06887  [pdf, other

    cs.RO cs.CV

    Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing

    Authors: Noha Radwan, Wolfram Burgard, Abhinav Valada

    Abstract: For mobile robots navigating on sidewalks, it is essential to be able to safely cross street intersections. Most existing approaches rely on the recognition of the traffic light signal to make an informed crossing decision. Although these approaches have been crucial enablers for urban navigation, the capabilities of robots employing such approaches are still limited to navigating only on streets… ▽ More

    Submitted 3 August, 2020; v1 submitted 21 August, 2018; originally announced August 2018.

    Comments: The International Journal of Robotics Research (2020)

    Journal ref: The International Journal of Robotics Research (IJRR), vol. 39, no. 13, pp. 1567-1598, 2020

  7. VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry

    Authors: Noha Radwan, Abhinav Valada, Wolfram Burgard

    Abstract: Semantic understanding and localization are fundamental enablers of robot autonomy that have for the most part been tackled as disjoint problems. While deep learning has enabled recent breakthroughs across a wide spectrum of scene understanding tasks, its applicability to state estimation tasks has been limited due to the direct formulation that renders it incapable of encoding scene-specific cons… ▽ More

    Submitted 11 October, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Demo and dataset available at http://deeploc.cs.uni-freiburg.de

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 3(4):4407-4414, 2018

  8. arXiv:1803.03642  [pdf, other

    cs.RO cs.LG

    Deep Auxiliary Learning for Visual Localization and Odometry

    Authors: Abhinav Valada, Noha Radwan, Wolfram Burgard

    Abstract: Localization is an indispensable component of a robot's autonomy stack that enables it to determine where it is in the environment, essentially making it a precursor for any action execution or planning. Although convolutional neural networks have shown promising results for visual localization, they are still grossly outperformed by state-of-the-art local feature-based techniques. In this work, w… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: Accepted for ICRA 2018

  9. arXiv:1709.06039  [pdf, other

    cs.RO

    Why did the Robot Cross the Road? - Learning from Multi-Modal Sensor Data for Autonomous Road Crossing

    Authors: Noha Radwan, Wera Winterhalter, Christian Dornhege, Wolfram Burgard

    Abstract: We consider the problem of develo** robots that navigate like pedestrians on sidewalks through city centers for performing various tasks including delivery and surveillance. One particular challenge for such robots is crossing streets without pedestrian traffic lights. To solve this task the robot has to decide based on its sensory input if the road is clear. In this work, we propose a novel mul… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: Video: https://www.youtube.com/watch?v=N1IhHHkUzYg Dataset: http://www2.informatik.uni-freiburg.de/~radwann/freiburg_street_crossing_dataset.html

  10. arXiv:1706.08775  [pdf, other

    cs.CV cs.RO

    Topometric Localization with Deep Learning

    Authors: Gabriel L. Oliveira, Noha Radwan, Wolfram Burgard, Thomas Brox

    Abstract: Compared to LiDAR-based localization methods, which provide high accuracy but rely on expensive sensors, visual localization approaches only require a camera and thus are more cost-effective while their accuracy and reliability typically is inferior to LiDAR-based methods. In this work, we propose a vision-based localization approach that learns from LiDAR-based localization methods by using their… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

    Comments: 16 pages, 7 figures, ISRR 2017 submission