Skip to main content

Showing 1–50 of 66 results for author: Sunderhauf, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10788  [pdf, other

    cs.RO

    Physically Embodied Gaussian Splatting: A Realtime Correctable World Model for Robotics

    Authors: Jad Abou-Chakra, Krishan Rana, Feras Dayoub, Niko Sünderhauf

    Abstract: For robots to robustly understand and interact with the physical world, it is highly beneficial to have a comprehensive representation - modelling geometry, physics, and visual observations - that informs perception, planning, and control algorithms. We propose a novel dual Gaussian-Particle representation that models the physical world while (i) enabling predictive simulation of future states and… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2406.05951  [pdf, other

    cs.RO

    Open-Vocabulary Part-Based Gras**

    Authors: Tjeard van Oort, Dimity Miller, Will N. Browne, Nicolas Marticorena, Jesse Haviland, Niko Suenderhauf

    Abstract: Many robotic applications require to grasp objects not arbitrarily but at a very specific object part. This is especially important for manipulation tasks beyond simple pick-and-place scenarios or in robot-human interactions, such as object handovers. We propose AnyPart, a practical system that combines open-vocabulary object detection, open-vocabulary part segmentation and 6DOF grasp pose predict… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2405.05792  [pdf, other

    cs.RO cs.AI cs.CV cs.HC cs.LG

    RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation

    Authors: Sourav Garg, Krishan Rana, Mehdi Hosseinzadeh, Lachlan Mares, Niko Sünderhauf, Feras Dayoub, Ian Reid

    Abstract: Map** is crucial for spatial reasoning, planning and robot navigation. Existing approaches range from metric, which require precise geometry-based optimization, to purely topological, where image-as-node based graphs lack explicit object-level reasoning and interconnectivity. In this paper, we propose a novel topological representation of an environment based on "image segments", which are seman… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Published at ICRA 2024; 9 pages, 8 figures

  4. arXiv:2404.09406  [pdf, other

    cs.CV cs.HC cs.LG cs.RO

    Human-in-the-Loop Segmentation of Multi-species Coral Imagery

    Authors: Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Niko Suenderhauf, Tobias Fischer

    Abstract: Broad-scale marine surveys performed by underwater vehicles significantly increase the availability of coral reef imagery, however it is costly and time-consuming for domain experts to label images. Point label propagation is an approach used to leverage existing image data labeled with sparse point labels. The resulting augmented ground truth generated is then used to train a semantic segmentatio… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted at the CVPR2024 3rd Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), 10 pages, 6 figures, an additional 4 pages of supplementary material

  5. arXiv:2403.16528  [pdf, other

    cs.CV

    Open-Set Recognition in the Age of Vision-Language Models

    Authors: Dimity Miller, Niko Sünderhauf, Alex Kenna, Keita Mason

    Abstract: Are vision-language models (VLMs) open-set models because they are trained on internet-scale datasets? We answer this question with a clear no - VLMs introduce closed-set assumptions via their finite query set, making them vulnerable to open-set conditions. We systematically evaluate VLMs for open-set recognition and find they frequently misclassify objects not contained in their query set, leadin… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 31 pages, under review

  6. arXiv:2312.12036  [pdf, other

    cs.RO cs.AI

    LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments

    Authors: Federico Ceola, Lorenzo Natale, Niko Sünderhauf, Krishan Rana

    Abstract: Instructing a robot to complete an everyday task within our homes has been a long-standing challenge for robotics. While recent progress in language-conditioned imitation learning and offline reinforcement learning has demonstrated impressive performance across a wide range of tasks, they are typically limited to short-horizon tasks -- not reflective of those a home robot would be expected to comp… ▽ More

    Submitted 1 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: RSS 2024 Workshop on Data Generation for Robotics

  7. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  8. arXiv:2307.06135  [pdf

    cs.RO cs.AI

    SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

    Authors: Krishan Rana, Jesse Haviland, Sourav Garg, Jad Abou-Chakra, Ian Reid, Niko Suenderhauf

    Abstract: Large language models (LLMs) have demonstrated impressive results in develo** generalist planning agents for diverse tasks. However, grounding these plans in expansive, multi-floor, and multi-room environments presents a significant challenge for robotics. We introduce SayPlan, a scalable approach to LLM-based, large-scale task planning for robotics using 3D scene graph (3DSG) representations. T… ▽ More

    Submitted 27 September, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted for oral presentation at the Conference on Robot Learning (CoRL), 2023. Project page can be found here: https://sayplan.github.io

  9. arXiv:2304.10782  [pdf, other

    cs.RO cs.AI

    Contrastive Language, Action, and State Pre-training for Robot Learning

    Authors: Krishan Rana, Andrew Melnik, Niko Sünderhauf

    Abstract: In this paper, we introduce a method for unifying language, action, and state information in a shared embedding space to facilitate a range of downstream tasks in robot learning. Our method, Contrastive Language, Action, and State Pre-training (CLASP), extends the CLIP formulation by incorporating distributional learning, capturing the inherent complexities and one-to-many relationships in behavio… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  10. arXiv:2303.16408  [pdf, other

    cs.CV cs.RO

    The Need for Inherently Privacy-Preserving Vision in Trustworthy Autonomous Systems

    Authors: Adam K. Taras, Niko Suenderhauf, Peter Corke, Donald G. Dansereau

    Abstract: Vision is a popular and effective sensor for robotics from which we can derive rich information about the environment: the geometry and semantics of the scene, as well as the age, gender, identity, activity and even emotional state of humans within that scene. This raises important questions about the reach, lifespan, and potential misuse of this information. This paper is a call to action to cons… ▽ More

    Submitted 10 May, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 7 pages, 6 figures

  11. arXiv:2303.14930  [pdf, other

    cs.CV

    Addressing the Challenges of Open-World Object Detection

    Authors: David Pershouse, Feras Dayoub, Dimity Miller, Niko Sünderhauf

    Abstract: We address the challenging problem of open world object detection (OWOD), where object detectors must identify objects from known classes while also identifying and continually learning to detect novel objects. Prior work has resulted in detectors that have a relatively low ability to detect novel objects, and a high likelihood of classifying a novel object as one of the known classes. We approach… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  12. arXiv:2211.04041  [pdf, other

    cs.CV cs.RO

    ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields

    Authors: Jad Abou-Chakra, Feras Dayoub, Niko Sünderhauf

    Abstract: While existing Neural Radiance Fields (NeRFs) for dynamic scenes are offline methods with an emphasis on visual fidelity, our paper addresses the online use case that prioritises real-time adaptability. We present ParticleNeRF, a new approach that dynamically adapts to changes in the scene geometry by learning an up-to-date representation online, every 200ms. ParticleNeRF achieves this using a nov… ▽ More

    Submitted 24 March, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  13. arXiv:2211.02231  [pdf, other

    cs.RO cs.AI cs.LG

    Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinforcement Learning for Robotics

    Authors: Krishan Rana, Ming Xu, Brendan Tidd, Michael Milford, Niko Sünderhauf

    Abstract: Skill-based reinforcement learning (RL) has emerged as a promising strategy to leverage prior knowledge for accelerated robot learning. Skills are typically extracted from expert demonstrations and are embedded into a latent space from which they can be sampled as actions by a high-level RL agent. However, this skill space is expansive, and not all skills are relevant for a given robot state, maki… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 6th Conference on Robot Learning (CoRL), 2022

  14. arXiv:2210.06849  [pdf, other

    cs.CV

    Retrospectives on the Embodied AI Workshop

    Authors: Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi , et al. (14 additional authors not shown)

    Abstract: We present a retrospective on the state of Embodied AI research. Our analysis focuses on 13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language. We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of… ▽ More

    Submitted 4 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  15. arXiv:2209.08718  [pdf, other

    cs.CV

    Density-aware NeRF Ensembles: Quantifying Predictive Uncertainty in Neural Radiance Fields

    Authors: Niko Sünderhauf, Jad Abou-Chakra, Dimity Miller

    Abstract: We show that ensembling effectively quantifies model uncertainty in Neural Radiance Fields (NeRFs) if a density-aware epistemic uncertainty term is considered. The naive ensembles investigated in prior work simply average rendered RGB images to quantify the model uncertainty caused by conflicting explanations of the observed scene. In contrast, we additionally consider the termination probabilitie… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

  16. arXiv:2208.13930  [pdf, other

    cs.CV

    SAFE: Sensitivity-Aware Features for Out-of-Distribution Object Detection

    Authors: Samuel Wilson, Tobias Fischer, Feras Dayoub, Dimity Miller, Niko Sünderhauf

    Abstract: We address the problem of out-of-distribution (OOD) detection for the task of object detection. We show that residual convolutional layers with batch normalisation produce Sensitivity-Aware FEatures (SAFE) that are consistently powerful for distinguishing in-distribution from out-of-distribution detections. We extract SAFE vectors for every detected object, and train a multilayer perceptron on the… ▽ More

    Submitted 22 August, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Journal ref: IEEE International Conference on Computer Vision 2023

  17. arXiv:2204.10516  [pdf, other

    cs.RO

    Implicit Object Map** With Noisy Data

    Authors: Jad Abou-Chakra, Feras Dayoub, Niko Sünderhauf

    Abstract: Modelling individual objects in a scene as Neural Radiance Fields (NeRFs) provides an alternative geometric scene representation that may benefit downstream robotics tasks such as scene understanding and object manipulation. However, we identify three challenges to using real-world training data collected by a robot to train a NeRF: (i) The camera trajectories are constrained, and full visual cove… ▽ More

    Submitted 7 October, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

  18. arXiv:2112.05341  [pdf, other

    cs.CV cs.AI

    Hyperdimensional Feature Fusion for Out-Of-Distribution Detection

    Authors: Samuel Wilson, Tobias Fischer, Niko Sünderhauf, Feras Dayoub

    Abstract: We introduce powerful ideas from Hyperdimensional Computing into the challenging field of Out-of-Distribution (OOD) detection. In contrast to most existing work that performs OOD detection based on only a single layer of a neural network, we use similarity-preserving semi-orthogonal projection matrices to project the feature maps from multiple layers into a common vector space. By repeatedly apply… ▽ More

    Submitted 29 August, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted to WACV2023

  19. arXiv:2112.05299  [pdf, other

    cs.RO cs.AI

    Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots

    Authors: Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, MIchael Milford, Niko Sünderhauf

    Abstract: While deep reinforcement learning (RL) agents have demonstrated incredible potential in attaining dexterous behaviours for robotics, they tend to make errors when deployed in the real world due to mismatches between the training and execution environments. In contrast, the classical robotics community have developed a range of controllers that can safely operate across most states in the real worl… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted for a poster and spotlight presentation at Neurips 2021 Workshop on Deployable Decision Making in Embodied Systems (DDM). arXiv admin note: substantial text overlap with arXiv:2107.09822

  20. Evaluating the Impact of Semantic Segmentation and Pose Estimation on Dense Semantic SLAM

    Authors: Suman Raj Bista, David Hall, Ben Talbot, Haoyang Zhang, Feras Dayoub, Niko Sünderhauf

    Abstract: Recent Semantic SLAM methods combine classical geometry-based estimation with deep learning-based object detection or semantic segmentation. In this paper we evaluate the quality of semantic maps generated by state-of-the-art class- and instance-aware dense semantic SLAM algorithms whose codes are publicly available and explore the impacts both semantic segmentation and pose estimation have on the… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: Paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021

  21. A Holistic Approach to Reactive Mobile Manipulation

    Authors: Jesse Haviland, Niko Sünderhauf, Peter Corke

    Abstract: We present the design and implementation of a taskable reactive mobile manipulation system. In contrary to related work, we treat the arm and base degrees of freedom as a holistic structure which greatly improves the speed and fluidity of the resulting motion. At the core of this approach is a robust and reactive motion controller which can achieve a desired end-effector pose, while avoiding joint… ▽ More

    Submitted 2 February, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: IEEE Robotics and Automation Letters (RA-L). Preprint Version. Accepted January, 2022. The code and videos can be found at https://jhavl.github.io/holistic/

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3122-3129, April 2022

  22. arXiv:2108.08748  [pdf, other

    cs.CV cs.RO

    FSNet: A Failure Detection Framework for Semantic Segmentation

    Authors: Quazi Marufur Rahman, Niko Sünderhauf, Peter Corke, Feras Dayoub

    Abstract: Semantic segmentation is an important task that helps autonomous vehicles understand their surroundings and navigate safely. During deployment, even the most mature segmentation models are vulnerable to various external factors that can degrade the segmentation performance with potentially catastrophic consequences for the vehicle and its surroundings. To address this issue, we propose a failure d… ▽ More

    Submitted 27 September, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

  23. arXiv:2107.09822  [pdf, other

    cs.RO cs.AI eess.SY

    Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

    Authors: Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, Niko Sünderhauf

    Abstract: We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable but suboptimal control priors exist for many tasks, but RL from scratch remains unsafe and data-inefficient. By fusing uncertainty-aware distributional outputs f… ▽ More

    Submitted 3 April, 2023; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: The International Journal of Robotics Research (IJRR), 2023. Project page: https://krishanrana.github.io/bcf

  24. arXiv:2107.09540  [pdf, other

    cs.CV cs.AI

    Critic Guided Segmentation of Rewarding Objects in First-Person Views

    Authors: Andrew Melnik, Augustin Harter, Christian Limberg, Krishan Rana, Niko Suenderhauf, Helge Ritter

    Abstract: This work discusses a learning approach to mask rewarding objects in images using sparse reward signals from an imitation learning dataset. For that, we train an Hourglass network using only feedback from a critic model. The Hourglass network learns to produce a mask to decrease the critic's score of a high score image and increase the critic's score of a low score image by swap** the masked are… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  25. Probabilistic Appearance-Invariant Topometric Localization with New Place Awareness

    Authors: Ming Xu, Tobias Fischer, Niko Sünderhauf, Michael Milford

    Abstract: Probabilistic state-estimation approaches offer a principled foundation for designing localization systems, because they naturally integrate sequences of imperfect motion and exteroceptive sensor data. Recently, probabilistic localization systems utilizing appearance-invariant visual place recognition (VPR) methods as the primary exteroceptive sensor have demonstrated state-of-the-art performance… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 8 pages

    Journal ref: IEEE Robotics and Automation Letters and IROS 2021

  26. Probabilistic Visual Place Recognition for Hierarchical Localization

    Authors: Ming Xu, Niko Sünderhauf, Michael Milford

    Abstract: Visual localization techniques often comprise a hierarchical localization pipeline, with a visual place recognition module used as a coarse localizer to initialize a pose refinement stage. While improving the pose refinement step has been the focus of much recent research, most work on the coarse localization stage has focused on improvements like increased invariance to appearance change, without… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 8 pages, 4 figures, RA-L standalone

  27. arXiv:2104.01328  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Uncertainty for Identifying Open-Set Errors in Visual Object Detection

    Authors: Dimity Miller, Niko Sünderhauf, Michael Milford, Feras Dayoub

    Abstract: Deployed into an open world, object detectors are prone to open-set errors, false positive detections of object classes not present in the training dataset. We propose GMM-Det, a real-time method for extracting epistemic uncertainty from object detectors to identify and reject open-set errors. GMM-Det trains the detector to produce a structured logit space that is modelled with class-specific Gaus… ▽ More

    Submitted 11 November, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Journal ref: IEEE Robotics and Automation Letters (January 2022), Volume 7, Issue 1, pages 215-222, ISSN 2377-3766

  28. arXiv:2101.00443  [pdf, ps, other

    cs.RO cs.CV cs.HC cs.LG

    Semantics for Robotic Map**, Perception and Interaction: A Survey

    Authors: Sourav Garg, Niko Sünderhauf, Feras Dayoub, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian Reid, Stephen Gould, Peter Corke, Michael Milford

    Abstract: For robots to navigate and interact more richly with the world around them, they will likely require a deeper understanding of the world in which they operate. In robotics and related research fields, the study of understanding is often referred to as semantics, which dictates what does the world "mean" to a robot, and is strongly tied to the question of how to represent that meaning. With humans… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

    Comments: 81 pages, 1 figure, published in Foundations and Trends in Robotics, 2020

    Journal ref: Foundations and Trends in Robotics: Vol. 8: No. 1-2, pp 1-224 (2020)

  29. arXiv:2012.12645  [pdf, other

    cs.CV

    SWA Object Detection

    Authors: Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sünderhauf

    Abstract: Do you want to improve 1.0 AP for your object detector without any inference cost and any change to your detector? Let us tell you such a recipe. It is surprisingly simple: train your detector for an extra 12 epochs using cyclical learning rates and then average these 12 checkpoints as your final detection model}. This potent recipe is inspired by Stochastic Weights Averaging (SWA), which is propo… ▽ More

    Submitted 11 March, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: 9 pages; polished

  30. arXiv:2011.07750  [pdf, other

    cs.CV

    Online Monitoring of Object Detection Performance During Deployment

    Authors: Quazi Marufur Rahman, Niko Sünderhauf, Feras Dayoub

    Abstract: During deployment, an object detector is expected to operate at a similar performance level reported on its testing dataset. However, when deployed onboard mobile robots that operate under varying and complex environmental conditions, the detector's performance can fluctuate and occasionally degrade severely without warning. Undetected, this can lead the robot to take unsafe and risky actions base… ▽ More

    Submitted 9 March, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: V2 with more experimental results and improved clarity of presentation

  31. arXiv:2009.08650  [pdf, other

    cs.CV

    Per-frame mAP Prediction for Continuous Performance Monitoring of Object Detection During Deployment

    Authors: Quazi Marufur Rahman, Niko Sünderhauf, Feras Dayoub

    Abstract: Performance monitoring of object detection is crucial for safety-critical applications such as autonomous vehicles that operate under varying and complex environmental conditions. Currently, object detectors are evaluated using summary metrics based on a single dataset that is assumed to be representative of all future deployment conditions. In practice, this assumption does not hold, and the perf… ▽ More

    Submitted 16 November, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  32. arXiv:2009.05246  [pdf, other

    cs.RO

    The Robotic Vision Scene Understanding Challenge

    Authors: David Hall, Ben Talbot, Suman Raj Bista, Haoyang Zhang, Rohan Smith, Feras Dayoub, Niko Sünderhauf

    Abstract: Being able to explore an environment and understand the location and type of all objects therein is important for indoor robotic platforms that must interact closely with humans. However, it is difficult to evaluate progress in this area due to a lack of standardized testing which is limited due to the need for active robot agency and perfect object ground-truth. To help provide a standard for tes… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

  33. arXiv:2008.13367  [pdf, other

    cs.CV

    VarifocalNet: An IoU-aware Dense Object Detector

    Authors: Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sünderhauf

    Abstract: Accurately ranking the vast number of candidate detections is crucial for dense object detectors to achieve high performance. Prior work uses the classification score or a combination of classification and predicted localization scores to rank candidates. However, neither option results in a reliable ranking, thus degrading detection performance. In this paper, we propose to learn an Iou-aware Cla… ▽ More

    Submitted 4 March, 2021; v1 submitted 31 August, 2020; originally announced August 2020.

    Comments: Accepted to CVPR 2021 as an oral

  34. arXiv:2008.00635  [pdf, other

    cs.RO

    BenchBot: Evaluating Robotics Research in Photorealistic 3D Simulation and on Real Robots

    Authors: Ben Talbot, David Hall, Haoyang Zhang, Suman Raj Bista, Rohan Smith, Feras Dayoub, Niko Sünderhauf

    Abstract: We introduce BenchBot, a novel software suite for benchmarking the performance of robotics research across both photorealistic 3D simulations and real robot platforms. BenchBot provides a simple interface to the sensorimotor capabilities of a robot when solving robotics research problems; an interface that is consistent regardless of whether the target platform is simulated or a real robot. In thi… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

    Comments: Future submission to RAL; software available at http://benchbot.org

  35. arXiv:2004.02434  [pdf, other

    cs.CV

    Class Anchor Clustering: a Loss for Distance-based Open Set Recognition

    Authors: Dimity Miller, Niko Sünderhauf, Michael Milford, Feras Dayoub

    Abstract: In open set recognition, deep neural networks encounter object classes that were unknown during training. Existing open set classifiers distinguish between known and unknown classes by measuring distance in a network's logit space, assuming that known classes cluster closer to the training data than unknown classes. However, this approach is applied post-hoc to networks trained with cross-entropy… ▽ More

    Submitted 2 March, 2021; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Published at 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

  36. arXiv:2003.05117  [pdf, other

    cs.RO cs.AI

    Multiplicative Controller Fusion: Leveraging Algorithmic Priors for Sample-efficient Reinforcement Learning and Safe Sim-To-Real Transfer

    Authors: Krishan Rana, Vibhavari Dasagi, Ben Talbot, Michael Milford, Niko Sünderhauf

    Abstract: Learning-based approaches often outperform hand-coded algorithmic solutions for many problems in robotics. However, learning long-horizon tasks on real robot hardware can be intractable, and transferring a learned policy from simulation to reality is still extremely challenging. We present a novel approach to model-free reinforcement learning that can leverage existing sub-optimal solutions as an… ▽ More

    Submitted 27 July, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Accepted for presentation at IROS2020. Project site available at https://sites.google.com/view/mcf-nav/home

  37. arXiv:2001.02366  [pdf, other

    cs.RO cs.CV

    What can robotics research learn from computer vision research?

    Authors: Peter Corke, Feras Dayoub, David Hall, John Skinner, Niko Sünderhauf

    Abstract: The computer vision and robotics research communities are each strong. However progress in computer vision has become turbo-charged in recent years due to big data, GPU computing, novel learning algorithms and a very effective research methodology. By comparison, progress in robotics seems slower. It is true that robotics came later to exploring the potential of learning -- the advantages over the… ▽ More

    Submitted 11 June, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: 15 pages, to appear in the proceeding of the International Symposium on Robotics Research (ISRR) 2019

  38. arXiv:1909.10972  [pdf, other

    cs.RO cs.LG

    Residual Reactive Navigation: Combining Classical and Learned Navigation Strategies For Deployment in Unknown Environments

    Authors: Krishan Rana, Ben Talbot, Vibhavari Dasagi, Michael Milford, Niko Sünderhauf

    Abstract: In this work we focus on improving the efficiency and generalisation of learned navigation strategies when transferred from its training environment to previously unseen ones. We present an extension of the residual reinforcement learning framework from the robotic manipulation literature and adapt it to the vast and unstructured environments that mobile robots can operate in. The concept is based… ▽ More

    Submitted 11 March, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Accepted as a conference paper at ICRA2020. Project site available at https://sites.google.com/view/srrn/home

  39. arXiv:1909.07376  [pdf, other

    cs.LG cs.RO

    Where are the Keys? -- Learning Object-Centric Navigation Policies on Semantic Maps with Graph Convolutional Networks

    Authors: Niko Sünderhauf

    Abstract: Emerging object-based SLAM algorithms can build a graph representation of an environment comprising nodes for robot poses and object landmarks. However, while this map will contain static objects such as furniture or appliances, many moveable objects (e.g. the car keys, the glasses, or a magazine), are not suitable as landmarks and will not be part of the map due to their non-static nature. We sho… ▽ More

    Submitted 20 January, 2021; v1 submitted 16 September, 2019; originally announced September 2019.

  40. arXiv:1903.07840  [pdf, other

    cs.RO cs.CV cs.LG

    The Probabilistic Object Detection Challenge

    Authors: John Skinner, David Hall, Haoyang Zhang, Feras Dayoub, Niko Sünderhauf

    Abstract: We introduce a new challenge for computer and robotic vision, the first ACRV Robotic Vision Challenge, Probabilistic Object Detection. Probabilistic object detection is a new variation on traditional object detection tasks, requiring estimates of spatial and semantic uncertainty. We extend the traditional bounding box format of object detection to express spatial uncertainty using gaussian distrib… ▽ More

    Submitted 7 April, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: 4 pages, workshop paper

  41. Did You Miss the Sign? A False Negative Alarm System for Traffic Sign Detectors

    Authors: Quazi Marufur Rahman, Niko Sünderhauf, Feras Dayoub

    Abstract: Object detection is an integral part of an autonomous vehicle for its safety-critical and navigational purposes. Traffic signs as objects play a vital role in guiding such systems. However, if the vehicle fails to locate any critical sign, it might make a catastrophic failure. In this paper, we propose an approach to identify traffic signs that have been mistakenly discarded by the object detector… ▽ More

    Submitted 15 March, 2019; originally announced March 2019.

    Comments: Submitted to the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

  42. arXiv:1902.07381  [pdf, other

    cs.RO

    Look No Deeper: Recognizing Places from Opposing Viewpoints under Varying Scene Appearance using Single-View Depth Estimation

    Authors: Sourav Garg, Madhu Babu V, Thanuja Dharmasiri, Stephen Hausler, Niko Suenderhauf, Swagat Kumar, Tom Drummond, Michael Milford

    Abstract: Visual place recognition (VPR) - the act of recognizing a familiar visual place - becomes difficult when there is extreme environmental appearance change or viewpoint change. Particularly challenging is the scenario where both phenomena occur simultaneously, such as when returning for the first time along a road at night that was previously traversed during the day in the opposite direction. While… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted for ICRA 2019

  43. arXiv:1811.10800  [pdf, other

    cs.CV

    Probabilistic Object Detection: Definition and Evaluation

    Authors: David Hall, Feras Dayoub, John Skinner, Haoyang Zhang, Dimity Miller, Peter Corke, Gustavo Carneiro, Anelia Angelova, Niko Sünderhauf

    Abstract: We introduce Probabilistic Object Detection, the task of detecting objects in images and accurately quantifying the spatial and semantic uncertainties of the detections. Given the lack of methods capable of assessing such probabilistic object detections, we present the new Probability-based Detection Quality measure (PDQ).Unlike AP-based measures, PDQ has no arbitrary thresholds and rewards spatia… ▽ More

    Submitted 30 January, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 21 pages, 25 figures, to appear in the proceedings of the winter conference on applications of computer vision WACV 2020

  44. arXiv:1809.07480  [pdf, other

    cs.LG stat.ML

    Sim-to-Real Transfer of Robot Learning with Variable Length Inputs

    Authors: Vibhavari Dasagi, Robert Lee, Serena Mou, Jake Bruce, Niko Sünderhauf, Jürgen Leitner

    Abstract: Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowledge. This results in prohibitively long training times for use on real-world robotic tasks. Existing algorithms capable of extracting task-level repr… ▽ More

    Submitted 8 October, 2019; v1 submitted 20 September, 2018; originally announced September 2018.

  45. arXiv:1809.06977  [pdf, other

    cs.RO

    An Orientation Factor for Object-Oriented SLAM

    Authors: Natalie Jablonsky, Michael Milford, Niko Sünderhauf

    Abstract: Current approaches to object-oriented SLAM lack the ability to incorporate prior knowledge of the scene geometry, such as the expected global orientation of objects. We overcome this limitation by proposing a geometric factor that constrains the global orientation of objects in the map, depending on the objects' semantics. This new geometric factor is a first example of how semantics can inform an… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: Submitted to ICRA 2019, under review

  46. arXiv:1809.06006  [pdf, other

    cs.CV

    Evaluating Merging Strategies for Sampling-based Uncertainty Techniques in Object Detection

    Authors: Dimity Miller, Feras Dayoub, Michael Milford, Niko Sünderhauf

    Abstract: There has been a recent emergence of sampling-based techniques for estimating epistemic uncertainty in deep neural networks. While these methods can be applied to classification or semantic segmentation tasks by simply averaging samples, this is not the case for object detection, where detection sample bounding boxes must be accurately associated and merged. A weak merging strategy can significant… ▽ More

    Submitted 6 March, 2019; v1 submitted 16 September, 2018; originally announced September 2018.

    Comments: to appear in IEEE International Conference on Robotics and Automation 2019 (ICRA 2019)

  47. arXiv:1807.05211  [pdf, other

    cs.RO

    Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

    Authors: Jake Bruce, Niko Sünderhauf, Piotr Mirowski, Raia Hadsell, Michael Milford

    Abstract: Model-free reinforcement learning has recently been shown to be effective at learning navigation policies from complex image input. However, these algorithms tend to require large amounts of interaction with the environment, which can be prohibitively costly to obtain on robots in the real world. We present an approach for efficiently learning goal-directed navigation policies on a mobile robot, f… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  48. arXiv:1804.09111  [pdf, other

    cs.RO cs.CV

    Structure Aware SLAM using Quadrics and Planes

    Authors: Mehdi Hosseinzadeh, Yasir Latif, Trung Pham, Niko Suenderhauf, Ian Reid

    Abstract: Simultaneous Localization And Map** (SLAM) is a fundamental problem in mobile robotics. While point-based SLAM methods provide accurate camera localization, the generated maps lack semantic information. On the other hand, state of the art object detection methods provide rich information about entities present in the scene from a single image. This work marries the two and proposes a method for… ▽ More

    Submitted 2 November, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Accepted to ACCV 2018

  49. arXiv:1804.06557  [pdf, other

    cs.RO

    The Limits and Potentials of Deep Learning for Robotics

    Authors: Niko Sünderhauf, Oliver Brock, Walter Scheirer, Raia Hadsell, Dieter Fox, Jürgen Leitner, Ben Upcroft, Pieter Abbeel, Wolfram Burgard, Michael Milford, Peter Corke

    Abstract: The application of deep learning in robotics leads to very specific problems and research questions that are typically not addressed by the computer vision and machine learning communities. In this paper we discuss a number of robotics-specific learning, reasoning, and embodiment challenges for deep learning. We explain the need for better evaluation metrics, highlight the importance and unique ch… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

  50. arXiv:1804.05526  [pdf, other

    cs.RO cs.CV

    LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics

    Authors: Sourav Garg, Niko Suenderhauf, Michael Milford

    Abstract: Human visual scene understanding is so remarkable that we are able to recognize a revisited place when entering it from the opposite direction it was first visited, even in the presence of extreme variations in appearance. This capability is especially apparent during driving: a human driver can recognize where they are when travelling in the reverse direction along a route for the first time, wit… ▽ More

    Submitted 26 May, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: Accepted for Robotics: Science and Systems (RSS) 2018. Source code now available at https://github.com/oravus/lostX