Skip to main content

Showing 1–13 of 13 results for author: Mordohai, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07225  [pdf, other

    cs.RO

    Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints

    Authors: Weihan Wang, Chieh Chou, Ganesh Sevagamoorthy, Kevin Chen, Zheng Chen, Ziyue Feng, Youjie Xia, Feiyang Cai, Yi Xu, Philippos Mordohai

    Abstract: We propose an accurate and robust initialization approach for stereo visual-inertial SLAM systems. Unlike the current state-of-the-art method, which heavily relies on the accuracy of a pure visual SLAM system to estimate inertial variables without updating camera poses, potentially compromising accuracy and robustness, our approach offers a different solution. We realize the crucial impact of prec… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2308.08715  [pdf, other

    cs.CV

    V-FUSE: Volumetric Depth Map Fusion with Long-Range Constraints

    Authors: Nathaniel Burgdorfer, Philippos Mordohai

    Abstract: We introduce a learning-based depth map fusion framework that accepts a set of depth and confidence maps generated by a Multi-View Stereo (MVS) algorithm as input and improves them. This is accomplished by integrating volumetric visibility constraints that encode long-range surface relationships across different views into an end-to-end trainable architecture. We also introduce a depth search wind… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  3. arXiv:2308.02670  [pdf, other

    cs.RO cs.CV

    EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems

    Authors: Weihan Wang, Jiani Li, Yuhang Ming, Philippos Mordohai

    Abstract: Visual-inertial initialization can be classified into joint and disjoint approaches. Joint approaches tackle both the visual and the inertial parameters together by aligning observations from feature-bearing points based on IMU integration then use a closed-form solution with visual and acceleration observations to find initial velocity and gravity. In contrast, disjoint approaches independently s… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  4. arXiv:2304.02704  [pdf, other

    cs.CV cs.RO

    Real-Time Dense 3D Map** of Underwater Environments

    Authors: Weihan Wang, Bharat Joshi, Nathaniel Burgdorfer, Konstantinos Batsos, Alberto Quattrini Li, Philippos Mordohai, Ioannis Rekleitis

    Abstract: This paper addresses real-time dense 3D reconstruction for a resource-constrained Autonomous Underwater Vehicle (AUV). Underwater vision-guided operations are among the most challenging as they combine 3D motion in the presence of external forces, limited visibility, and absence of global positioning. Obstacle avoidance and effective path planning require online dense reconstructions of the enviro… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  5. arXiv:2304.00152  [pdf, other

    cs.CV

    Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation

    Authors: Liyan Chen, Weihan Wang, Philippos Mordohai

    Abstract: We present a new loss function for joint disparity and uncertainty estimation in deep stereo matching. Our work is motivated by the need for precise uncertainty estimates and the observation that multi-task learning often leads to improved performance in all tasks. We show that this can be achieved by requiring the distribution of uncertainty to match the distribution of disparity errors via a KL… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: CVPR 2023

    MSC Class: 65D19

  6. Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications

    Authors: Tejas Mane, Aylar Bayramova, Kostas Daniilidis, Philippos Mordohai, Elena Bernardis

    Abstract: We address the problem of estimating the shape of a person's head, defined as the geometry of the complete head surface, from a video taken with a single moving camera, and determining the alignment of the fitted 3D head for all video frames, irrespective of the person's pose. 3D head reconstructions commonly tend to focus on perfecting the face reconstruction, leaving the scalp to a statistical a… ▽ More

    Submitted 7 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  7. arXiv:2010.07350  [pdf, other

    cs.CV

    Do End-to-end Stereo Algorithms Under-utilize Information?

    Authors: Changjiang Cai, Philippos Mordohai

    Abstract: Deep networks for stereo matching typically leverage 2D or 3D convolutional encoder-decoder architectures to aggregate cost and regularize the cost volume for accurate disparity estimation. Due to content-insensitive convolutions and down-sampling and up-sampling operations, these cost aggregation mechanisms do not take full advantage of the information available in the images. Disparity maps suff… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 13 pages, 10 figures, International Conference on 3D Vision (3DV'2020)

  8. arXiv:2010.07347  [pdf, other

    cs.CV

    Matching-space Stereo Networks for Cross-domain Generalization

    Authors: Changjiang Cai, Matteo Poggi, Stefano Mattoccia, Philippos Mordohai

    Abstract: End-to-end deep networks represent the state of the art for stereo matching. While excelling on images framing environments similar to the training set, major drops in accuracy occur in unseen domains (e.g., when moving from synthetic to real scenes). In this paper we introduce a novel family of architectures, namely Matching-Space Networks (MS-Nets), with improved generalization properties. By re… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 14 pages, 8 figures, International Conference on 3D Vision (3DV'2020), Github code at https://github.com/ccj5351/MS-Nets

  9. arXiv:2004.08566  [pdf, other

    cs.CV

    On the Synergies between Machine Learning and Binocular Stereo for Depth Estimation from Images: a Survey

    Authors: Matteo Poggi, Fabio Tosi, Konstantinos Batsos, Philippos Mordohai, Stefano Mattoccia

    Abstract: Stereo matching is one of the longest-standing problems in computer vision with close to 40 years of studies and research. Throughout the years the paradigm has shifted from local, pixel-level decision to various forms of discrete and continuous optimization to data-driven, learning-based methods. Recently, the rise of machine learning and the rapid proliferation of deep learning enhanced stereo m… ▽ More

    Submitted 31 March, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: Accepted to TPAMI. Paper version of our CVPR 2019 tutorial: "Learning-based depth estimation from stereo and monocular images: successes, limitations and future challenges" (https://sites.google.com/view/cvpr-2019-depth-from-image/home)

  10. arXiv:1905.02553  [pdf, other

    cs.CV cs.RO

    Oriented Point Sampling for Plane Detection in Unorganized Point Clouds

    Authors: Bo Sun, Philippos Mordohai

    Abstract: Plane detection in 3D point clouds is a crucial pre-processing step for applications such as point cloud segmentation, semantic map** and SLAM. In contrast to many recent plane detection methods that are only applicable on organized point clouds, our work is targeted to unorganized point clouds that do not permit a 2D parametrization. We compare three methods for detecting planes in point clouds… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: 7 pages, 3 figures, 2019 IEEE International Conference on Robotics and Automation (Accepted)

  11. arXiv:1804.01967  [pdf, other

    cs.CV

    CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation

    Authors: Konstantinos Batsos, Changjiang Cai, Philippos Mordohai

    Abstract: Recently, there has been a paradigm shift in stereo matching with learning-based methods achieving the best results on all popular benchmarks. The success of these methods is due to the availability of training data with ground truth; training learning-based systems on these datasets has allowed them to surpass the accuracy of conventional approaches based on heuristics and assumptions. Many of th… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

    Comments: Accepted to Computer Vision and Pattern Recognition (CVPR) 2018

  12. arXiv:1706.01966  [pdf, other

    cs.RO

    Controlling a Robotic Stereo Camera Under Image Quantization Noise

    Authors: Charles Freundlich, Yan Zhang, Alex Zihao Zhu, Philippos Mordohai, Michael M. Zavlanos

    Abstract: In this paper, we address the problem of controlling a mobile stereo camera under image quantization noise. Assuming that a pair of images of a set of targets is available, the camera moves through a sequence of Next-Best-Views (NBVs), i.e., a sequence of views that minimize the trace of the targets' cumulative state covariance, constructed using a realistic model of the stereo rig that captures i… ▽ More

    Submitted 13 January, 2018; v1 submitted 6 June, 2017; originally announced June 2017.

    Comments: International Journal of Robotics Research, October 2017

  13. arXiv:1312.6826  [pdf, other

    cs.CV

    3D Interest Point Detection via Discriminative Learning

    Authors: Leizer Teran, Philippos Mordohai

    Abstract: The task of detecting the interest points in 3D meshes has typically been handled by geometric methods. These methods, while greatly describing human preference, can be ill-equipped for handling the variety and subjectivity in human responses. Different tasks have different requirements for interest point detection; some tasks may necessitate high precision while other tasks may require high recal… ▽ More

    Submitted 24 December, 2013; originally announced December 2013.