Skip to main content

Showing 1–6 of 6 results for author: Pizer, S M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17915  [pdf, other

    cs.CV

    Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

    Authors: Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang, Inbar Fried, Stephen M. Pizer, Marc Niethammer, Roni Sengupta

    Abstract: Monocular depth estimation in endoscopy videos can enable assistive and robotic surgery to obtain better coverage of the organ and detection of various health issues. Despite promising progress on mainstream, natural image depth estimation, techniques perform poorly on endoscopy images due to a lack of strong geometric features and challenging illumination effects. In this paper, we utilize the ph… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 26 pages, 7 tables, 7 figures

  2. arXiv:2303.07264  [pdf, other

    cs.CV cs.LG

    A Surface-normal Based Neural Framework for Colonoscopy Reconstruction

    Authors: Shuxian Wang, Yubo Zhang, Sarah K. McGill, Julian G. Rosenman, Jan-Michael Frahm, Soumyadip Sengupta, Stephen M. Pizer

    Abstract: Reconstructing a 3D surface from colonoscopy video is challenging due to illumination and reflectivity variation in the video frame that can cause defective shape predictions. Aiming to overcome this challenge, we utilize the characteristics of surface normal vectors and develop a two-step neural framework that significantly improves the colonoscopy reconstruction quality. The normal-based depth i… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted at IPMI 2023; first two authors contributed equally

  3. arXiv:2111.10371  [pdf, other

    eess.IV cs.CV cs.LG

    ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

    Authors: Yubo Zhang, Jan-Michael Frahm, Samuel Ehrenstein, Sarah K. McGill, Julian G. Rosenman, Shuxian Wang, Stephen M. Pizer

    Abstract: One of the key elements of reconstructing a 3D mesh from a monocular video is generating every frame's depth map. However, in the application of colonoscopy video reconstruction, producing good-quality depth estimation is challenging. Neural networks can be easily fooled by photometric distractions or fail to capture the complex shape of the colon surface, predicting defective shapes that result i… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: 13 pages, 5 figures

  4. arXiv:2103.10310  [pdf, other

    cs.CV cs.LG

    Lighting Enhancement Aids Reconstruction of Colonoscopic Surfaces

    Authors: Yubo Zhang, Shuxian Wang, Ruibin Ma, Sarah K. McGill, Julian G. Rosenman, Stephen M. Pizer

    Abstract: High screening coverage during colonoscopy is crucial to effectively prevent colon cancer. Previous work has allowed alerting the doctor to unsurveyed regions by reconstructing the 3D colonoscopic surface from colonoscopy videos in real-time. However, the lighting inconsistency of colonoscopy videos can cause a key component of the colonoscopic reconstruction system, the SLAM optimization, to fail… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: Accepted at IPMI 2021 (The 27th international conference on Information Processing in Medical Imaging)

  5. arXiv:1904.07087  [pdf, other

    cs.CV

    Recurrent Neural Network for (Un-)supervised Learning of Monocular VideoVisual Odometry and Depth

    Authors: Rui Wang, Stephen M. Pizer, Jan-Michael Frahm

    Abstract: Deep learning-based, single-view depth estimation methods have recently shown highly promising results. However, such methods ignore one of the most important features for determining depth in the human vision system, which is motion. We propose a learning-based, multi-view dense depth map and odometry estimation method that uses Recurrent Neural Networks (RNN) and trains utilizing multi-view imag… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

  6. arXiv:1805.06558  [pdf, other

    cs.CV

    Recurrent Neural Network for Learning DenseDepth and Ego-Motion from Video

    Authors: Rui Wang, Jan-Michael Frahm, Stephen M. Pizer

    Abstract: Learning-based, single-view depth estimation often generalizes poorly to unseen datasets. While learning-based, two-frame depth estimation solves this problem to some extent by learning to match features across frames, it performs poorly at large depth where the uncertainty is high. There exists few learning-based, multi-view depth estimation methods. In this paper, we present a learning-based, mu… ▽ More

    Submitted 16 May, 2018; originally announced May 2018.