Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries

Viviers, Christiaan G. A.; Filatova, Lena; Termeer, Maurice; de With, Peter H. N.; van der Sommen, Fons

doi:10.1109/TIP.2024.3378469

arXiv:2405.11677 (cs)

[Submitted on 19 May 2024]

Abstract: Accurate 6-DoF pose estimation of surgical instruments during minimally invasive surgeries can substantially improve treatment strategies and eventual surgical outcomes. Existing deep learning methods have achieved accurate results, but they require custom approaches for each object and laborious setup and training environments, often stretching to extensive simulations, whilst lacking real-time computation. We propose a general-purpose approach to data acquisition for 6-DoF pose estimation tasks in X-ray systems, a novel, general-purpose YOLOv5-6D pose architecture for accurate and fast object pose estimation, and a complete method for surgical screw pose estimation from a monocular cone-beam X-ray image that takes the acquisition geometry into account. The proposed YOLOv5-6D pose model achieves competitive results on public benchmarks whilst being considerably faster, at 42 FPS on GPU. In addition, the method generalizes across varying X-ray acquisition geometry and semantic image complexity, enabling accurate pose estimation over different domains. Finally, the proposed approach is tested on bone-screw pose estimation for computer-aided guidance during spine surgeries. The model achieves 92.41% on the 0.1 ADD-S metric, demonstrating a promising approach for enhancing surgical precision and patient outcomes. The code for YOLOv5-6D is publicly available at this https URL
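The 0.1 ADD-S score quoted in the abstract is not defined on this page. As commonly used in the 6-DoF pose literature, ADD-S is the mean distance from each model point under the ground-truth pose to its nearest model point under the predicted pose, and a prediction counts as correct when that distance is below 10% of the object diameter. The following is a minimal NumPy sketch under that assumption; the function names `add_s` and `add_s_accuracy` are hypothetical and are not taken from the YOLOv5-6D code, and the paper's exact evaluation protocol may differ.

```python
import numpy as np


def add_s(points, R_gt, t_gt, R_pred, t_pred):
    """Mean closest-point distance between the model under two poses.

    points: (N, 3) model points in the object frame.
    R_*: (3, 3) rotation matrices, t_*: (3,) translations.
    Assumes the common ADD-S definition, not necessarily the paper's exact one.
    """
    gt = points @ R_gt.T + t_gt        # model under the ground-truth pose
    pred = points @ R_pred.T + t_pred  # model under the predicted pose
    # Pairwise distances, shape (N, N): row i holds distances from gt[i] to every pred[j].
    dists = np.linalg.norm(gt[:, None, :] - pred[None, :, :], axis=2)
    # For each ground-truth point take its nearest predicted point, then average.
    return float(dists.min(axis=1).mean())


def add_s_accuracy(errors, diameter, threshold=0.1):
    """Fraction of samples whose ADD-S error is below threshold * object diameter."""
    errors = np.asarray(errors, dtype=float)
    return float((errors < threshold * diameter).mean())
```

Because each point is matched to its nearest neighbour rather than to its own counterpart, a pose that differs from the ground truth only by a symmetry of the object yields an error near zero, which is why this variant is often preferred for symmetric objects.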

Comments:	Early author version of paper. Refer to the full paper at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2405.11677 [cs.CV]
	(or arXiv:2405.11677v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.11677
Journal reference:	IEEE Transactions on Image Processing (2024) (Volume: 33) Page(s): 2462 - 2476
Related DOI:	https://doi.org/10.1109/TIP.2024.3378469

Submission history

From: Christiaan Viviers
[v1] Sun, 19 May 2024 21:35:12 UTC (10,586 KB)
