• LOGIN
    Login with username and password
Repository logo

BORIS Portal

Bern Open Repository and Information System

  • Publications
  • Theses
  • Research Data
  • Projects
  • Organizations
  • Researchers
  • More
  • Collections
  • Statistics
  • LOGIN
    Login with username and password
Repository logo
Unibern.ch
  1. Home
  2. Publications
  3. Learning how to robustly estimate camera pose in endoscopic videos.
 

Learning how to robustly estimate camera pose in endoscopic videos.

Options
  • Details
  • Files
BORIS DOI
10.48350/182594
Publisher DOI
10.1007/s11548-023-02919-w
PubMed ID
37184768
Description
PURPOSE

Surgical scene understanding plays a critical role in the technology stack of tomorrow's intervention-assisting systems in endoscopic surgeries. For this, tracking the endoscope pose is a key component, but remains challenging due to illumination conditions, deforming tissues and the breathing motion of organs.

METHOD

We propose a solution for stereo endoscopes that estimates depth and optical flow to minimize two geometric losses for camera pose estimation. Most importantly, we introduce two learned adaptive per-pixel weight mappings that balance contributions according to the input image content. To do so, we train a Deep Declarative Network to take advantage of the expressiveness of deep learning and the robustness of a novel geometric-based optimization approach. We validate our approach on the publicly available SCARED dataset and introduce a new in vivo dataset, StereoMIS, which includes a wider spectrum of typically observed surgical settings.

RESULTS

Our method outperforms state-of-the-art methods on average and more importantly, in difficult scenarios where tissue deformations and breathing motion are visible. We observed that our proposed weight mappings attenuate the contribution of pixels on ambiguous regions of the images, such as deforming tissues.

CONCLUSION

We demonstrate the effectiveness of our solution to robustly estimate the camera pose in challenging endoscopic surgical scenes. Our contributions can be used to improve related tasks like simultaneous localization and mapping (SLAM) or 3D reconstruction, therefore advancing surgical scene understanding in minimally invasive surgery.
Date of Publication
2023-07
Publication Type
Article
Subject(s)
600 Technology > 610 Medicine & health
500 Science > 570 Life sciences; biology
Keyword(s)
Camera pose estimation Deep declarative network Endoscopic surgery
Language(s)
en
Contributor(s)
Hayoz, Michel
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Hahne, Christopherorcid-logo
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Gallardo, Mathiasorcid-logo
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Candinas, Daniel
Universitätsklinik für Viszerale Chirurgie und Medizin - Viszeral- und Transplantationschirurgie
Kurmann, Thomas
Allan, Maximilian
Sznitman, Raphaelorcid-logo
ARTORG Center for Biomedical Engineering Research
Additional Credits
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Universitätsklinik für Viszerale Chirurgie und Medizin - Viszeral- und Transplantationschirurgie
ARTORG Center for Biomedical Engineering Research
Series
International journal of computer assisted radiology and surgery
Publisher
Springer
ISSN
1861-6429
Access(Rights)
open.access
Show full item
BORIS Portal
Bern Open Repository and Information System
Build: dd892c [ 9.04. 8:30]
Explore
  • Projects
  • Funding
  • Publications
  • Research Data
  • Organizations
  • Researchers
  • Audiovisual Material
  • Software & other digital items
  • Events
More
  • About BORIS Portal
  • Send Feedback
  • Cookie settings
  • Service Policy
Follow us on
  • Mastodon
  • YouTube
  • LinkedIn
UniBe logo