Xiong et al., 2025

PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation

Document ID: 10827060055711790480
Author: Xiong X; Beltran A; Choi J; Niethammer M; Sengupta R
Publication year: 2025
Publication venue: arXiv preprint arXiv:2504.17067

Snippet

Accurate depth estimation enhances endoscopy navigation and diagnostics, but obtaining ground-truth depth in clinical settings is challenging. Synthetic datasets are often used for training, yet the domain gap limits generalization to real data. We propose a novel image-to …

Classifications

- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T7/00—Image analysis
      - G06T2207/00—Indexing scheme for image analysis or image enhancement
        - G06T2207/10—Image acquisition modality
          - G06T2207/10024—Color image
        - G06T2207/30—Subject of image; Context of image processing
          - G06T2207/30004—Biomedical image processing
      - G06T15/00—3D [Three Dimensional] image rendering
        - G06T15/50—Lighting effects
      - G06T11/00—2D [Two Dimensional] image generation
      - G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
        - G06T5/006—Geometric correction
      - G06T2200/00—Indexing scheme for image data processing or generation, in general

Similar Documents

Publication	Publication Date	Title
Wu et al.	2024	Multi-channel optimization generative model for stable ultra-sparse-view CT reconstruction
Bonilla et al.	2024	Gaussian pancakes: geometrically-regularized 3d gaussian splatting for realistic endoscopic reconstruction
Chen et al.	2025	Generalizable human gaussians from single-view image
Ling et al.	2022	Semantically disentangled variational autoencoder for modeling 3D facial details
CN120655691A (en)	2025-09-16	Endoscope monocular image depth estimation method and system
Wu et al.	2025	Single-step latent diffusion for underwater image restoration
Liang et al.	2025	Flow-Anything: Learning real-world optical flow estimation from large-scale single-view images
Li et al.	2025	Collaborative surgical instrument segmentation for monocular depth estimation in minimally invasive surgery
Zhang et al.	2022	Spatiotemporally enhanced photometric loss for self-supervised monocular depth estimation
EP4401040B1 (en)	2026-03-18	Medical image rendering technique
Cho et al.	2025	DogRecon: Canine Prior-Guided Animatable 3D Gaussian Dog Reconstruction From A Single Image
Wang et al.	2024	Structure-preserving image translation for depth estimation in colonoscopy
Kaleta et al.	2025	Pr-endo: Physically based relightable gaussian splatting for endoscopy
Xie et al.	2026	Dvg-diffusion: Dual-view guided diffusion model for ct reconstruction from x-rays
Sun et al.	2023	Learning monocular regression of 3d people in crowds via scene-aware blending and de-occlusion
CN119991937B (en)	2025-11-25	A single-view 3D human body reconstruction method based on Gaussian surface elements
Jeong et al.	2021	Depth estimation of endoscopy using sim-to-real transfer
CN120612734A (en)	2025-09-09	Multi-hypothesis 3D human pose estimation method based on bidirectional state space diffusion model
Li et al.	2024	Multiscale residual and attention guidance for low-light image enhancement in visual SLAM
Recasens Lafuente et al.	2023	The drunkard’s odometry: Estimating camera motion in deforming scenes
Li et al.	2025	Realistic and controllable 3d gaussian-guided object editing for driving video generation
CN119693582A (en)	2025-03-25	Three-dimensional reconstruction method, device, equipment and storage medium
Teng et al.	2022	Blind face restoration via multi-prior collaboration and adaptive feature fusion
Farooq et al.	2020	Generating thermal image data samples using 3D facial modelling techniques and deep learning methodologies