Xiong et al., 2025 - Google Patents

PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation

Document ID
10827060055711790480
Author
Xiong X
Beltran A
Choi J
Niethammer M
Sengupta R
Publication year
2025
Publication venue
arXiv preprint arXiv:2504.17067

Snippet

Accurate depth estimation enhances endoscopy navigation and diagnostics, but obtaining ground-truth depth in clinical settings is challenging. Synthetic datasets are often used for training, yet the domain gap limits generalization to real data. We propose a novel image-to …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10024 Color image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30004 Biomedical image processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00 3D [Three Dimensional] image rendering
    • G06T15/50 Lighting effects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
    • G06T5/006 Geometric correction
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00 Indexing scheme for image data processing or generation, in general

Similar Documents

Publication Publication Date Title
Wu et al. Multi-channel optimization generative model for stable ultra-sparse-view CT reconstruction
Bonilla et al. Gaussian pancakes: geometrically-regularized 3d gaussian splatting for realistic endoscopic reconstruction
Chen et al. Generalizable human gaussians from single-view image
Ling et al. Semantically disentangled variational autoencoder for modeling 3D facial details
Xiong et al. PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation
CN120655691A (en) Endoscope monocular image depth estimation method and system
Wu et al. Single-step latent diffusion for underwater image restoration
Liang et al. Flow-Anything: Learning real-world optical flow estimation from large-scale single-view images
Li et al. Collaborative surgical instrument segmentation for monocular depth estimation in minimally invasive surgery
Zhang et al. Spatiotemporally enhanced photometric loss for self-supervised monocular depth estimation
EP4401040B1 (en) Medical image rendering technique
Cho et al. DogRecon: Canine Prior-Guided Animatable 3D Gaussian Dog Reconstruction From A Single Image
Wang et al. Structure-preserving image translation for depth estimation in colonoscopy
Kaleta et al. Pr-endo: Physically based relightable gaussian splatting for endoscopy
Xie et al. Dvg-diffusion: Dual-view guided diffusion model for ct reconstruction from x-rays
Sun et al. Learning monocular regression of 3d people in crowds via scene-aware blending and de-occlusion
CN119991937B (en) A single-view 3D human body reconstruction method based on Gaussian surface elements
Jeong et al. Depth estimation of endoscopy using sim-to-real transfer
CN120612734A (en) Multi-hypothesis 3D human pose estimation method based on bidirectional state space diffusion model
Li et al. Multiscale residual and attention guidance for low-light image enhancement in visual SLAM
Recasens Lafuente et al. The drunkard’s odometry: Estimating camera motion in deforming scenes
Li et al. Realistic and controllable 3d gaussian-guided object editing for driving video generation
CN119693582A (en) Three-dimensional reconstruction method, device, equipment and storage medium
Teng et al. Blind face restoration via multi-prior collaboration and adaptive feature fusion
Farooq et al. Generating thermal image data samples using 3D facial modelling techniques and deep learning methodologies