PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation
Xiong et al., 2025
- Document ID
- 10827060055711790480
- Author
- Xiong X
- Beltran A
- Choi J
- Niethammer M
- Sengupta R
- Publication year
- 2025
- Publication venue
- arXiv preprint arXiv:2504.17067
Snippet
Accurate depth estimation enhances endoscopy navigation and diagnostics, but obtaining ground-truth depth in clinical settings is challenging. Synthetic datasets are often used for training, yet the domain gap limits generalization to real data. We propose a novel image-to …
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
  - G06T7/00—Image analysis
  - G06T2207/00—Indexing scheme for image analysis or image enhancement
    - G06T2207/10—Image acquisition modality
      - G06T2207/10024—Color image
    - G06T2207/30—Subject of image; Context of image processing
      - G06T2207/30004—Biomedical image processing
  - G06T15/00—3D [Three Dimensional] image rendering
    - G06T15/50—Lighting effects
  - G06T11/00—2D [Two Dimensional] image generation
  - G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
    - G06T5/006—Geometric correction
  - G06T2200/00—Indexing scheme for image data processing or generation, in general
Similar Documents
| Publication | Title |
|---|---|
| Wu et al. | Multi-channel optimization generative model for stable ultra-sparse-view CT reconstruction |
| Bonilla et al. | Gaussian pancakes: geometrically-regularized 3D Gaussian splatting for realistic endoscopic reconstruction |
| Chen et al. | Generalizable human Gaussians from single-view image |
| Ling et al. | Semantically disentangled variational autoencoder for modeling 3D facial details |
| Xiong et al. | PPS-Ctrl: Controllable Sim-to-Real Translation for Colonoscopy Depth Estimation |
| CN120655691A (en) | Endoscope monocular image depth estimation method and system |
| Wu et al. | Single-step latent diffusion for underwater image restoration |
| Liang et al. | Flow-Anything: Learning real-world optical flow estimation from large-scale single-view images |
| Li et al. | Collaborative surgical instrument segmentation for monocular depth estimation in minimally invasive surgery |
| Zhang et al. | Spatiotemporally enhanced photometric loss for self-supervised monocular depth estimation |
| EP4401040B1 (en) | Medical image rendering technique |
| Cho et al. | DogRecon: Canine Prior-Guided Animatable 3D Gaussian Dog Reconstruction From A Single Image |
| Wang et al. | Structure-preserving image translation for depth estimation in colonoscopy |
| Kaleta et al. | PR-ENDO: Physically based relightable Gaussian splatting for endoscopy |
| Xie et al. | DVG-Diffusion: Dual-view guided diffusion model for CT reconstruction from X-rays |
| Sun et al. | Learning monocular regression of 3D people in crowds via scene-aware blending and de-occlusion |
| CN119991937B (en) | A single-view 3D human body reconstruction method based on Gaussian surface elements |
| Jeong et al. | Depth estimation of endoscopy using sim-to-real transfer |
| CN120612734A (en) | Multi-hypothesis 3D human pose estimation method based on bidirectional state space diffusion model |
| Li et al. | Multiscale residual and attention guidance for low-light image enhancement in visual SLAM |
| Recasens Lafuente et al. | The drunkard’s odometry: Estimating camera motion in deforming scenes |
| Li et al. | Realistic and controllable 3D Gaussian-guided object editing for driving video generation |
| CN119693582A (en) | Three-dimensional reconstruction method, device, equipment and storage medium |
| Teng et al. | Blind face restoration via multi-prior collaboration and adaptive feature fusion |
| Farooq et al. | Generating thermal image data samples using 3D facial modelling techniques and deep learning methodologies |