| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Monocular depth estimation | C3VD (test) | Abs Rel0.134 | 16 | |
| Depth estimation | C3VD | RMSE1.88 | 14 | |
| Metric Depth Estimation | C3VD (first split) | Delta1 Acc95.4 | 13 | |
| Novel View Synthesis | C3VD average across ten scenes | PSNR34.24 | 10 | |
| Camera Localization | C3VD Average v2 | RMSE8.13 | 9 | |
| Camera Localization | C3VD v2 (c2_transverse1_t1_v4) | RMSE10.47 | 9 | |
| Camera Localization | C3VD c1_sigmoid2_t4_v4 v2 | RMSE6.09 | 9 | |
| Camera Localization | C3VD c1_descending_t4_v4 v2 | RMSE6.81 | 9 | |
| Camera Localization | C3VD c1_sigmoid1_t4_v4 v2 | RMSE7.96 | 8 | |
| Camera Tracking | C3VD high-definition (test) | ATE (mm)0.32 | 8 | |
| Depth Reconstruction | C3VD high-definition (test) | RMSE (mm)1.88 | 8 | |
| Rendering | C3VD high-definition (test) | PSNR22.16 | 8 | |
| Rendering | C3VD Average v2 | PSNR26.33 | 7 | |
| Rendering | C3VD v2 (c2_transverse1_t1_v4) | PSNR25.53 | 7 | |
| Rendering | C3VD c1_sigmoid2_t4_v4 v2 | PSNR26.45 | 7 | |
| Rendering | C3VD c1_sigmoid1_t4_v4 v2 | PSNR28.73 | 7 | |
| Rendering | C3VD v2 (c1_descending_t4_v4) | PSNR24.62 | 7 | |
| Camera Pose Estimation | C3VD (test) | ATE1.2533 | 6 | |
| Monocular Depth Estimation | C3VD (split 2) | AbsRel0.049 | 6 | |
| Endoscopic SLAM | C3VD | Tracking Time/Frame8.5 | 6 | |
| Camera Tracking | C3VD average across ten scenes | ATE (mm)0.23 | 4 | |
| Depth Estimation | C3VD (average across ten scenes) | RMSE (mm)1.54 | 4 | |
| Uneven illumination scene reconstruction | C3VD Sigmoid t2 a (test) | PSNR33.13 | 4 | |
| Uneven illumination scene reconstruction | C3VD Cecum t2 b (test) | PSNR33.55 | 4 |