| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Low-shot recognition | Toys4K (test) | Accuracy84.13 | 72 | |
| 3D Generation | Toys4K | CLIP Score92.97 | 16 | |
| Mesh Reconstruction | Toys4K | Chamfer Distance0.033 | 16 | |
| Low-shot recognition | Toys4k multi-object setting (test) | LSA60.49 | 15 | |
| Text-to-3D | Toys4k | CLIP Score29.3 | 14 | |
| Refining VFM-derived artifacts | Toys4k | mIoU44.6 | 13 | |
| Text-to-3D Generation | Toys4K CL (Base) | CLIP Similarity29.6 | 12 | |
| 3D Geometry Synthesis | Toys4K (test) | Throughput (iter/s)0.6426 | 12 | |
| Text-to-3D Generation | Toys4K-CL Forgetting | CLIP Similarity17.36 | 10 | |
| Text-to-3D Generation | Toys4K CL (All) | CLIP Similarity29.51 | 10 | |
| Text-to-3D Generation | Toys4K CL (Novel) | CLIP Similarity29.86 | 10 | |
| Image-to-3D | Toys4k | FD (Inception)6.216 | 10 | |
| Multi-object Category Recognition (Categ-MObj) | Toys4k multi-object setting | LSA60.49 | 10 | |
| Shape-conditioned 3D object generation (geometric primitives) | Toys4K generalization (test) | CD4.89 | 9 | |
| Mesh Generation | Toys4k (Artist Meshes) | Chamfer Distance (CD)0.038 | 7 | |
| Mesh Reconstruction | Toys4k Artist Meshes (test) | Chamfer Distance (CD)0.038 | 7 | |
| Low-shot object recognition | Toys4k Inst-SObj | Accuracy96.5 | 6 | |
| Low-shot recognition | Toys4k Categ-SObj 1.0 (test) | Accuracy79.69 | 6 | |
| Low-shot recognition | Toys4k Inst-SObj 1.0 (test) | Accuracy96.5 | 6 | |
| Multi-object Category Recognition with Support Assignment (Categ-MObj-SuppAssign) | Toys4k multi-object setting | LSA45.91 | 5 | |
| Camera Pose Estimation | Toys4k | ATE0.0255 | 3 | |
| Depth Estimation | Toys4k | Abs Rel0.0039 | 3 | |
| Text-to-3D generation | Toys4K | CLIP Score0.299 | 3 | |
| Low-Shot Mutual Exclusivity | Toys4k (test) | LSA0.477 | 3 | |
| Low-shot recognition | Toys4k LSME variant | LSA47.7 | 3 |