| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Hallucination Detection | Custom Dataset | F1 Score: 72.9 | 15 |
| Belief Dynamics Prediction | Custom Dataset Belief Dynamics 1.0 (test) | Macro-Avg Precision (m1): 39.7 | 9 |
| Grounding | Custom Dataset | mIoU: 25 | 8 |
| Omission Detection | Custom Dataset | Accuracy: 64.5 | 7 |
| Image Classification | Custom Dataset 3 | Accuracy: 0.9498 | 7 |
| Image Classification | Custom Dataset 2 | Accuracy: 98.66 | 7 |
| Self-Reenactment | Custom Dataset (test) | PSNR: 33.451 | 6 |
| Cross-Object Reenactment | Custom Dataset (Ours) (test) | Hand Fidelity: 99.4 | 6 |
| Change Detection | Custom Dataset Lab | Scan-wise IoU: 76.7 | 5 |
| Change Detection | Custom Dataset Const-2F | Scan-wise IoU: 48.6 | 5 |
| Change Detection | Custom Dataset Const-1F | Scan-wise IoU: 72.8 | 5 |
| 3D Velocity Estimation | Custom Dataset (Scene 1) | AVE [m/s]: 0.22 | 4 |
| Single-Concept Generation | Custom Single-Concept Dataset (test) | S^T_CLIP Score: 35.22 | 4 |
| Stroke Guided Image Synthesis | Custom dataset 800 image-prompt pairs | F(x, y): 88.93 | 4 |
| Autonomous Driving | Custom Dataset Dolphins | Metric: - | 0 |
| Zero-day Ransomware Detection | Custom dataset Autoencoder | Accuracy: - | 0 |