| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Captioning | IIW-400 | Precision84.2 | 40 | |
| Image-Text Retrieval | IIW (test) | Recall@177.9 | 21 | |
| Albedo Estimation | IIW v1.1 (test) | WHDR 10%12.8 | 11 | |
| Image-to-Text Retrieval | IIW (It Is What it is) | R@199.8 | 9 | |
| Intrinsic Image Decomposition | IIW (test) | WHDR11.9 | 9 | |
| Zero-shot Image-Text Retrieval | IIW | Zero-shot Accuracy (IIW)81.8 | 7 | |
| Intrinsic Decomposition | IIW 5 (test) | WHDR12 | 6 | |
| Text-to-Image Retrieval | IIW | Score99.7 | 5 | |
| Text-to-Image Retrieval | IIW (It Is What it is) | R@197.4 | 4 |