Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Predictive Inference on L4 Out-of-Distribution (test)
Loading...
83.3
Accuracy
LLaTiSA (L1→L2→L3→L4)
39.308
50.729
62.15
73.571
Apr 19, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
LLaTiSA (L1→L2→L3→L4)
Modality=Vision (plot...
2026.04
83.3
Claude-3.5-Sonnet
Modality=Text (w/o index)
2026.04
82.2
GPT-4.1
Modality=Text (w/o index)
2026.04
79.1
GPT-4o
Modality=Vision (plot...
2026.04
78.3
GPT-4o
Modality=Text (w/o index)
2026.04
75.6
Qwen3-8B
Modality=Text (w/o index)
2026.04
67.1
LLaTiSA (L1→L2→L3)
Modality=Vision (plot...
2026.04
54.2
Qwen3-VL-8B
Modality=Vision (plot...
2026.04
42.1
LLaMA3.1-8B
Modality=Text (w/o index)
2026.04
41
Feedback
Search any
task
Search any
task