Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language-conditioned visual reasoning on VLABench (test)

76Precision Score (Toy)

pi_0

51.0457.526470.48Nov 26, 2025
Updated 24d ago

Evaluation Results

MethodLinks
2025.11
767216108.1436.43
2025.11
76651277.332551.07
2025.11
7268264233.3348.27
2025.11
52493628.6717.3945.77