Path-level reasoning on UNOBench synthetic Hard (test)

Best result: 56.8 SR-P (UNOGrasp)

[Chart: SR-P over time; Nov 28, 2025]
Updated 4d ago

Evaluation Results

Date      SR-P
2025.11   56.8   55.3   54.5   0.51
2025.11   42.2   35.4   37.2   0.67
2025.11   38.1   33.3   34.3   -
2025.11   33.9   31.5   31.9   -
2025.11   17.2   13.4   14.6   0.89
2025.11   14.9   14.8   13.8   0.68
2025.11   11.7    9.4   10.1   0.91
2025.11    9.7   17.2   10.2   0.79
2025.11    9.7    9.4    8.9   0.89
2025.11    5.9    5.7    5.4   0.74