
TreeBench

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Visual Grounded Reasoning | TreeBench | Overall Score: 54.8 | 17
Perception | TWI-oriented TreeBench online setting | Accuracy: 70.5 | 16
Reasoning | TWI-oriented TreeBench online setting | Accuracy: 37.1 | 16
Perception | TWI-oriented TreeBench (offline) | Accuracy: 71.1 | 12
Reasoning | TreeBench TWI-oriented (offline) | Accuracy: 37.1 | 12