Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SCONE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Sequential Instruction UnderstandingSCONE 1.0 (test)
Score (Sce)74.5
6
Grounded ReasoningSCONE STREET
Answer Accuracy72.4
3
Sequential instruction understandingSCONE (dev)
Sce. Score64.1
3
Showing 3 of 3 rows