Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PROST

Benchmarks

Task NameDataset NameSOTA ResultTrend
Physical ReasoningPROST
Accuracy29.6
12
Question AnsweringPROST
Accuracy33.68
1
Showing 2 of 2 rows