Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DP

Benchmarks

Task NameDataset NameSOTA ResultTrend
Detective Puzzle ReasoningDP Complex
SA79.3
18
Detective Puzzle ReasoningDP Medium
SA83.2
18
Detective Puzzle ReasoningDP Easy
SA Score85.7
18
3D Solid Deformation SimulationDP
Position Error13.78
12
Dependency ParsingDP Average over all languages (test)
UAS57.1
6
Showing 5 of 5 rows