Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PLASMA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Binary decisionPLASMA
Accuracy84.5
27
Reasoning/PlanningPlasma
Accuracy85
10
Showing 2 of 2 rows