Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Control

Benchmarks

Task NameDataset NameSOTA ResultTrend
Control calibrationControl
Accuracy94.6
16
Control Flow EvaluationControl (test)
Per Character Accuracy99.6
4
Showing 2 of 2 rows