Code Exec

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Code Execution | Code Exec 5 variables (test) | Accuracy: 93.2 | 6
Code Execution | Code Exec 3 variables (test) | Accuracy: 99 | 6