Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CS-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Chart spatial understandingCS-Bench
R@0.345.3
8
Autonomous LLM Fine-tuningCS-Bench
Accuracy85.3
4
Showing 2 of 2 rows