Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Autonomous LLM Fine-tuning on CS-Bench

85.3Accuracy

Qwen3-235B-2507

51.91660.58369.2577.917Apr 15, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.04
85.3
2026.04
58.1
2026.04
57.2
2026.04
53.2