Text-to-code generation on DTVBENCH

Manim Score: 10.6

GPT-4o

4.69286.22647.769.2936
Oct 27, 2025
Updated 4d ago

Evaluation Results

Method  Links
2025.10
10.64.92
2025.10
9.76.07
2025.10
9.614.98
2025.10
8.564.04
2025.10
8.415.97
2025.10
6.635.08
2025.10
6.25.18
2025.10
4.923.15