Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Editing on Aider-Polyglot
Loading...
61.7
Accuracy
OpenAI-o1-1217
14.172
26.511
38.85
51.189
Jan 22, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI-o1-1217
2025.01
61.7
DeepSeek-R1
Architecture=MoE, Acti...
2025.01
53.3
DeepSeek-V3
Architecture=MoE, Acti...
2025.01
49.6
Claude-3.5-Sonnet-1022
2025.01
45.3
OpenAI-o1-mini
2025.01
32.9
GPT-4o-0513
2025.01
16
Feedback
Search any
task
Search any
task