Code Synthesis on Ag-LiveCodeBench-X 5.0 (derived)
[Chart: OCaml Pass@1 and Fortran Pass@1 by date. Highlighted point: Llama 3.3 70B Ins, OCaml Pass@1 = 7, Aug 6, 2025.]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | OCaml Pass@1 | Fortran Pass@1 |
|---|---|---|---|---|
| Llama 3.3 70B Ins | Parameters=70B, Varian... | 2025.08 | 7 | 3 |
| DSC v2 Lite Ins 16B | Parameters=16B, Varian... | 2025.08 | 7 | 6 |
| Qwen3-4B-CF-X | Parameters=4B, Trainin... | 2025.08 | 7 | 15 |
| Qwen3-8B-CF-X | Parameters=8B, Trainin... | 2025.08 | 7 | 17 |
| Sonnet 4 | | 2025.08 | 6 | 6 |
| Qwen 3 32B | Parameters=32B | 2025.08 | 2 | 1 |
| Qwen 3 4B | Parameters=4B | 2025.08 | 1 | 0 |
| Qwen 3 8B | Parameters=8B | 2025.08 | 0 | 0 |