Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Refinement on HE-PY
Loading...
27.08
PR
CoCoA
-0.22
6.8675
13.955
21.0425
Oct 4, 2024
PR
RR
Updated 17d ago
Evaluation Results
Method
Method
Links
PR
RR
CoCoA
lambda=2, Evaluator Mo...
2024.10
27.08
63
CoCoA
lambda=5, Evaluator Mo...
2024.10
22.91
61
CoCoA
lambda=5, Evaluator Mo...
2024.10
12.5
83
CoCoA
lambda=5, Evaluator Mo...
2024.10
11.27
86
CoCoA
lambda=2, Evaluator Mo...
2024.10
11.2
84
CoCoA
lambda=2, Evaluator Mo...
2024.10
10.15
90
CA-SP
Evaluator Model=Llama-...
2024.10
3.51
81
BC
Evaluator Model=Llama-...
2024.10
2.19
75
CodeJudge
Evaluator Model=Qwen-2...
2024.10
2.01
81
CA-SP
Evaluator Model=Qwen-2...
2024.10
1.81
82
BC
Evaluator Model=Qwen-2...
2024.10
1.54
80
BC
Evaluator Model=GPT-4o...
2024.10
1.35
82
CA-SP
Evaluator Model=GPT-4o...
2024.10
1.26
81
CodeJudge
Evaluator Model=GPT-4o...
2024.10
0.98
83
CodeJudge
Evaluator Model=Llama-...
2024.10
0.83
85
Feedback
Search any
task
Search any
task