Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code execution on SingleLine (test)
Loading...
94.03
Trace Accuracy
CodeExecutor
34.5836
50.0168
65.45
80.8832
May 8, 2023
Trace Accuracy
Output Accuracy
Line Precision
Line Recall
Line F1
Identifier Precision
Identifier Recall
Identifier F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Trace Accuracy
Output Accuracy
Line Precision
Line Recall
Line F1
Identifier Precision
Identifier Recall
Identifier F1
CodeExecutor
curriculum_learning=true
2023.05
94.03
-
94.03
94.03
94.03
97.28
97.18
97.23
CEL-S1
training_stage=Stage 1...
2023.05
93.32
-
93.32
93.32
93.32
96.94
96.86
96.9
Codex
shots=3
2023.05
36.87
36.87
36.87
36.87
36.87
71.87
69.34
70.58
Feedback
Search any
task
Search any
task