Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Generation on MBPP (Accuracy, Utility Preservation)
Loading...
69.8
Accuracy
Baseline
63.0192
64.7796
66.54
68.3004
Mar 16, 2026
Accuracy
Utility Preservation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Utility Preservation
Baseline
2026.03
69.8
-
SFCoT
2026.03
63.28
90.7
Feedback
Search any
task
Search any
task