
BCB

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Code Correctness Evaluation | BCB | F1 Score: 69.6 | 25 |
| Cost Prediction | BCB | Mean Absolute Error (MAE): 12.02 | 19 |
| Reward Prediction | BCB | MAE: 5.94 | 10 |
| Code clone detection | BCB-F (test) | Precision: 0.621 | 5 |