BCB
Benchmarks
| Task Name | Dataset Name | SOTA Result | Results |
| --- | --- | --- | --- |
| Code Correctness Evaluation | BCB | F1 Score: 69.6 | 25 |
| Cost Prediction | BCB | Mean Absolute Error (MAE): 12.02 | 19 |
| Reward Prediction | BCB | MAE: 5.94 | 10 |
| Code clone detection | BCB-F (test) | Precision: 0.621 | 5 |