
Code Question Answering on LongBench CodeQA v2

Accuracy: 0.741

SRLM (no sub-calls)

[Chart: Accuracy, Mar 7, 2026]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2026.03   0.741
2026.03   0.689
2026.03   0.652
2026.03   0.649
2026.03   0.598
2026.03   0.595
2026.03   0.59
2026.03   0.58
2026.03   0.538
2026.03   0.5
2026.03   0.26
2026.03   0.24
2026.03   0.24
2026.03   0.24
2026.03   0.22
2026.03   0.2