Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LongCodeQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long Code QALongCodeQA 8× Constraint
Accuracy60
8
Long Code QALongCodeQA 4× Constraint
Accuracy61.26
8
Showing 2 of 2 rows