Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TahoeQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Direction of changeTahoeQA
C32 Accuracy85.5
8
Differential expressionTahoeQA
C32 Score0.47
8
Showing 2 of 2 rows