
ProsQA

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Reasoning | ProsQA | Acc 100 | 26
Logical Reasoning | ProsQA | Accuracy 98.5 | 6
Propositional Logic Reasoning | ProsQA (val) | Accuracy 93.37 | 4
Question Answering | ProsQA | Accuracy 91.8 | 3
Logical Reasoning | ProsQA Enhanced | OA (%) 97.8 | 1