Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ConflictQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Contextual Robustness Question AnsweringConflictQA Unknown queries
Accuracy (Short Context)99.28
22
Contextual Robustness Question AnsweringConflictQA (Known queries)
Accuracy (Contradictory Short)82.49
22
Generative Multiple-choice Question AnsweringConflictQA
TA Rate98.8
6
Showing 3 of 3 rows