Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
TG task on Natural Instructions task022_cosmosqa_passage_inappropriate_binary
Loading...
80
Correctness
Logitext
68.56
71.53
74.5
77.47
Feb 20, 2026
Correctness
Updated 4d ago
Evaluation Results
Method
Method
Links
Correctness
Logitext
approach=logitext
2026.02
80
Fewshot
shot_type=few-shot
2026.02
78
Neurosymbolic
approach=neurosymbolic
2026.02
69
Feedback
Search any
task
Search any
task