Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Distractor Generation on D-GEN Commonsense Reasoning
Loading...
4.97
Fluency
D-GEN
4.7215
4.84575
4.97
5.09425
Apr 18, 2025
Fluency
Coherence
Distracting Ability
Incorrectness
Updated 4d ago
Evaluation Results
Method
Method
Links
Fluency
Coherence
Distracting Ability
Incorrectness
D-GEN
Evaluation Protocol=Hu...
2025.04
4.97
4.38
4.06
3.76
Feedback
Search any
task
Search any
task