Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Distractor Generation on Human Evaluation Set (test)

4.14Relevance

GPT-3

2.4762.9083.343.772Apr 19, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
4.143.514.78
2026.04
3.743.154.65
3.733.284.58
2026.04
3.452.934.56
2026.04
3.342.974.47
2026.04
32.513.9
2026.04
2.542.183.58