Share your thoughts, 1 month free Claude Pro on usSee more

Distractor Generation on Human Evaluation Set (test)

4.14Relevance

GPT-3

Updated 3mo ago

Evaluation Results

Method	Links
GPT-3 2026.04		4.14	3.51	4.78
GPT-3 2026.04		3.74	3.15	4.65
Ground-truth 2026.04		3.73	3.28	4.58
GPT-3 2026.04		3.45	2.93	4.56
TinyLlama 2026.04		3.34	2.97	4.47
T5 2026.04		3	2.51	3.9
T5 2026.04		2.54	2.18	3.58