Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Paraphrasing Robustness on ELI5 prompts 32-bit payload Llama3.1-8B (test)

50.9Bit Accuracy

MC2Mark

49.75650.05350.3550.647May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
50.9
2026.05
50.9
2026.05
50.7
2026.05
50.5
2026.05
50.4
2026.05
50.1
2026.05
49.8