Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Natural Language Understanding on AdvGLUE (Adversarial Evaluation)

68.29Performance Score

Ours (+GDPO)

57.099660.004862.9165.8152May 20, 2026
Updated 13d ago

Evaluation Results

MethodLinks
2026.05
68.29---
2026.05
67.75---
2026.05
66.27---
64.9---
2026.05
59.62---
58.4---
2026.05
58.33---
2026.05
57.53---
2026.04
-87.363.7-23.6
2026.04
-85.174.2-10.9
2026.04
-84.773.8-10.9
2026.04
-86.977.3-9.6