Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Linguistic Reasoning on BIG-bench Hard Hyperbaton (test)

74Test Accuracy

PE2

46.9653.986168.02Nov 9, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.11
74
2023.11
72
2023.11
52
2023.11
48