Natural Language Understanding on BIG-Bench Hard (BBH)

42.1 Accuracy

Arcana

[Chart: Accuracy, axis ticks 34.404 to 40.398; Oct 17, 2024]
Updated 4d ago

Evaluation Results

Date       Accuracy
2024.10    42.1
2024.10    41.2
2024.10    38.2
2024.10    35.6
2024.10    34.7