Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
NLP Tasks on Consolidated NLP Tasks (eval)
Loading...
48.44
Score (Single Best Aug)
Human-designed Augmentation
41.7736
43.5043
45.235
46.9657
Apr 26, 2024
Score (Single Best Aug)
Score (Task-Specific Best Aug)
Updated 4d ago
Evaluation Results
Method
Method
Links
Score (Single Best Aug)
Score (Task-Specific Best Aug)
Human-designed Augmentation
Target Model Size=large
2024.04
48.44
53.32
LLM-based Augmentation (LLMDA)
Target Model Size=medium
2024.04
47.8
51.09
Human-designed Augmentation
Target Model Size=medium
2024.04
47.25
49.52
Traditional Augmentation
Target Model Size=medium
2024.04
46.92
49.72
LLM-based Augmentation (LLMDA)
Target Model Size=large
2024.04
45.88
52.17
Traditional Augmentation
Target Model Size=large
2024.04
44.81
47.33
Human-designed Augmentation
Target Model Size=small
2024.04
42.43
44.52
Traditional Augmentation
Target Model Size=small
2024.04
42.37
44.18
LLM-based Augmentation (LLMDA)
Target Model Size=small
2024.04
42.03
48.47
Feedback
Search any
task
Search any
task