Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multitask NLP on DecaNLP Subset (SQuAD 2.0, WikiSQL, SST, QA-SRL, WOZ) (test)
Loading...
78.2
Average Score
Multitasked
50.12
57.41
64.7
71.99
Dec 17, 2025
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Score
Multitasked
description=Upper boun...
2025.12
78.2
PPSEBM
sampling_rate=0.2, mod...
2025.12
77.4
LSEBMCL
sampling_rate=0.2, mod...
2025.12
77.3
HMI-LAMOL
sampling_rate=0.2, mod...
2025.12
76.9
PPSEBM
sampling_rate=0.05, mo...
2025.12
76.7
LSEBMCL
sampling_rate=0.05, mo...
2025.12
76.5
HMI-LAMOL
sampling_rate=0.05, mo...
2025.12
76
LAMOL
sampling_rate=0.2, mod...
2025.12
74.1
LAMOL
sampling_rate=0.05, mo...
2025.12
70.3
Fine-tuned
2025.12
52.6
MAS
2025.12
51.2
Feedback
Search any
task
Search any
task