
T0

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Natural Language Processing | T0 MTest11 P3 (test) | Accuracy 61.4 | 42 |
| Natural Language Processing | T0 benchmark | RTE 85.8 | 18 |
| Natural Language Processing | T0 Without SCloze dataset HyperT5 variant (test) | Accuracy 60.6 | 14 |
| Zero-shot Natural Language Understanding | T0 (test) | Accuracy 65.5 | 8 |
| Task Generalization | T0 Taxonomy Evaluation Tasks (val) | OBQA 59.1 | 7 |
| Natural Language Understanding | T0 Evaluation Suite IA3 PEFT (held-out) | RTE 71.9 | 6 |
| Few-shot learning | T0 11B (test) | Avg Test Score 74.9 | 6 |
| Instruction Following | T0 | Accuracy 49 | 5 |
| Instruction Following | T0 Zero-Shot | Accuracy - | 0 |