Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Natural Instructions

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingNatural Instructions (test)
Rouge-L97.9
90
Cross-Task GeneralizationSuper-NaturalInstructions English Track (unseen clients)
Weighted Avg Rouge-L62.2
27
Instruction TuningNatural Instructions Meta Non-IID
Rouge-L34.81
22
Federated LearningNatural Instructions (NI)
Speedup48.8
10
Continual Pre-trainingNatural Instructions (val)
Answer Verification2.391
7
TG taskNatural Instructions task459_matres_static_classification
Correctness69
3
TG taskNatural Instructions task457_matres_conditional_classification
Correctness87
3
TG taskNatural Instructions task108_contextualabusedetection_classification
Correctness75
3
TG taskNatural Instructions task022_cosmosqa_passage_inappropriate_binary
Correctness80
3
TG taskNatural Instructions task021_mctaco_grammatical_logical
Correctness0.5
3
Cross-task GeneralizationNatural Instructions (test)
Answerability Classification3.076
3
Showing 11 of 11 rows