Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Instruction Tuning benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Instruction Tuning
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Dolly-15K alpha=5.0
FedHDS
Rouge-L
35.79
22
1mo ago
Dolly-15K alpha=0.5
Coreset-Cent
Rouge-L
35.48
22
1mo ago
Natural Instructions Meta Non-IID
Coreset-Cent
Rouge-L
34.81
22
1mo ago
UNI
ARMADA
RougeL
34.53
21
1mo ago
SelfInst
ARMADA
ROUGE-L
21.31
21
1mo ago
SNI
ARMADA
RougeL
29.63
21
1mo ago
Dolly
LLaMA-3.1-8B
RougeL
35.34
21
1mo ago
CoT
ADG
Reasoning Score
70.55
20
4d ago
WizardLM
ADG
Reasoning Score
75.07
20
4d ago
Alpaca GPT4
ADG
Reasoning
75.43
20
4d ago
Instruction Tuning Datasets 1.0 (train test)
K-Center-Greedy
Model Performance
1.45
20
1mo ago
Alpaca instruction-tuning 52k
GRADFILTERING
Pairwise Winning Score
116
19
1mo ago
IT Evaluation Suite MMLU, BBH, GSM, TydiQA, CodeX, AE
Alpaca-GPT4
MMLU
55.7
18
1mo ago
Data Mix
GRADIENTSPACE
Accuracy
59.1
16
1mo ago
Vicuna
ARMADA
RougeL Score
18.73
11
1mo ago
FLAN subset Average (test)
FedRouter*
ROUGE-1
56.7
7
16d ago
FLAN All (test)
FedRouter*
ROUGE-1
57.5
7
16d ago
FLAN Dual (test)
FedRouter*
ROUGE-1
56.3
7
16d ago
FLAN Single (test)
FedRouter*
ROUGE-1
56.2
7
16d ago
AlpacaEval 2.0 (test)
Full SFT
Win Rate (LC)
11.49
7
1mo ago
Alpaca 52K instruction LLaMA-7B (test)
DQ (2%)
BBH
32.9
4
1mo ago
Anthropic HH (test)
SCAR
Win Rate
56.3
2
1mo ago
Anthropic HH-RLHF (test)
-
-
0
1mo ago
Dolly-15K
-
-
0
1mo ago
Showing 24 of 24 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs