Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Instruction Following on SelfInst (GPT-4 Feedback Score & Rouge-L)

21.7Rouge-L

Adversarial Moment-Matching Distillation

9.53212.69115.8519.009Jun 14, 2023Nov 23, 2023May 4, 2024Oct 14, 2024Mar 25, 2025Sep 4, 2025Feb 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
21.754.2-
2024.06
21.656.7-
2024.06
20.952.4-
2024.06
20.853.4-
2026.02
20.8-25.1
2024.06
20.251.8-
2024.06
18.445-
2026.02
18.2-19
2026.02
17.5-17.2
2024.06
17.443.5-
2026.02
17.4-16.9
2023.06
17.357.4-
2023.06
17.152.5-
2026.02
17.1-15.7
2026.02
17-16.8
2026.02
16.9-16.9
2023.06
16.648.5-
2023.06
16.343.7-
2024.06
16.340.8-
2026.02
16.3-16
2023.06
16.247.2-
2026.02
16.2-14.8
2023.06
1647-
2023.06
15.845.4-
2024.06
15.846.8-
2026.02
14.6-10.3
2023.06
14.546-
2023.06
14.342.9-
2024.06
14.334.7-
2026.02
14.3-12.5
2024.06
14.222.7-
2023.06
1439.2-
2023.06
13.643.2-
2023.06
13.438.9-
2026.02
13.3-9.4
2024.06
13.220.5-
2026.02
12.9-10
2024.06
12.720.7-
2026.02
12.7-9.1
2024.06
12.521.7-
2023.06
12.438.3-
2026.02
12.4-7
2024.06
12.318-
2026.02
12.3-9.2
2026.02
12.1-8.5
2026.02
12.1-8.9
2026.02
12-9.3
2024.06
11.618.2-
2024.06
11.518.2-
2026.02
10.6-5.9
2026.02
10.4-6.1
2026.02
10.4-6.2
2026.02
10.4-6.4
2024.06
10.320.2-
2026.02
10.3-5.7
2026.02
10.1-4.8
2026.02
10-4.9