Chat on IFEval
Best reported result: 48.8 Loose Prompt Metric (Fine-tuned), Apr 17, 2025. Updated 4d ago.
[Chart: Loose Prompt Metric]
Evaluation Results
Method      Details                     Date      Loose Prompt Metric
Fine-tuned  CR=1, Backbone=LLaMA3-...   2025.04   48.8
Fine-tuned  CR=1, Backbone=LLaMA2-...   2025.04   33.64
IMPART      CR=32, Backbone=LLaMA3...   2025.04   33.27
Fine-tuned  CR=1, Backbone=LLaMA2-...   2025.04   31.79
DARE        CR=32, Backbone=LLaMA3...   2025.04   30.5
LowRank     CR=32, Backbone=LLaMA3...   2025.04   29.39
IMPART      CR=32, Backbone=LLaMA2...   2025.04   27.91
IMPART      CR=32, Backbone=LLaMA2...   2025.04   26.8
LowRank     CR=32, Backbone=LLaMA2...   2025.04   26.06
DARE        CR=32, Backbone=LLaMA2...   2025.04   24.77
LowRank     CR=32, Backbone=LLaMA2...   2025.04   23.84
Backbone    CR=1, Backbone=LLaMA2-...   2025.04   20.52
Backbone    CR=1, Backbone=LLaMA2-...   2025.04   19.04
DARE        CR=32, Backbone=LLaMA2...   2025.04   16.82
Backbone    CR=1, Backbone=LLaMA3-...   2025.04   11.46