Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scientific Text Generation on VTechAGP CELLS 1.0 (test)
Loading...
14.59
ROUGE2
Phi-4
7.7052
9.4926
11.28
13.0674
May 24, 2025
ROUGE2
s-BLEU
COMET
FRES
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE2
s-BLEU
COMET
FRES
Phi-4
2025.05
14.59
4.08
78.87
34.15
Sci-LoRA
2025.05
14.02
3.99
79.1
44.36
Qwen2.5
2025.05
11.92
3.07
78.95
43.43
Mixtral
2025.05
9.83
2.55
79.46
49.35
OPT
2025.05
9.77
-
74.74
34.05
GPT-3.5
2025.05
9.54
2.56
80.23
41.29
Mistral
2025.05
8.98
1.51
77.84
51.06
GPT-4o
2025.05
8.7
2.15
79.97
41.9
LLaMA3
2025.05
7.97
1.84
79.17
49.55
Feedback
Search any
task
Search any
task