Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Textual Quality Evaluation on Cadets E3 (test)
Loading...
56
GLEU
ProvSyn
-0.16
14.42
29
43.58
Jun 6, 2025
GLEU
Distinct-1
Updated 1mo ago
Evaluation Results
Method
Method
Links
GLEU
Distinct-1
ProvSyn
base_model=Llama3.2-3B...
2025.06
56
100
GPT
2025.06
36
100
ProvSyn
base_model=Llama3.2-3B...
2025.06
22
100
Claude
2025.06
16
100
GPT
2025.06
10
100
Qwen3
2025.06
7
100
Claude
2025.06
7
100
DeepSeek
2025.06
4
100
Qwen3
2025.06
3
100
DeepSeek
2025.06
2
100
Feedback
Search any
task
Search any
task