Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text Quality Evaluation on General Capability Prompts
Loading...
5.44
Fluency
Prompt_auto
3.5472
4.0386
4.53
5.0214
May 23, 2025
Fluency
Updated 4d ago
Evaluation Results
Method
Method
Links
Fluency
Prompt_auto
Model=Gemma-2-9b-it
2025.05
5.44
SAE_AXBENCH
Model=Gemma-2-9b-it
2025.05
5.43
STA
Model=Gemma-2-9b-it
2025.05
5.43
CAA
Model=Gemma-2-9b-it
2025.05
5.42
Prompt_hand
Model=Gemma-2-9b-it
2025.05
5.41
Vanilla
Model=Gemma-2-9b-it
2025.05
5.39
CAA
Model=Gemma-2-9b-pt
2025.05
4.38
SAE_AXBENCH
Model=Gemma-2-9b-pt
2025.05
4.33
Vanilla
Model=Gemma-2-9b-pt
2025.05
4.31
STA
Model=Gemma-2-9b-pt
2025.05
4.29
Prompt_auto
Model=Gemma-2-9b-pt
2025.05
4.19
Vanilla
Model=Llama-3.1-8B
2025.05
4.04
Prompt_auto
Model=Llama-3.1-8B
2025.05
4.03
SAE_AXBENCH
Model=Llama-3.1-8B
2025.05
3.96
STA
Model=Llama-3.1-8B
2025.05
3.92
CAA
Model=Llama-3.1-8B
2025.05
3.89
Prompt_hand
Model=Gemma-2-9b-pt
2025.05
3.88
Prompt_hand
Model=Llama-3.1-8B
2025.05
3.62
Feedback
Search any
task
Search any
task