Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Stylized Dialogue on 148-query style-1 persona (test)
Loading...
4.257
Context Score
SFR
4.17172
4.19386
4.216
4.23814
May 27, 2026
Context Score
Relevance Score
Style Score (SS)
Fluency Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Context Score
Relevance Score
Style Score (SS)
Fluency Score
SFR
Temperature=0.8, Backb...
2026.05
4.257
4.628
3.135
4.791
SFT
Temperature=0.8, Backb...
2026.05
4.216
4.581
2.858
4.838
DeepSeek-R1-prompt
Temperature=0.8, Backb...
2026.05
4.175
4.562
3.117
4.781
Feedback
Search any
task
Search any
task