Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Stylized Dialogue on 148-query style-3 persona (test)
Loading...
4.696
Context
SFR
4.5036
4.55355
4.6035
4.65345
May 27, 2026
Context
Relevance
Style Score (SS)
Fluency
Updated 6d ago
Evaluation Results
Method
Method
Links
Context
Relevance
Style Score (SS)
Fluency
SFR
Temperature=0.8, Backb...
2026.05
4.696
4.851
4.73
4.98
SFT
Temperature=0.8, Backb...
2026.05
4.568
4.818
3.932
4.831
DeepSeek-R1-prompt
Temperature=0.8, Backb...
2026.05
4.511
4.759
3.891
4.796
Feedback
Search any
task
Search any
task