Stylized Dialogue on 148-query style-0 persona (test)
[Chart: Context Score. Highlighted point: SFR, 4.622 (May 27, 2026). Selectable metrics: Context Score, Relevance Score, Style Score (SS), Fluency Score.]
Updated 6d ago
Evaluation Results
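| Method             | Method details            | Date    | Context Score | Relevance Score | Style Score (SS) | Fluency Score |
|--------------------|---------------------------|---------|---------------|-----------------|------------------|---------------|
| SFR                | Temperature=0.8, Backb... | 2026.05 | 4.622         | 4.77            | 4.905            | 4.905         |
| SFT                | Temperature=0.8, Backb... | 2026.05 | 4.568         | 4.764           | 4.588            | 4.831         |
| DeepSeek-R1-prompt | Temperature=0.8, Backb... | 2026.05 | 4.5           | 4.662           | 4.654            | 4.838         |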