Stylized Dialogue on 148-query style-2 persona (test)
[Chart: Context score; top entry SFT at 4.169 (May 27, 2026). Other metrics: Relevance, Style Score (SS), Fluency.]
Updated 6d ago
Evaluation Results
| Method | Details | Date | Context | Relevance | Style Score (SS) | Fluency |
|---|---|---|---|---|---|---|
| SFT | Temperature=0.8, Backb... | 2026.05 | 4.169 | 4.311 | 4.73 | 4.98 |
| SFR | Temperature=0.8, Backb... | 2026.05 | 4.128 | 4.223 | 4.824 | 4.993 |
| DeepSeek-R1-prompt | Temperature=0.8, Backb... | 2026.05 | 4.117 | 4.285 | 4.766 | 4.964 |