Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Shift on Dialogue shift Overall
Loading...
1.554
Alignment Score
CAA
0.39336
0.69468
0.996
1.29732
May 7, 2026
Alignment Score
Dialogue Quality Score
Updated 26d ago
Evaluation Results
Method
Method
Links
Alignment Score
Dialogue Quality Score
CAA
Model=Llama3.1-8B, Set...
2026.05
1.554
7.34
Memory Inception
Model=Llama3.1-8B, Set...
2026.05
1.516
7
Prompt-init
Model=Llama3.1-8B, Set...
2026.05
1.178
7.58
Memory Inception
Model=Qwen3-30B-A3B, S...
2026.05
0.816
7.75
CAA
Model=Qwen3-30B-A3B, S...
2026.05
0.526
8.44
Prompt-init
Model=Qwen3-30B-A3B, S...
2026.05
0.438
8.54
Feedback
Search any
task
Search any
task