Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Expert Evaluation on AnnoMI Expert Evaluation Subset
Loading...
4.06
Cultivating Change Talk
HQ (High-quality human sessions)
1.5848
2.2274
2.87
3.5126
Feb 5, 2025
Cultivating Change Talk
Softening Sustain Talk
Partnership
Empathy
Change Talk Exploration
Evoking Change Talk
Counselor Realism
Client Realism
Updated 3mo ago
Evaluation Results
Method
Method
Links
Cultivating Change Talk
Softening Sustain Talk
Partnership
Empathy
Change Talk Exploration
Evoking Change Talk
Counselor Realism
Client Realism
HQ (High-quality human sessions)
Type=Human Counselor
2025.02
4.06
3.9
4.26
4.26
4.18
2.68
4.68
4.68
CAMI
Framework=STAR framework
2025.02
3.68
3.32
3.9
4
3.94
2.4
3.6
4.32
CoS
Description=Chain-of-S...
2025.02
2.74
2.74
3.6
3.72
3
1.8
3.06
4
LQ (Low-quality human sessions)
Type=Human Counselor
2025.02
1.68
1.74
1.46
1.38
1.58
1.2
2.32
4
Feedback
Search any
task
Search any
task