Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Symptom Severity Assessment on Psychotherapy dialogue datasets
Loading...
0.21
MSE
GPT-4
0.155556
0.523053
0.89055
1.258047
Oct 8, 2024
MSE
MAE
Updated 26d ago
Evaluation Results
Method
Method
Links
MSE
MAE
GPT-4
Model Category=Closed-...
2024.10
0.21
0.3292
GPT-4o-mini
Model Category=Closed-...
2024.10
0.2245
0.3329
Llama3.1-70B
Model Category=Open-So...
2024.10
0.3379
0.4041
Llama3.1-405B
Model Category=Open-So...
2024.10
0.3922
0.4476
Qwen2-72B
Model Category=Open-So...
2024.10
0.3962
0.4559
GPT-4-turbo
Model Category=Closed-...
2024.10
0.4055
0.449
Mistral-8X22B
Model Category=Open-So...
2024.10
0.5205
0.5452
Mistral-8X7B
Model Category=Open-So...
2024.10
1.5711
1.0927
Feedback
Search any
task
Search any
task