Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Detection on Algospeak (Emotion) 1.0 (test)
Loading...
0.99
Adjusted R2
Qwen
-9.3164
-6.6407
-3.965
-1.2893
May 7, 2026
Adjusted R2
Spearman Correlation Significance
Majority Fit Estimation
Significance Count
Updated 26d ago
Evaluation Results
Method
Method
Links
Adjusted R2
Spearman Correlation Significance
Majority Fit Estimation
Significance Count
Qwen
2026.05
0.99
-
-
-
GPT-4o-m
2026.05
0.98
-
-
-
GPT-4o
2026.05
0.96
-
-
-
Llama
2026.05
0.95
-
-
-
Grok
2026.05
0.94
-
-
-
Mistral
2026.05
0.88
-
-
-
Claude
2026.05
-8.92
-
-
-
Significance Count
2026.05
-
-
-
6
Feedback
Search any
task
Search any
task