Theme Detection on Colorado
Top result: 84.8 Macro F1 Score, Context-based strategy (Mar 28, 2026)
Updated 19d ago
Evaluation Results
Method                  LLM      Date     Macro F1 Score
Context-based strategy  Mistral  2026.03  84.8
Self-Debias             Gemma    2026.03  84.2
Context-based strategy  Gemma    2026.03  84.2
Zero-CoT                Mistral  2026.03  83.6
Context-based strategy  Llama3   2026.03  83.2
Self-Debias             Mistral  2026.03  83
Zero-CoT                Gemma    2026.03  82.6
Self-Debias             Llama3   2026.03  79.5
Zero-CoT                Llama3   2026.03  79.4