Intent Detection on Thomas
Best Macro-F1: 87.4 (Goal-based strategy), Mar 28, 2026
Evaluation Results
| Method | LLM | Date | Macro-F1 |
| --- | --- | --- | --- |
| Goal-based strategy | Mistral | 2026.03 | 87.4 |
| Goal-based strategy | Llama3 | 2026.03 | 86.2 |
| Goal-based strategy | Gemma | 2026.03 | 86.2 |
| Zero-CoT | Llama3 | 2026.03 | 82.5 |
| Self-Debias | Llama3 | 2026.03 | 82.4 |
| Self-Debias | Gemma | 2026.03 | 80.8 |
| Zero-CoT | Mistral | 2026.03 | 79.4 |
| Zero-CoT | Gemma | 2026.03 | 79.3 |
| Self-Debias | Mistral | 2026.03 | 79.2 |