Compliance evaluation on Compliance domain (test)
[Chart: Accuracy 98, Groq-qwen3-32b, Mar 13, 2026. Metrics shown: Accuracy, ECE, Avg Runtime (ms), Cost per Query.]
Evaluation Results
Method              Details                    Date     Accuracy  ECE   Avg Runtime (ms)  Cost per Query
Groq-qwen3-32b      Parameters=32B, Evalua...  2026.03  98        79.7  3,008.6           0.003
Groq-kimi-k2        Parameters=Unknown, Ev...  2026.03  98        80.5  764.2             0.002
LPF-SPN             Parameters=~50M, Evalu...  2026.03  97.8      1.4   14.8              0
Groq-llama-3.3-70b  Parameters=70B, Evalua...  2026.03  95.9      81.6  1,578.7           0.004
Groq-gpt-oss-120b   Parameters=120B, Evalu...  2026.03  93.9      81.3  1,541.7           0.006