Multi-Motif Detection on LLMTM Level 2 1.0 (test)
Top result: o3, 39.64 accuracy (Dec 24, 2025). Updated 3d ago.
Evaluation Results
Method               Date      Accuracy
o3                   2025.12   39.64
DeepSeek-R1          2025.12   32.14
DeepSeek-Qwen-32B    2025.12   24.91
Qwen2.5-32B          2025.12   23.88
DeepSeek-Qwen-14B    2025.12   23.67
QwQ-32B              2025.12   23.19
GPT-4o-mini          2025.12   18.8
DeepSeek-Qwen-7B     2025.12   11.19
openPangu-7B         2025.12   10.71