Root Cause Mapping on CTIBench RCM
[Chart: Score over time. Best score: 0.753 (Foundation-Sec-8B-Reasoning), Jan 28, 2026. Updated 4d ago.]
Evaluation Results
| Method | Model Group | Date | Score |
| --- | --- | --- | --- |
| Foundation-Sec-8B-Reasoning | our reason... | 2026.01 | 0.753 |
| GPT-4.1 | frontier O... | 2026.01 | 0.73 |
| GPT-5 | frontier O... | 2026.01 | 0.728 |
| GPT-5-Mini | frontier O... | 2026.01 | 0.723 |
| GPT-OSS-120B | GPT-OSS mo... | 2026.01 | 0.712 |
| o3-Mini | frontier O... | 2026.01 | 0.708 |
| Foundation-Sec-8B-Instruct | Llama-fami... | 2026.01 | 0.704 |
| Llama-3.3-70B-Instruct | Llama-fami... | 2026.01 | 0.684 |
| GPT-5-Nano | frontier O... | 2026.01 | 0.672 |
| Llama-Primus-Merged | Llama-fami... | 2026.01 | 0.665 |
| Llama-Primus-Nemotron-70B-Instruct | Llama-fami... | 2026.01 | 0.664 |
| Phi-4 | smaller sp... | 2026.01 | 0.629 |
| Qwen-3-14B | smaller sp... | 2026.01 | 0.612 |
| GPT-OSS-20B | GPT-OSS mo... | 2026.01 | 0.61 |
| Qwen-3-8B | smaller sp... | 2026.01 | 0.542 |
| Llama-3.1-8B-Instruct | Llama-fami... | 2026.01 | 0.531 |
| DeepHat-V1-7B | smaller sp... | 2026.01 | 0.434 |