Root Cause Mapping on CTIBench RCM

Score: 0.753

Foundation-Sec-8B-Reasoning

Updated 4mo ago

Evaluation Results

Method	Links	Score
Foundation-Sec-8B-Reasoning 2026.01		0.753
GPT-4.1 2026.01		0.73
GPT-5 2026.01		0.728
GPT-5-Mini 2026.01		0.723
GPT-OSS-120B 2026.01		0.712
o3-Mini 2026.01		0.708
Foundation-Sec-8B-Instruct 2026.01		0.704
Llama-3.3-70B-Instruct 2026.01		0.684
GPT-5-Nano 2026.01		0.672
Llama-Primus-Merged 2026.01		0.665
Llama-Primus-Nemotron-70B-Instruct 2026.01		0.664
Phi-4 2026.01		0.629
Qwen-3-14B 2026.01		0.612
GPT-OSS-20B 2026.01		0.61
Qwen-3-8B 2026.01		0.542
Llama-3.1-8B-Instruct 2026.01		0.531
DeepHat-V1-7B 2026.01		0.434