Share your thoughts, 1 month free Claude Pro on usSee more

Attack Technique Extraction on CTIBench ATE

69.6Micro F1

GPT-4.1

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4.1 2026.01		69.6
GPT-5-Mini 2026.01		68.1
o3-Mini 2026.01		59.9
GPT-5 2026.01		57.8
Llama-3.3-70B-Instruct 2026.01		51.9
Qwen-3-14B 2026.01		50.2
Foundation-Sec-8B-Reasoning 2026.01		49.1
GPT-OSS-20B 2026.01		47.8
GPT-5-Nano 2026.01		45.3
Phi-4 2026.01		43.5
Qwen-3-8B 2026.01		40.8
Foundation-Sec-8B-Instruct 2026.01		35.8
GPT-OSS-120B 2026.01		28.2
Llama-Primus-Nemotron-70B-Instruct 2026.01		26.8
Llama-3.1-8B-Instruct 2026.01		13.2
Llama-Primus-Merged 2026.01		5.8
DeepHat-V1-7B 2026.01		0.4