Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on HITL-filtered Mid (unseen)

99.8Accuracy (%)

OpsLLM-32B

Updated 2mo ago

Evaluation Results

Method	Links
OpsLLM-32B 2026.04		99.8
Zhiyu-32B 2026.04		99.4
Qwen3-Max-2025-09-23 2026.04		99
Qwen2.5-32B-Instruct 2026.04		98.8
Moonshot-Kimi-K2-Instruct 2026.04		98.6
GPT-5.2 2026.04		98.4
OpsLLM-14B 2026.04		98.4
Qwen-Turbo-2025-07-15 2026.04		98.2
Qwen-Plus-2025-09-11 2026.04		97.8
Deepseek-v3.2-exp 2026.04		97.6
OpsLLM-7B 2026.04		97.6
Qwen2.5-14B-Instruct 2026.04		97.2
Qwen2.5-7B-Instruct 2026.04		97
Qwen3-Next-80b-a3b-Thinking 2026.04		96.8
R1-Distill-SRE-Qwen-32B-INT8 2026.04		96.6
aiops-qwen-4b 2026.04		95.8
R1-Distill-SRE-Qwen-7B 2026.04		38.6