Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on SFT data Easy (seen)

99.6Accuracy

OpsLLM-32B

Updated 2mo ago

Evaluation Results

Method	Links
OpsLLM-32B 2026.04		99.6
OpsLLM-14B 2026.04		99.2
GPT-5.2 2026.04		98.8
Moonshot-Kimi-K2-Instruct 2026.04		98.6
Zhiyu-32B 2026.04		98.6
Qwen2.5-32B-Instruct 2026.04		98
OpsLLM-7B 2026.04		98
Qwen2.5-14B-Instruct 2026.04		97.6
Qwen2.5-7B-Instruct 2026.04		96.8
Deepseek-v3.2-exp 2026.04		96.6
Qwen3-Max-2025-09-23 2026.04		96.6
Qwen-Turbo-2025-07-15 2026.04		96.4
Qwen3-Next-80b-a3b-Thinking 2026.04		96.2
Qwen-Plus-2025-09-11 2026.04		96
R1-Distill-SRE-Qwen-32B-INT8 2026.04		95.4
aiops-qwen-4b 2026.04		95
R1-Distill-SRE-Qwen-7B 2026.04		40