Share your thoughts, 1 month free Claude Pro on usSee more

Resume quality judgment on Resume screening dataset ground truth GPT-5.1

86.82Accuracy

Qwen3-8B (AutoScreen-FW)

Updated 4mo ago

Evaluation Results

Method	Links
Qwen3-8B (AutoScreen-FW) 2026.03		86.82
Qwen3-8B 2026.03		85.52
Llama-3.1-8B (AutoScreen-FW) 2026.03		84.85
GPT-5-mini 2026.03		84.45
GPT-5-nano 2026.03		83.74
Llama-3.1-8B 2026.03		79.27