Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Resume quality judgment on Resume screening dataset ground truth GPT-5.1
Loading...
86.82
Accuracy
Qwen3-8B (AutoScreen-FW)
78.968
81.0065
83.045
85.0835
Mar 19, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-8B (AutoScreen-FW)
Few-shot=true, Samplin...
2026.03
86.82
Qwen3-8B
Few-shot=false, Sampli...
2026.03
85.52
Llama-3.1-8B (AutoScreen-FW)
Few-shot=true, Samplin...
2026.03
84.85
GPT-5-mini
Few-shot=false, Sampli...
2026.03
84.45
GPT-5-nano
Few-shot=false, Sampli...
2026.03
83.74
Llama-3.1-8B
Few-shot=false, Sampli...
2026.03
79.27
Feedback
Search any
task
Search any
task