Response Selection on AlignX
[Chart: Accuracy over time; best result 75.03 by ALIGNXPLORE+, Jan 8, 2026]
Evaluation Results
| Method | Inference Setting | Date | Accuracy |
|---|---|---|---|
| ALIGNXPLORE+ | Full... | 2026.01 | 75.03 |
| ALIGNXPLORE+ | Stre... | 2026.01 | 73.67 |
| ALIGNXPLORE | Stre... | 2026.01 | 69.9 |
| ALIGNXPLORE | Full... | 2026.01 | 66.6 |
| TALLRec | Dire... | 2026.01 | 66.3 |
| DeepSeek-R1-671B | Full... | 2026.01 | 65.9 |
| Qwen3-32B thinking | Full... | 2026.01 | 64.93 |
| Qwen3-32B thinking | Stre... | 2026.01 | 64.6 |
| DeepSeek-R1-671B | Stre... | 2026.01 | 64.06 |
| Qwen3-8B thinking | Stre... | 2026.01 | 62.9 |
| Qwen3-8B thinking | Full... | 2026.01 | 62.73 |
| Qwen3-8B non-thinking | Dire... | 2026.01 | 59.63 |
| GPT-OSS-20B | Stre... | 2026.01 | 56.86 |
| DS-R1-Distill-Qwen-7B | Stre... | 2026.01 | 56.4 |
| GPT-OSS-20B | Full... | 2026.01 | 55.63 |
| DS-R1-Distill-Qwen-7B | Full... | 2026.01 | 54.03 |