Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Question Answering on Supply Chain Question Answering 1.0 (test)

0.8391Accuracy

GPT-5-mini

0.179220.3505350.521850.693165Feb 7, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.8391
2026.02
0.8391
2026.02
0.8276
2026.02
0.8207
2026.02
0.8069
2026.02
0.8069
2026.02
0.8069
2026.02
0.8068
2026.02
0.8046
2026.02
0.8045
2026.02
0.8023
2026.02
0.7977
2026.02
0.7816
2026.02
0.7816
2026.02
0.7793
2026.02
0.777
2026.02
0.7724
2026.02
0.7724
2026.02
0.6897
2026.02
0.6805
2026.02
0.6781
2026.02
0.6775
2026.02
0.669
2026.02
0.667
2026.02
0.6667
2026.02
0.6643
2026.02
0.6621
2026.02
0.6506
2026.02
0.6483
2026.02
0.6368
2026.02
0.623
2026.02
0.6207
2026.02
0.6161
2026.02
0.6
2026.02
0.5977
2026.02
0.5655
2026.02
0.4748
2026.02
0.2046