
General AI Assistants Evaluation on GAIA n=50

0.5944 Accuracy

GPT-5 search

Dec 7, 2025
Updated 4d ago

Evaluation Results

Method  Links
2025.12  0.5944  -0.183  0.745
2025.12  0.3971  -0.184  0.756
2025.12  0.3226  -0.164  0.738
2025.12  0.3128  -0.18   0.772
2025.12  0.2228  -0.123  0.699
2025.12  0.2159  -0.115  0.671
2025.12  0.1297  -0.085  0.736