Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General AI Assistant Tasks on GAIA Level 1 (val)

62.3Accuracy

GPT-5

21.11631.80842.553.192Dec 7, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
62.3-0.1850.774
2025.12
22.7-0.10.561