General AI assistant tasks on GAIA (68 curated tasks)
Top result: GraphBit, Accuracy 67.6 (Mar 8, 2026)
[Chart: Accuracy and Hallucination Rate]
Updated 19d ago
Evaluation Results
Method        Configuration               Date      Accuracy   Hallucination Rate
GraphBit      LLM=GPT-5.2, Proc. (ms...   2026.03   67.6       0
Pydantic AI   LLM=GPT-5.2, Proc. (ms...   2026.03   52.9       0
LlamaIndex    LLM=GPT-5.2, Proc. (ms...   2026.03   50         0
CrewAI        LLM=GPT-5.2, Proc. (ms...   2026.03   44.9       14.3
LangChain     LLM=GPT-5.2, Proc. (ms...   2026.03   38.2       41.2
LangGraph     LLM=GPT-5.2, Proc. (ms...   2026.03   36.8       47.1
AutoGen       LLM=GPT-5.2, Proc. (ms...   2026.03   35.3       33.8