Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Data Analysis on T2 1.0 (test)
Loading...
92.5
Task Completion Rate
LangChain
67.54
74.02
80.5
86.98
Jan 19, 2026
Task Completion Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Task Completion Rate
LangChain
2026.01
92.5
AgentForge
2026.01
91.2
AutoGPT
2026.01
71.2
Direct API
2026.01
68.5
Feedback
Search any
task
Search any
task