WorkDL

Research

Benchmarks

Task Name                        Dataset Name   SOTA Result       Trend
Multi-Agent System Performance   Research       TS Score: 77.62   16
Financial Reasoning              Research       Accuracy: 54.2    5