Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-agent Task Fulfillment on AWS Benchmark Travel scenario
Loading...
78.79
User GSR
SEAgent
72.4876
74.1238
75.76
77.3962
Jan 17, 2026
User GSR
System GSR
Overall GSR
User Queries
Execution Time (s)
Token Usage (Count)
Updated 4d ago
Evaluation Results
Method
Method
Links
User GSR
System GSR
Overall GSR
User Queries
Execution Time (s)
Token Usage (Count)
SEAgent
Memory=SEMemory
2026.01
78.79
69.7
74.24
3.13
44.38
3,180.94
P2PEnv
Communication=Point-to...
2026.01
72.73
72.31
71.97
2.67
45.85
4,710.83
Feedback
Search any
task
Search any
task