
MemoryBench

Benchmarks

Task Name               | Dataset Name                            | SOTA Result           | Trend
MemoryBench Task        | MemoryBench Long-Input-Long-Output      | Norm-Score 64.23      | 24
MemoryBench Task        | MemoryBench Short-Input-Short-Output    | Norm-Score 76.26      | 24
MemoryBench Task        | MemoryBench Short-Input-Long-Output 1.0 | Norm-Score 77.09      | 24
MemoryBench Task        | MemoryBench Long-Input-Short-Output     | Norm-Score 52.6       | 22
Robot Manipulation      | MemoryBench (test)                      | Success Rate 100      | 7
3D Robotic Manipulation | MemoryBench                             | Avg Success Rate 94.3 | 3