MemoryBench Task on MemoryBench Short-Input-Long-Output 1.0

77.09 Norm-Score

UNO

[Chart: Norm-Score axis with ticks at 20.254, 35.0095, 49.765, 64.5205; date marker Feb 6, 2026]
Updated 4d ago

Evaluation Results

Method   Links

Date      Norm-Score   Second value (column unlabeled)
2026.02   77.09        7.16
2026.02   76.36        2.99
2026.02   74.62        -8.96
2026.02   74.54        -8.83
2026.02   74.43        -9.52
2026.02   73.22        -17.04
2026.02   72.29        -23.07
2026.02   71.95        -25.55
2026.02   70.66        -36.69
2026.02   70.39        -34.68
2026.02   70.36        -35.02
2026.02   69.95        -37.85
2026.02   69.61        -43.08
2026.02   69.33        -44.72
2026.02   68.81        -48.25
2026.02   68.25        -51.93
2026.02   67.96        -53.78
2026.02   67.27        -57.95
2026.02   67.17        -58.71
2026.02   66.97        -60.05
2026.02   66.67        -63.21
2026.02   66.66        -61.81
2026.02   66.27        -64.57
2026.02   22.44        -350.94