Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Failure attribution on MemTraceBench Long-Context
Loading...
7.5
ETA
MemTrace-OBS
6.9668
10.5659
14.165
17.7641
May 27, 2026
ETA
OIA
Updated 7d ago
Evaluation Results
Method
Method
Links
ETA
OIA
MemTrace-OBS
Backbone=GPT-5.4
2026.05
7.5
7.5
MemTrace-OBS
Backbone=GPT-4.1 mini
2026.05
9.17
3.33
MemTrace
Backbone=GPT-5.4
2026.05
20
20
MemTrace
Backbone=GPT-4.1 mini
2026.05
20.83
4.17
Feedback
Search any
task
Search any
task