Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Structured Reasoning on LoCoMo
Loading...
61.8
F1 Score
MLMF
58.16
59.105
60.05
60.995
Mar 31, 2026
F1 Score
Multi-hop F1 Score
Updated 18d ago
Evaluation Results
Method
Method
Links
F1 Score
Multi-hop F1 Score
MLMF
Ref=MLMF
2026.03
61.8
59.4
[18]
Ref=[18]
2026.03
58.3
-
Feedback
Search any
task
Search any
task