Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context Question Answering on LoCoMo (LLJ Metrics)

97.1Single-Hop LLJ Score

Mnemis

27.83645.81863.881.782Jan 6, 2026Jan 26, 2026Feb 16, 2026Mar 9, 2026Mar 30, 2026Apr 20, 2026May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.02
97.190.779.292.993.9--
2026.02
96.189.770.891.192.3--
2026.02
94.992.577.188.392.1--
2026.05
9489.185.880.590.6--
2026.05
93.887.879.379.888.8--
2026.05
93.592.190.883.192--
2026.05
93.291.986.884.791.1--
2026.02
90.580.871.779.685.3--
2026.05
87.275.76674.781.2--
86.974.256.677.280.6--
2026.02
85.188.465.683.785.4--
2026.02
84.977.65175.179.5--
2026.02
84.550.8597173.4--
2026.05
83.169.251.172.576.3--
2026.01
81.572.176.884.280.7--
2026.05
81.173.145.470.675.3--
2026.05
8046.153.165.968.7--
2026.01
79.970.872.979.877.6--
2026.05
79.151.542.862.968.1--
2026.05
78.765.640.159.670.1--
2026.01
78.372.754.663.767.7--
2026.05
78.271.751.770.273.7--
2026.02
76.576.667.764.973.8--
2026.05
72.77163.459.369.3--
2026.02
71.456.947.968.266.3--
2026.05
69.857.25954.363.7--
2026.01
67.158.175.751.257.1--
2026.02
66.960.243.853.761.6--
2026.05
66.951.442.862.961.4--
2026.02
66.274.846.96165.8--
2026.05
64.445.336.555.257--
2026.05
62.355.438.349.557--
2026.01
62.223.471.147.946.9--
2026.01
61.749.376.641.449--
2026.05
59.254.533.645.754.1--
2026.05
56.520.842.34746.4--
2026.05
49.242.343.846.146.6--
2026.05
48.751.253.150.349--
2026.05
47.840.146.944.545--
2026.01
45.551.560.128.238.2--
2026.01
41.250.455.819.532.2--
2026.01
39.849.954.118.931.4--
2026.01
38.548.25317.830.1--
2026.01
37.141.550.216.527.6--
2026.01
30.535.845.314.223.6--
2026.01
-----25.7825.15
2026.01
-----30.3736.85
2026.01
-----42.5244.14
2026.01
-----45.3643.74
2026.01
-----45.0247.41
2026.01
-----54.454.68