Long-context Reasoning on LoCoMo

95.06 Average F1

FluxMem

[Chart: scores over time, Jan 25, 2026 to May 27, 2026]
Updated 6d ago

Evaluation Results

Method | Links
2026.05
95.0693.26-95.64-90.62-95.95------------
2026.05
93.4491.49-89.72-87.5-96.19------------
2026.05
93.0591.84-89.72-76.04-96.67------------
2026.05
85.3883.7-88.39-65.62-85.11------------
2026.05
82.476.95-80.37-57.29-87.87------------
2026.05
81.2380.5-71.03-58.33-87.99------------
2026.05
81.177.3-76.32-55.21-87.16------------
2026.05
74.8769.86-51.71-57.29-87.4------------
2026.05
71.4361.35-71.03-50-77.41------------
2026.05
71.3668.09-50.16-52.08-82.76------------
2026.05
70.6566.67-55.45-60.42-78.95------------
2026.05
66.368.2-56.9-47.9-71.4------------
2026.05
61.653.7-60.2-43.8-66.9------------
2026.05
59.8163.12-32.09-48.96-79.51------------
2026.05
56.157.45-27.73-43.75-67.9------------
2026.05
51.0146.46-35.36-20.05-62.05------------
2026.05
50.9245.52-34.82-21.21-62.27------------
2026.05
50.744.35-34.76-17.9-62.66------------
2026.05
49.1242.57-39.31-18.12-58.59------------
2026.05
48.1137.95-34.99-16.75-60.11------------
2026.05
47.235.18-35.05-20.49-58.92------------
2026.05
47.0541-36.48-14.08-56.88------------
2026.05
47.0241.28-30.43-14.2-59.03------------
2026.05
46.5342.14-37.2-13.35-55.36------------
2026.01
44.9433.8825.9155.4147.7728.1421.0946.5739.43--37.55--------
2026.03
44.6537.732.0855.942.3725.8721.744.8440.02--37.92--------
2026.01
44.1834.6925.8243.4234.9224.7717.4349.8641.17--35.58--------
2026.04
43.1435.8827.9651.9642.2628.1224.5156.5851.18--36.48--------
2026.03
41.8435.1826.3251.4937.0625.8221.4642.2136.43--33.78--------
2026.03
41.7636.4528.5342.5735.9128.823.1344.8238.25--35.1--------
2026.05
40.7938.39-32.89-10.77-48.05------------
2026.04
40.3833.3225.3850.2539.3320.418.557.5550.99--33.55--------
2026.04
4035.3225.7748.9739.6821.1417.8954.5848.62--32.99--------
2026.05
39.4129.78-27.15-19.59-49.59------------
2026.05
39.3929.12-27.09-22.57-49.45------------
2026.02
38.6228.2422.7638.3933.6415.4313.8142.0936.5743.7943.1434.51--------
2026.03
38.1635.1926.5534.3829.0720.8815.5742.6636.9--32.04--------
2026.05
37.7426.35-26.05-15.41-48.58------------
2026.03
37.0830.622.539.633.5724.2117.2839.6930.6--30.51--------
2026.04
36.2728.7422.3546.1237.2421.1818.3349.0242.18--30.02--------
2026.05
35.4627.48-25.34-17.04-44.1------------
2026.01
34.7135.7525.6515.7212.2525.2118.8542.3835.83--28.16--------
2026.01
34.0535.1925.8613.029.8624.317.1542.835.96--27.5--------
2026.02
33.6530.823.1329.2524.5614.1111.0342.2535.5226.5925.0228.45--------
2026.03
33.4824.5717.2839.9634.0220.4314.0435.4630.06--27.63--------
2026.02
32.7624.316.934.523.113.112.238.133.33130.127.58--------
2026.02
32.4233.3724.2631.4916.4213.9211.0225.4624.8249.173525.02--------
2026.03
32.426.7317.9120.9717.5521.2416.9239.9635.1--27.09--------
2026.03
32.3130.2421.628.8923.6726.5521.2435.0129.88--26.55--------
2026.02
30.8822.1313.4431.4722.1614.5113.5433.4934.1234.5831.4427.67--------
2026.01
29.9535.6930.6511.3210.1132.6326.4621.7515.88--24.38--------
2026.01
29.43-----------85.90.45------
2026.02
29.2721.2514.5330.221.1111.3310.5332.7526.3330.9530.1323.9--------
2026.02
28.9924.1215.4125.4819.0413.4412.6434.7432.4127.1124.3225.06--------
2026.02
28.6225.0915.7332.8227.1414.4713.3520.1818.3946.7740.8124.22--------
2026.02
28.3721.3614.9823.0618.0412.6211.4935.4330.9226.7125.7824.48--------
2026.01
28.37-----------83.60------
2026.02
28.1625.9718.1625.3718.7613.5211.6934.9230.621.9417.5623.08--------
2026.02
27.823.0419.7429.6523.1620.6313.7530.4625.6226.0222.4823.11--------
2026.03
26.3711.1611.0742.333.7530.6925.3824.8418.54--20.7--------
2026.02
25.8713.179.334.9127.048.87.4526.4424.8329.9828.3422.93--------
2026.01
25.75-----------81.20------
2026.03
25.7410.8910.841.3133.0329.9724.8424.318.09--22.95--------
2026.04
24.4429.418.265.0116.8713.6530.6824.38--18.54--------
2026.04
24.225.112.928.44615.3211.5330.9324.52--17.72--------
2026.05
24.1822.41-16.49-10.87-29.22------------
2026.02
23.8620.9816.2731.521.7312.713.2224.719.1421.0119.8419.02--------
2026.03
22.510.3510.2630.0624.0328.3523.4922.9517.1--17.64--------
2026.03
21.9610.089.9929.3423.4927.7222.9522.4116.74--17.28--------
2026.01
21.1635.1130.168.247.5319.8112.113.710.48--15.28--------
2026.02
20.6313.169.6118.1212.3312.169.2532.8328.355.964.216.75--------
2026.04
19.3421.3212.35.515.2515.112.424.4419.49--14.77--------
2026.04
12.5813.997.833.722.6920.117.7414.6311.67--9.47--------
2026.01
9.46-----------82.90.15------
2026.01
5.17-----------78.90------
2026.02
--------------46.8127.4148.9676.9359.35-
2026.02
--------------45.0424.6141.6773.4856.1-
2026.02
--------------20.576.2334.3832.124.74-
2026.02
--------------17.383.4323.9615.113.64-
2026.02
--------------9.931.8727.0810.349.55-
2026.02
--------------11.351.872510.119.55-
2026.02
--------------12.061.5630.218.328.96-
2026.02
--------------36.8823.9933.3380.3857.73-
2026.02
--------------34.7516.8242.7174.6753.31-
2026.02
--------------30.1417.1336.4672.0650.71-
2026.02
--------------42.5527.7340.6285.4962.79-
2026.02
--------------46.8127.7348.9676.8159.35-
2026.02
--------------48.2338.0146.8885.1466.17-
2026.05
--------------38.3932.8910.77-40.7948.05
2026.05
--------------38.1220.349.99-36.6845.47
2026.05
--------------39.0730.1310.98-40.9849.19
2026.05
--------------37.8821.7613.35-38.1447.31
2026.05
--------------37.9534.9916.75-48.1160.11
2026.05
--------------32.9333.312.67-40.0548.13
2026.05
--------------32.8716.728.81-26.8530.75
2026.05
--------------4136.4814.08-47.0556.88
2026.05
--------------42.1437.213.35-46.5355.36
2026.05
--------------42.5739.3118.12-49.1258.59