Long-context reasoning on BABILong

66.5 Accuracy

FoLoRA

May 28, 2026
Updated 23h ago

Evaluation Results

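| Method | Date | Accuracy | | | | | | | | Links |
|---|---|---|---|---|---|---|---|---|---|---|
| | 2026.05 | 66.5 | - | - | - | - | - | - | - | |
| | 2026.05 | 66.34 | - | - | - | - | - | - | - | |
| | 2026.05 | 66.17 | - | - | - | - | - | - | - | |
| | 2026.05 | 66.09 | - | - | - | - | - | - | - | |
| | 2026.05 | 66.07 | - | - | - | - | - | - | - | |
| | 2026.05 | 66.06 | - | - | - | - | - | - | - | |
| | 2026.05 | 65.98 | - | - | - | - | - | - | - | |
| | 2026.05 | 65.74 | - | - | - | - | - | - | - | |
| | 2026.05 | 65.57 | - | - | - | - | - | - | - | |
| | 2026.05 | 65.52 | - | - | - | - | - | - | - | |
| | 2026.05 | 65.23 | - | - | - | - | - | - | - | |
| | 2026.05 | 64.54 | - | - | - | - | - | - | - | |
| | 2025.12 | - | 17.7 | 16.1 | 9.1 | 9.4 | 5.9 | 7.8 | 11 | |
| | 2025.12 | - | 14.1 | 15.6 | 12.2 | 9.9 | 8.3 | 9.7 | 11.6 | |
| | 2025.12 | - | 19.8 | 19.8 | 16.1 | 15.8 | 12.3 | 12.8 | 16.1 | |
| | 2025.12 | - | 33.5 | 30.7 | 23.6 | 22 | 15.1 | 12.1 | 22.8 | |
| | 2025.12 | - | 31.9 | 26.5 | 18.6 | 16.2 | 11 | 12.2 | 19.4 | |
| | 2025.12 | - | 32.4 | 29.9 | 24.4 | 24.5 | 18.6 | 14.8 | 24.1 | |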