Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-context Instruction Following on LIFBench
Loading...
64.4
List Score
DIRECTER
54.832
57.316
59.8
62.284
Mar 6, 2026
List Score
OD Score
MD Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
List Score
OD Score
MD Score
DIRECTER
Method Group=Ours
2026.03
64.4
70
51.7
*-marked
Method Group=Prompting...
2026.03
64.3
66.9
44.9
Zero-shot
Method Group=Baseline,...
2026.03
63.4
68.6
40.9
"-marked
Method Group=Prompting...
2026.03
63.4
69.9
41
PASTA*
Method Group=Steering,...
2026.03
61.8
66
47.8
SpotLight*
Method Group=Steering,...
2026.03
61.4
70.8
38.8
PASTA
Method Group=Steering,...
2026.03
61.1
62.8
22.5
Few-shot
Method Group=Prompting...
2026.03
55.5
57.7
42.2
SpotLight
Method Group=Steering,...
2026.03
55.2
56.3
36.8
Feedback
Search any
task
Search any
task