Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Instruction Following on IFEval strict instance
Loading...
74.12
Accuracy
T2M
22.848
36.159
49.47
62.781
Feb 13, 2026
Feb 21, 2026
Mar 2, 2026
Mar 11, 2026
Mar 19, 2026
Mar 28, 2026
Apr 6, 2026
Accuracy
Updated 7d ago
Evaluation Results
Method
Method
Links
Accuracy
T2M
Backbone=LLaDA2.1-mini...
2026.04
74.12
Original (T2T)
Backbone=LLaDA2.1-mini...
2026.04
73.01
Memory Transformer (Combined objective)
Objective=Combined, dm...
2026.02
25.06
Memory Transformer (Causal objective)
Objective=Causal, dm=5...
2026.02
24.82
Feedback
Search any
task
Search any
task