Language Modeling on NLP Benchmark Suite Aggregate
[Chart: Average Delta. Highlighted entry: LoFIT, -9.2, Feb 28, 2026.]
Evaluation Results
| Method | Configuration | Date | Average Delta |
|---|---|---|---|
| LoFIT | Model=Llama-3.1-8B, Pa... | 2026.02 | -9.2 |
| JoLA | Model=Llama-3.1-8B, Pa... | 2026.02 | -8.9 |
| LoFIT | Model=Llama-3.2-1B, Pa... | 2026.02 | -7.3 |
| JoLA | Model=Llama-3.2-1B, Pa... | 2026.02 | -5.9 |
| JoLA | Model=gemma-3-1b, Para... | 2026.02 | -5.5 |
| Ours (1-vec) | Model=gemma-3-1b, Para... | 2026.02 | -4.7 |
| LoFIT | Model=gemma-3-1b, Para... | 2026.02 | -4.5 |
| JoLA | Model=Qwen 3 4B, Param... | 2026.02 | -3.5 |
| Ours (1-vec) | Model=Qwen 3 4B, Param... | 2026.02 | -3.4 |
| Ours r=1 | Model=Llama-3.2-1B, Pa... | 2026.02 | -2.8 |
| Ours (1-vec) | Model=Llama-3.2-1B, Pa... | 2026.02 | -2.6 |
| Ours r=1 | Model=Qwen 3 4B, Param... | 2026.02 | -2.4 |
| LoFIT | Model=Qwen 3 4B, Param... | 2026.02 | -2.3 |
| Ours (1-vec) | Model=Llama-3.1-8B, Pa... | 2026.02 | -1.7 |
| Ours r=1 | Model=gemma-3-1b, Para... | 2026.02 | -1.5 |
| Ours r=1 | Model=Llama-3.1-8B, Pa... | 2026.02 | -0.9 |