Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language Modeling on ProofPile (16K)
Loading...
3.24
Perplexity
SHAREDLLM
3.202
3.4585
3.715
3.9715
Mar 5, 2026
Perplexity
Updated 1mo ago
Evaluation Results
Method
Method
Links
Perplexity
SHAREDLLM
Base Model=LLaMA-2, Su...
2026.03
3.24
Activation Beacon
Base Model=LLaMA-2, Su...
2026.03
3.34
LongAlpaca-16K
Base Model=Mistral-7B,...
2026.03
3.34
LongAlpaca-16K
Base Model=LLaMA-2, Su...
2026.03
3.37
SHAREDLLM
Base Model=Mistral-7B,...
2026.03
3.38
StreamingLLM
Base Model=LLaMA-2, Su...
2026.03
3.51
Activation Beacon
Base Model=Mistral-7B,...
2026.03
3.64
StreamingLLM
Base Model=Mistral-7B,...
2026.03
4.19
Feedback
Search any
task
Search any
task