Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language Modeling on Six-source heterogeneous dataset (test)
Loading...
2.54
Perplexity
Random Ranking
2.5384
2.5492
2.56
2.5708
Sep 19, 2025
Perplexity
Updated 1mo ago
Evaluation Results
Method
Method
Links
Perplexity
Random Ranking
Model=Pythia 6.9B, Dat...
2025.09
2.54
Random Ranking
Model=Pythia 6.9B, Dat...
2025.09
2.55
Random Ranking
Model=Pythia 6.9B, Dat...
2025.09
2.55
Dropout
Model=Pythia 6.9B, Dat...
2025.09
2.55
Random Ranking
Model=Pythia 6.9B, Dat...
2025.09
2.56
Dropout
Model=Pythia 6.9B, Dat...
2025.09
2.56
Dropout
Model=Pythia 6.9B, Dat...
2025.09
2.57
Dropout
Model=Pythia 6.9B, Dat...
2025.09
2.58
Feedback
Search any
task
Search any
task