General Language Modeling on BIG-Bench (test)
Best model accuracy: 83.6
[Chart: Accuracy by date (Oct 26, 2025). Reported metrics: Accuracy, Number of Layers Dropped, Relative Inference Speed.]
Updated 22d ago
Evaluation Results
Method                                             Accuracy   Number of Layers Dropped   Relative Inference Speed
Best Model (Backbone LLM=LLaMA 3.1..., 2025.10)    83.6       5                          -14.4
Best Model (Backbone LLM=Mistral 7..., 2025.10)    76.4       9                          -28