Evaluation Benchmarks Zero-shot
[Chart: Average Accuracy over time. Top result: 71.76 (Dense), May 23, 2025.]
Evaluation Results
Method  Configuration               Date     Average Accuracy
Dense   Model=LLaMA2-13B, PR=0...   2025.05  71.76
TRSP    Model=LLaMA2-13B, PR=2...   2025.05  65.11
Dense   Model=OPT-13B, PR=0%,...    2025.05  61.79
TRSP    Model=OPT-13B, PR=25%,...   2025.05  60.84
TRSP    Model=LLaMA2-13B, PR=5...   2025.05  57.57
TRSP    Model=OPT-13B, PR=50%,...   2025.05  50.83