Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM Fine-Tuning Data Selection on DATE-LM MMLU, GSM8K, BBH (test)
Loading...
61.87
MMLU Accuracy
BipCov (ref-aligned)
59.4884
60.1067
60.725
61.3433
Feb 6, 2025
MMLU Accuracy
GSM8K Accuracy
BBH Accuracy
Average Score
Updated 2d ago
Evaluation Results
Method
Method
Links
MMLU Accuracy
GSM8K Accuracy
BBH Accuracy
Average Score
BipCov (ref-aligned)
pipeline=DATE-LM, seed...
2025.02
61.87
63.53
66.27
63.89
Random
pipeline=DATE-LM, seed...
2025.02
60
61.89
66.58
62.82
RDS+
pipeline=DATE-LM, seed...
2025.02
59.58
62.24
66.8
62.88
Feedback
Search any
task
Search any
task