Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Knowledge Evaluation on MMLU Computer Security
Loading...
46
Accuracy
NPO+KL w/ RNA
28.32
32.91
37.5
42.09
Jan 31, 2025
Accuracy
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
NPO+KL w/ RNA
Model Backbone=Llama-3...
2025.01
46
RMU w/ RNA
Model Backbone=Mistral...
2025.01
46
NPO+KL
Model Backbone=Mistral...
2025.01
35
RMU w/ RNA
Model Backbone=Llama-3...
2025.01
33
RMU
Model Backbone=Mistral...
2025.01
33
NPO+KL
Model Backbone=Llama-3...
2025.01
30
NPO+KL w/ RNA
Model Backbone=Mistral...
2025.01
30
RMU
Model Backbone=Llama-3...
2025.01
29
Feedback
Search any
task
Search any
task