
Gemma

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Circuit Discovery Evaluation | Gemma-2-2B | Clarity: 82 | 70 |
| Automated Interpretability Evaluation | Gemma-2-2B | Clarity: 80 | 50 |
| Jailbreak Attack | Gemma 4B 3 | NR: 66 | 20 |
| Jailbreak attack | Gemma-7b five finetuned variants | Average ASR: 66.2 | 16 |
| Jailbreak Attack | gemma-7b v1 (pretrained) | ASR: 6 | 13 |
| LLM Alignment | Gemma-3-4B | Win Rate: 94.33 | 12 |
| LLM fingerprinting | Gemma 2 2B | AUC: 1 | 10 |
| Language Modeling | Gemma 3 | Accuracy: 47.06 | 10 |
| Jailbreak Attack | Gemma-3 27B-it | ASR: 92 | 9 |
| Neuron Description | Gemma 2 | Faithfulness: 47 | 8 |
| Output-based feature description evaluation | Gemma-2 MLP SAE features | Score: 49.9 | 8 |
| Output-based feature description evaluation | Gemma-2 Residual SAE features | Score: 66.9 | 8 |
| Debiasing | Gemma-3-4b-it (test) | Mean Log-Likelihood Difference: 5.07 | 6 |
| Multi-path Speculative Decoding | Gemma (test) | Throughput (tokens/s): 13.26 | 6 |
| Chat Fine-tuning | Gemma 1B Chat | vNMSE: 0.0012 | 6 |
| LLM Attack Effectiveness | Gemma3 12B-it | TTFT (s): 0.13 | 6 |
| Multi-path speculative decoding | Gemma held-out (test) | Throughput Ratio Improvement: 2.17 | 5 |
| Adversarial Attack | Gemma 4B-it 3 | ASR: 25 | 5 |
| Opaque Serial Depth Calculation | Gemma 3 | Final Depth Formula: 11,322 | 4 |
| Explanation Evaluation | Gemma vision encoder later layer SAE 3 (test) | IoU (Masks): 20.4 | 3 |
| Model Lineage Attestation | Gemma family | TPR: 98 | 1 |