Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Physical Commonsense Reasoning on PIQA (Character-level Accuracy)
Loading...
75.52
Character-level Accuracy
MobileLLM-Flash 1.4B
68.0528
69.9914
71.93
73.8686
Mar 16, 2026
Character-level Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Character-level Accuracy
MobileLLM-Flash 1.4B
Parameter Count=1.4B,...
2026.03
75.52
Nemotron-Flash 1B
Parameter Count=1B, Ev...
2026.03
75.41
Llama3.2 1B
Parameter Count=1B, Ev...
2026.03
75.14
LFM2 1.2B
Parameter Count=1.2B,...
2026.03
74.27
Gemma3 1B
Parameter Count=1B, Ev...
2026.03
73.8
MobileLLM-Flash 650M
Parameter Count=650M,...
2026.03
71.82
LFM2 700M
Parameter Count=700M,...
2026.03
71.11
MobileLLM-Flash 350M
Parameter Count=350M,...
2026.03
70.08
Qwen3 0.6B
Parameter Count=0.6B,...
2026.03
69.86
LFM2 350M
Parameter Count=350M,...
2026.03
69.48
Gemma3 270M
Parameter Count=270M,...
2026.03
68.34
Feedback
Search any
task
Search any
task