Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Commonsense Reasoning on PIQA (Accuracy, Normalized Accuracy)
Loading...
74
Accuracy
Nemotron-2 Approx.
71.816
72.383
72.95
73.517
May 15, 2026
Accuracy
Normalized Accuracy
Updated 16d ago
Evaluation Results
Method
Method
Links
Accuracy
Normalized Accuracy
Nemotron-2 Approx.
Shot=0-shot
2026.05
74
73.9
AIRAhybrid-A
Shot=0-shot, Architect...
2026.05
73.9
73.4
AIRAhybrid-D
Shot=0-shot, Architect...
2026.05
73.6
74.7
AIRAhybrid-B
Shot=0-shot, Architect...
2026.05
73.4
74
Nemotron-H Approx.
Shot=0-shot
2026.05
73.3
73.7
AIRAhybrid-B
Shot=0-shot, Architect...
2026.05
73.1
73.4
Mamba (Mb + M)
Shot=0-shot
2026.05
72.9
73.1
AIRAhybrid-C
Shot=0-shot, Architect...
2026.05
72.8
73.6
AIRAhybrid-D
Shot=0-shot, Architect...
2026.05
72.8
73.7
AIRAhybrid-E
Shot=0-shot, Architect...
2026.05
72.3
72.9
AIRAhybrid-E
Shot=0-shot, Architect...
2026.05
72.1
73.3
Composer (2Mb-M-3A)
Shot=0-shot
2026.05
71.9
73.9
Feedback
Search any
task
Search any
task