| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| PIQA | | Accuracy | 94.9 | 696 | 1d ago |
| PIQA (val) | Mistral-v0.1 | Accuracy | 83 | 118 | 14d ago |
| PIQA | Thoughts-as-Planning | Accuracy (PIQA) | 81.5 | 99 | 5d ago |
| PIQA | Fanar-27B | Accuracy | 85.91 | 78 | 1mo ago |
| PIQA (test) | UL20B | Accuracy | 90.7 | 59 | 16d ago |
| PIQA | HPTQ | Accuracy | 7,497 | 56 | 3mo ago |
| PIQA | | Accuracy | 82.54 | 45 | 19d ago |
| PiQA | CoA-LoRA | Accuracy | 76.56 | 45 | 1mo ago |
| PIQA | AutoPrompt | Delta Accuracy | 0 | 24 | 3mo ago |
| PIQA | LLAMA-4-SCOUT | PIQA Score | 81.1 | 16 | 5d ago |
| PIQA | LinFTPL | Mean Per-Step Regret | 0.152 | 15 | 3mo ago |
| PIQA | MobileLLM-Flash 1.4B | Character-level Accuracy | 75.52 | 11 | 2mo ago |
| PIQA | Qwen3-32B (FP16) | Accuracy | 93.74 | 10 | 21h ago |
| PIQA | | Accuracy (PIQA) | 79.8 | 10 | 2d ago |
| GlobalPIQA Estonian | Llama-EstLLM-8B-Instruct-CV | Accuracy | 68 | 10 | 3mo ago |
| PIQA | Phi-4 14B (w/ LoopUS) | Accuracy | 81.8 | 8 | 21d ago |
| PIQA | | Accuracy | 84 | 8 | 1mo ago |
| PIQA (val test) | LLaDA (Base) | Accuracy | 79.42 | 8 | 12d ago |
| PIQA | OPT-IML 175B | Accuracy (0-shot) | 79.8 | 6 | 3mo ago |
| PIQA | LSP | Score | 86.4 | 4 | 2mo ago |
| PIQA 5-shot | MoE 128e8a1s | Accuracy | 77.8 | 3 | 9d ago |
| PIQA | TBDF | Average Relative Improvement | 1.89 | 3 | 3mo ago |
| PIQA | | PIQA Score | 0.806 | 2 | 19d ago |
| PIQA | Flexora | Time (h) | 3.87 | 2 | 3mo ago |