| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Bitcoin Simulation | Verification Time (ms)22.9609 | 54 | 4d ago | ||
| SCI-Bench | CoSineVerifier-Tool-4B | Mean@3 Accuracy89.7 | 18 | 4d ago | |
| VerifyBench Hard 1.0 (test) | CoSineVerifier-Tool-4B | Mean@3 Accuracy91.9 | 18 | 4d ago | |
| VerifyBench 1.0 (test) | CoSineVerifier-Tool-4B | m@3 Accuracy96.6 | 18 | 4d ago | |
| Simulation Distribution Shift (test) | AUC88.3 | 18 | 4d ago | ||
| Simulation In Distribution (test) | VarAUC | AUC95 | 18 | 4d ago | |
| GSM8K | CRV | AUROC70.17 | 11 | 4d ago | |
| Synthetic Arithmetic | CRV | AUROC92.47 | 11 | 4d ago | |
| Synthetic Boolean | CRV | AUROC75.87 | 11 | 4d ago | |
| Ethereum Simulation | Verification Time (ms)2.9024 | 10 | 4d ago | ||
| Voice-Face F-V Gender-restricted | Ours | AUC76.1 | 8 | 4d ago | |
| Voice-Face Gender-restricted | Ours | AUC0.775 | 8 | 4d ago | |
| Voice-Face F-V, Unrestricted | Ours | AUC87 | 8 | 4d ago | |
| Voice-Face Unrestricted | Ours | AUC87.2 | 8 | 4d ago | |
| Weather | MLPLir | Time (s)0.18 | 4 | 4d ago | |
| Prosthetic | MLPLir | Execution Time (s)0.19 | 4 | 4d ago | |
| PINNHeat | MLPLir | Time (s)0.43 | 4 | 4d ago | |
| F-exp100 | Time (s)3.16 | 4 | 4d ago | ||
| F-exp4 | MLPLir | Latency (s)1.16 | 4 | 4d ago | |
| F-exp | Execution Time (s)0.22 | 4 | 4d ago | ||
| F-xy | Latency (s)0.02 | 4 | 4d ago | ||
| F-Bessel | Latency (s)0.24 | 4 | 4d ago |