| LexEval | | Memorization 74.9 | | 35 | 1mo ago |
| CaseHOLD (test) | AutoAdapt | Test Accuracy 89.22 | | 22 | 2mo ago |
| LegalBench | evaluation-instructed prompt optimization | Accuracy 90 | | 18 | 1d ago |
| CaseHold | XPERT-DeepSeek | Accuracy (CaseHold) 83.13 | | 16 | 22d ago |
| legalbench | Ensemble | MAE 0.033 | | 16 | 23d ago |
| LegalBench Hearsay | SciDC (Qwen3-14B) | Accuracy 86.46 | | 16 | 1mo ago |
| LegalBench | Llama3.1-70B | Balanced Accuracy 79.3 | | 16 | 2d ago |
| LegalArg | Qwen3-4B-Instruct | Accuracy 65.42 | | 14 | 2mo ago |
| Law | ProbMoE | LLM-as-judge Score 34.4 | | 13 | 1d ago |
| Law | SeqTopK | Score 26.52 | | 13 | 2mo ago |
| BarExam MBE (test) | | Accuracy 82.1 | | 12 | 2mo ago |
| LexEval | Qwen2.5-7B + PPO w/ VERL. | Pass@1 57.1 | | 10 | 1mo ago |
| LEXam | Qwen2.5-7B + PPO w/ VERL. | Pass@1 Accuracy 23.4 | | 10 | 1mo ago |
| LegalBench Learned Hands Courts | MAD | Accuracy 75.5 | | 10 | 3mo ago |
| LawBench | MCE | Micro-F1 70 | | 10 | 3mo ago |
| LegalBench | ReConcile | Exact Match 69 | | 8 | 8d ago |
| CaseHold | AutoAdapt | Cumulative Score (CS) 96 | | 8 | 2mo ago |
| LegalBench CUAD Cardlytics Buffalo Wild Wings PF Hospitality 2023 | Agentic Adversarial QA | Accuracy (Cardl) 82.7 | | 6 | 3mo ago |
| Real-World Trust | LegalDrill 1.7B | Accuracy 90 | | 5 | 1mo ago |
| Real-World POA | LegalDrill 1.7B | Accuracy 92 | | 5 | 1mo ago |
| Law (test) | SeqTopK | Score 45.29 | | 5 | 2mo ago |
| LawBench | SIA-W+H | Top-1 Accuracy 70.1 | | 4 | 7d ago |
| LSAT (test) | RADAR | Hypervolume 0.9188 | | 4 | 2mo ago |