| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| LawBench revised (test) | JurisMA | ROUGE-1 | 44.68 | 17 | 1mo ago |
| JEC-QA | Kimi-K2-thinking | KD Score | 69.49 | 15 | 3mo ago |
| AIBE | Legal Assist AI | AIBE Score | 60.08 | 10 | 1mo ago |
| Japanese Bar Examination 2024 (Reiwa 6) | Ours | Overall Accuracy | 49.35 | 9 | 3mo ago |
| DISC-Law-SFT (test) | LLM-AutoDP | Win Rate | 90.07 | 6 | 3mo ago |
| Competition Law Question Bank Case-specific 1.0 (test) | Maat | Average Expert Rating | 4.6 | 5 | 7d ago |
| Competition Law Question Bank Theoretical 1.0 (test) | | Average Expert Rating | 3.5 | 5 | 7d ago |
| Legal Task QA (test) | LEGALMIDM-11B | ROUGE-L | 17.74 | 5 | 1mo ago |
| Bar Exam QA | L-MARS | Accuracy | 55.9 | 5 | 2mo ago |
| LegalSearchQA (50 questions) | L-MARS (Simple) | Accuracy | 96 | 3 | 2mo ago |
| Curated Indian Legal Knowledge Base | NyayaAI | Response Accuracy | 72 | 1 | 22d ago |