| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LongLaMP | RankGPT (Llama-3-8B-Instruct) | R130.8 | 16 | 4d ago | |
| USC SF | GPT4 | Average Aggregate Score0.337 | 13 | 4d ago | |
| Human Baseline | GPT4 | Avg Aggregate Score0.612 | 8 | 4d ago | |
| Multi-News | GPT4 | Average Aggregate Score0.524 | 8 | 4d ago |