| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Large Language Model Evaluation | Open PL LLM Leaderboard instruction-tuned | Overall Average Score: 69.84 | 44 |
| Linguistic Implicatures Decoding | Open PL LLM Leaderboard Implicatures component base models | Average Score: 67.38 | 30 |