| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Helpfulness Evaluation | Manual Evaluation Set | Average Helpfulness Score4.57 | 24 | |
| Safety Evaluation | Manual Evaluation Set | Average Safety Score3.83 | 12 | |
| Actionable Suggestion Extraction | Manual evaluation set 1.0 (test) | BERTScore92 | 4 |