Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-turn conversation on Long-MT-Bench+

7.36Accuracy

Rhea

1.25522.84014.4256.0099Dec 7, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
7.3629.08
2025.12
6.6510.81
2025.12
6.3227.29
2025.12
6.07-
2025.12
5.0313.89
4.5531.79
2025.12
2.4323.73
2025.12
1.8811.87
2025.12
1.529.73
2025.12
1.4933.55