Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Chatbot Evaluation on Vicuna benchmark

13,481Elo Rating

GPT-4

8,603.49,869.711,13612,402.3May 23, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.05
13,481
2023.05
10,221
2023.05
9,921
2023.05
9,741
2023.05
9,661
2023.05
9,161
2023.05
9,021
2023.05
8,791