Conversational AI Evaluation on Chatbot Arena

Rank 1: GPT-4

[Chart, dated Jan 29, 2026]
Updated 3d ago

Evaluation Results

Method    Links

2026.01   1       -    72.8    51.5    94.1
2026.01   1       -   110.6    97.8   123.4
2026.01   2       -    72.5    51.3    93.8
2026.01   2       -   108.8    93.6   123.9
2026.01   3       -    94.2      82   106.4
2026.01   3       -    70.1    48.8    91.5
2026.01   4       -    43.1    29.6    56.6
2026.01   4       -    61.6    50.7    72.4
2026.01   5       -    21.3     6.6      36
2026.01   5       -    24.6       3    46.2
2026.01   6       -    16.5       3      30
2026.01   6       -     7.8    -1.7    17.3
2026.01   7       -    15.9     8.6    23.1
2026.01   7       -       2   -10.7    14.6
2026.01   8       -      -1   -21.6    19.6
2026.01   8       -    13.1     4.7    21.6
2026.01   9       -     7.2      -1    15.4
2026.01   9       -    -2.9   -12.6     6.8
2026.01  10       -    -4.5   -17.9     8.8
2026.01  10       -    -3.8    -9.8     2.3
2026.01  11       -    -7.8   -19.9     4.3
2026.01  11       -   -11.3   -23.2     0.6
2026.01  12       -   -12.7   -33.4     8.1
2026.01  12       -     -13   -21.8    -4.2
2026.01  13       -     -23   -31.9     -14
2026.01  13       -   -16.5   -27.2    -5.8
2026.01  14       -   -24.6   -34.4   -14.9
2026.01  14       -   -24.2   -36.4   -11.9
2026.01  15       -   -29.4   -43.1   -15.7
2026.01  15       -   -25.9   -35.3   -16.5
2026.01  16       -   -33.5   -45.4   -21.6
2026.01  16       -   -42.1   -54.8   -29.5
2026.01  17       -   -42.9   -53.2   -32.5
2026.01  17       -   -42.5   -56.5   -28.5
2026.01  18       -   -48.6   -64.4   -32.8
2026.01  18       -   -58.1   -72.1     -44
2026.01  19       -   -51.6   -68.1     -35
2026.01  19       -   -81.6   -98.2     -65
2026.01  20       -   -82.2   -96.9   -67.6
2026.01  20       -   -58.4   -77.4   -39.5
2023.10   -   1,047       -       -       -
2023.10   -   1,031       -       -       -
2023.10   -   1,012       -       -       -
2023.10   -   1,041       -       -       -
2023.10   -     985       -       -       -
2023.10   -     997       -       -       -
2023.10   -     914       -       -       -
          -   1,243       -       -       -
          -   1,117       -       -       -
2024.03   -   1,077       -       -       -
2024.03   -   1,110       -       -       -