Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conversational AI Evaluation on Chatbot Arena

1Rank

GPT-4

0.245.3710.515.63Jan 29, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.01
1-72.851.594.1----
2026.01
1-110.697.8123.4----
2026.01
2-72.551.393.8----
2026.01
2-108.893.6123.9----
2026.01
3-94.282106.4----
2026.01
3-70.148.891.5----
2026.01
4-43.129.656.6----
2026.01
4-61.650.772.4----
2026.01
5-21.36.636----
2026.01
5-24.6346.2----
2026.01
6-16.5330----
2026.01
6-7.8-1.717.3----
2026.01
7-15.98.623.1----
2026.01
7-2-10.714.6----
2026.01
8--1-21.619.6----
2026.01
8-13.14.721.6----
2026.01
9-7.2-115.4----
2026.01
9--2.9-12.66.8----
2026.01
10--4.5-17.98.8----
2026.01
10--3.8-9.82.3----
2026.01
11--7.8-19.94.3----
2026.01
11--11.3-23.20.6----
2026.01
12--12.7-33.48.1----
2026.01
12--13-21.8-4.2----
2026.01
13--23-31.9-14----
2026.01
13--16.5-27.2-5.8----
2026.01
14--24.6-34.4-14.9----
2026.01
14--24.2-36.4-11.9----
2026.01
15--29.4-43.1-15.7----
2026.01
15--25.9-35.3-16.5----
2026.01
16--33.5-45.4-21.6----
2026.01
16--42.1-54.8-29.5----
2026.01
17--42.9-53.2-32.5----
2026.01
17--42.5-56.5-28.5----
2026.01
18--48.6-64.4-32.8----
2026.01
18--58.1-72.1-44----
2026.01
19--51.6-68.1-35----
2026.01
19--81.6-98.2-65----
2026.01
20--82.2-96.9-67.6----
2026.01
20--58.4-77.4-39.5----
2023.10
-1,047-------
2023.10
-1,031-------
2023.10
-1,012-------
2023.10
-1,041-------
2023.10
-985-------
2023.10
-997-------
2023.10
-914-------
-1,243-------
-1,117-------
2024.03
-1,077-------
2024.03
-1,110-------
2026.04
-1,753.749---69.891.572.70.877
-1,678.951---49.464.364.40.516
2026.04
-1,604.549---4961.264.60.513
-1,541.837---47.860.562.80.461
-1,624.634---47.283.168.20.691
-1,626.565---46.340.961.90.42
-1,626.607---4649.556.40.179
-1,505.213---44.545.856.10.091
2026.04
-1,531.531---39.241.153.70.083
2026.04
-1,402.303---37.752.950.1-0.053