Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pointwise Grading on AlignBench

0.997Pearson (r)

GPT-4

0.005880.263190.52050.77781Nov 30, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.11
0.9970.9760.929
2023.11
0.99511
2023.11
0.99511
2023.11
0.9840.9760.929
2023.11
0.9720.9760.929
2023.11
0.9550.9760.929
2023.11
0.9540.9760.929
2023.11
0.9010.9290.786
2023.11
0.8630.9290.786
2023.11
0.8540.9290.786
2023.11
0.790.8330.643
2023.11
0.7780.8330.643
2023.11
0.7720.8330.643
2023.11
0.7170.9050.786
2023.11
0.6630.5270.4
2023.11
0.6630.6670.5
2023.11
0.6290.5830.532
2023.11
0.5580.5710.5
2023.11
0.5550.5230.477
2023.11
0.5470.4290.286
2023.11
0.5440.5480.429
2023.11
0.5230.4940.447
2023.11
0.4740.4710.426
2023.11
0.450.430.391
2023.11
0.4430.4210.379
2023.11
0.3730.3790.358
2023.11
0.3660.3520.319
2023.11
0.3020.3060.282
2023.11
0.2920.2870.266
2023.11
0.2550.2540.239
2023.11
0.2230.2220.207
2023.11
0.1990.20.187
2023.11
0.170.1620.155
2023.11
0.1590.150.14
2023.11
0.1520.1620.109
2023.11
0.1250.1170.11
2023.11
0.1230.1220.113
2023.11
0.0440.0450.041