Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dialogue Evaluation on TopicalChat (CV & Rho Metrics)

0.48CV (Understandability)

LLaMA-4-Scout

0.0120.13350.2550.3765Jan 31, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.480.430.330.810.330.820.110.80.180.73----------
2026.01
0.270.340.230.870.170.860.160.860.30.71----------
2026.01
0.210.460.490.710.250.810.210.70.10.71----------
2026.01
0.20.490.110.80.170.810.20.790.120.7----------
2026.01
0.180.390.290.850.270.790.150.780.170.72----------
2026.01
0.160.480.280.570.140.840.520.720.070.71----------
2026.01
0.030.530.180.870.130.850.210.870.180.73----------
2026.01
----------2.460.331.710.481.790.452.210.422.510.56
2026.01
----------2.590.333.190.542.410.553.070.442.380.56
----------2.590.342.520.552.010.432.160.432.410.57
2026.01
----------2.560.362.120.411.930.432.080.512.460.54
2026.01
----------2.890.361.960.51.970.532.210.542.380.56
2026.01
----------2.290.312.310.51.880.432.050.442.470.57
2026.01
----------2.750.332.350.411.970.432.750.512.460.55