Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Conversational Question Answering on CHATRAG BENCH 1.0 (test)

57.14Average Score (w/o HDial)

Llama3-ChatQA-1.5-70B

37.192842.371447.5552.7286Jan 18, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.01
57.1458.2541.2638.8251.478.4450.7681.8883.8255.6368.2732.31
2024.01
54.7254.0335.3540.151.4677.7341.684.1679.9848.3247.8633.75
2024.01
54.3553.934.1640.2952.0177.4243.3981.2879.2145.0949.8136.34
2024.01
53.9955.1739.3339.7349.0376.4649.678.4673.2849.9665.7630.1
2024.01
53.8954.1438.941.8248.0578.5751.9473.6969.1450.9856.4431.9
2024.01
52.9552.5237.8836.9651.3476.9841.2476.669.6149.7248.5936.23
2024.01
51.450.9333.5134.1649.7769.7140.6771.2174.0753.7746.735.76
2024.01
50.6950.3734.8337.1750.4679.3341.1173.1560.6344.347.4235.27
2024.01
46.9647.7137.8829.6946.9776.6141.5751.6161.8745.4554.5130.96
2024.01
46.7646.733.5933.645.775.2637.3358.0559.7244.9646.232.59
2024.01
44.6445.2136.8732.4749.480.4138.9746.8537.6244.3150.3534.88
2024.01
37.9638.8633.2725.8346.0272.2833.1536.5826.1436.6847.0231.67