Generative Question Answering on Bolmo Evaluation Suite GenQA 7B

81.6 GenQA Average

Llama 3.1 70B

Updated 4d ago

Evaluation Results

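| Method | Date | GenQA Average | | | | | | | | | | Links |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | 2025.12 | 81.6 | 88.4 | 91.7 | 79.6 | 92.4 | 78.3 | 84 | 53.1 | 92.9 | 73.9 | |
| | 2025.12 | 80.3 | 87.2 | 90.5 | 76.7 | 91.1 | 76.5 | 80.5 | 55.1 | 94.4 | 70.7 | |
| | 2025.12 | 79.8 | 84.8 | 90.3 | 75.7 | 93.5 | 80.9 | 75.3 | 49 | 94.5 | 74.1 | |
| | 2025.12 | 79.1 | 87.5 | 89.4 | 77 | 88.7 | 76.3 | 79.1 | 51.4 | 94 | 68.7 | |
| | 2025.12 | 78 | 86.2 | 90.8 | 79.3 | 91.9 | 74.9 | 80.3 | 45.1 | 92.6 | 61.1 | |
| | 2025.12 | 77.1 | 87.6 | 88.9 | 76.8 | 90.6 | 69.7 | 79.8 | 47.6 | 91.2 | 61.5 | |
| | 2025.12 | 76 | 84.8 | 89.3 | 76.1 | 96 | 76.1 | 77.4 | 30.7 | 89.1 | 64.4 | |
| | 2025.12 | 75.9 | 84 | 88.6 | 73.9 | 85.6 | 73 | 72.7 | 42.6 | 93.4 | 69.5 | |
| | 2025.12 | 75.6 | 81.8 | 88.8 | 76.3 | 89.3 | 68.2 | 75.1 | 40.4 | 88.8 | 71.5 | |
| | 2025.12 | 75 | 84.5 | 87.7 | 74.8 | 87.5 | 56.3 | 77.2 | 43.1 | 90.7 | 72.8 | |
| | 2025.12 | 73.5 | 86 | 91.3 | 77.5 | 94.9 | 75.9 | 82.1 | 49.2 | 92.4 | 12.4 | |
| | 2025.12 | 73.1 | 81.5 | 87.3 | 75.5 | 88 | 59.5 | 70.9 | 36.7 | 89.2 | 69 | |
| | 2025.12 | 72.9 | 86.7 | 90.8 | 76.9 | 93.2 | 73.2 | 80.7 | 47.1 | 93 | 14.9 | |
| | 2025.12 | 72.5 | 77.7 | 85.7 | 68.9 | 89.5 | 71.5 | 60.4 | 32.6 | 93.5 | 72.8 | |
| | 2025.12 | 72.4 | 82.2 | 87.4 | 70.5 | 82.2 | 61.5 | 70.8 | 37.4 | 91.5 | 68.3 | |
| | 2025.12 | 71.8 | 80.2 | 86.2 | 67.9 | 91.4 | 71.4 | 64.9 | 31.2 | 92.3 | 60.4 | |
| | 2025.12 | 71.4 | 80.6 | 86.5 | 73.1 | 89.7 | 69.3 | 65.6 | 33.1 | 90.3 | 54.4 | |
| | 2025.12 | 71.1 | 80.5 | 86.4 | 73 | 93.5 | 57.2 | 65.1 | 33.8 | 89.2 | 61.6 | |
| | 2025.12 | 69 | 81 | 85.8 | 70.9 | 83.8 | 37.1 | 70.1 | 35 | 89.6 | 67.4 | |
| | 2025.12 | 68.5 | 86.3 | 87.5 | 76.2 | 94.2 | 53.7 | 74 | 39.3 | 64.9 | 40.4 | |
| | 2025.12 | 67.8 | 83.7 | 89.4 | 76 | 88.7 | 38.4 | 69.7 | 37 | 89.6 | 37.8 | |
| | 2025.12 | 67.5 | 81 | 86 | 70.3 | 91.4 | 56.7 | 63 | 31.2 | 87 | 40.5 | |
| | 2025.12 | 65.3 | 75.2 | 80.3 | 58.3 | 83.2 | 59.4 | 58.9 | 33.5 | 89.3 | 49.8 | |
| | 2025.12 | 63.3 | 73.9 | 76.4 | 67 | 80.5 | 54.9 | 55.5 | 28.8 | 86 | 46.7 | |
| | 2025.12 | 0.724 | 0.778 | 0.857 | 0.68 | 0.9 | 0.715 | 0.603 | 0.326 | 0.935 | 0.727 | |
| | 2025.12 | 0.713 | 0.828 | 0.882 | 0.705 | 0.896 | 0.486 | 0.683 | 0.343 | 0.886 | 0.711 | |
| | 2025.12 | 0.709 | 0.788 | 0.855 | 0.711 | 0.896 | 0.652 | 0.568 | 0.286 | 0.916 | 0.705 | |
| | 2025.12 | 0.684 | 0.811 | 0.882 | 0.728 | 0.845 | 0.388 | 0.673 | 0.292 | 0.852 | 0.686 | |
| | 2025.12 | 0.414 | 0.701 | 0.782 | 0.629 | 0.827 | 0.078 | 0.131 | 0.054 | 0.359 | 0.167 | |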