Knowledge Base Question Answering on GrailQA v1.0 (test)

Top Overall EM: 77.45 (RetinaQA)

[Chart: Overall EM over time, Dec 19, 2022 to Mar 16, 2024]
Updated 1mo ago

Evaluation Results

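| Date | Overall EM | | | | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|
| 2024.03 | 77.45 | - | - | - | - | - | - | - | 83.3 | 82.69 |
| 2022.12 | 75.4 | 81.7 | 84.4 | 88.8 | 74.6 | 81.5 | 71.6 | 78.5 | - | - |
| 2022.12 | 74.8 | 81.4 | 82.5 | 87.3 | 75.2 | 82.2 | 71 | 78.4 | - | - |
| 2024.03 | 73.76 | - | - | - | - | - | - | - | 81.8 | 80.78 |
| 2022.12 | 73.7 | 79.9 | 82.6 | 87.1 | 74.9 | 81.2 | 69.1 | 76.1 | - | - |
| 2022.12 | 73.6 | 79.9 | 84.7 | 88.8 | 73.1 | 80.1 | 68.6 | 75.8 | - | - |
| 2022.12 | 73 | 78.5 | 87.8 | 90.6 | 69.2 | 76.5 | 68 | 73.9 | - | - |
| 2022.12 | 69.5 | 74.6 | 85.5 | 88.5 | 65.1 | 71.1 | 64 | 69.8 | - | - |
| 2022.12 | 68.8 | 74.4 | 86.2 | 89 | 63.8 | 71.2 | 63 | 69.2 | - | - |
| 2022.12 | 68.4 | 78.7 | 84.8 | 89.9 | 73.4 | 81.8 | 58.6 | 72.3 | - | - |
| 2024.03 | 68.2 | - | - | - | - | - | - | - | 80.5 | 79.4 |
| 2024.03 | 67.8 | - | - | - | - | - | - | - | 77.8 | 77.1 |
| 2024.03 | 66.87 | - | - | - | - | - | - | - | 77.67 | 76.94 |
| 2024.03 | 66.53 | - | - | - | - | - | - | - | 79.14 | 77.89 |
| 2024.03 | 66.29 | - | - | - | - | - | - | - | 78.29 | 77.43 |
| 2024.03 | 66.14 | - | - | - | - | - | - | - | 78.29 | 76.91 |
| 2024.03 | 65.37 | - | - | - | - | - | - | - | 77.36 | 76.37 |
| 2024.03 | 64.79 | - | - | - | - | - | - | - | 77.31 | 75.71 |
| 2024.03 | 64.54 | - | - | - | - | - | - | - | 76.83 | 75.24 |
| 2022.12 | 63.8 | 73.7 | 85.6 | 88.9 | 65.8 | 75.3 | 52.9 | 66 | - | - |
| 2022.12 | 58.1 | 65.3 | 84.4 | 87.5 | 61.5 | 70.9 | 44.6 | 52.5 | - | - |
| 2024.03 | 57 | - | - | - | - | - | - | - | 67.6 | 65.8 |
| 2022.12 | 56.4 | 65 | 67.5 | 73.7 | 58.2 | 64.9 | 50.7 | 61.1 | - | - |
| 2024.03 | 55.23 | - | - | - | - | - | - | - | 73.26 | 71.62 |
| 2024.03 | 54.55 | - | - | - | - | - | - | - | 63.09 | 60.06 |
| 2024.03 | 53.89 | - | - | - | - | - | - | - | 74.89 | 73.53 |
| 2024.03 | 53.79 | - | - | - | - | - | - | - | 63.59 | 60.42 |
| 2024.03 | 53.69 | - | - | - | - | - | - | - | 75.05 | 72.84 |
| 2022.12 | 53.3 | 62.7 | 54.7 | 62.9 | 54.5 | 63.7 | 52.3 | 62.2 | - | - |
| 2024.03 | 52.46 | - | - | - | - | - | - | - | 64.68 | 62.58 |
| 2024.03 | 51.6 | - | - | - | - | - | - | - | 67.8 | 65.6 |
| | 50.6 | 58 | 59.9 | 67 | 45.5 | 53.9 | 48.6 | 55.7 | - | - |
| 2022.12 | 48.9 | 56.3 | 51.8 | 58.1 | 43.3 | 51.2 | 50.1 | 57.8 | - | - |
| 2022.12 | - | 36.7 | - | 40.5 | - | 33 | - | 36.6 | - | - |
| 2022.12 | - | 70.1 | - | - | - | - | - | - | - | - |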