
Factual Question Answering on ID Datasets Average

Precision: 70.82

UALIGN

Dec 16, 2024
Updated 1mo ago

Evaluation Results

Date      Precision  (unlabeled)
2024.12   70.82      41.79
2024.12   70.72      43.71
2024.12   70.16      44.25
2024.12   68.43      40.56
2024.12   68.11      41.04
2024.12   68.11      39.45
2024.12   68.03      41.72
2024.12   67.72      39.68
2024.12   67.69      39.06
2024.12   67.45      41.01
2024.12   66.21      38.43
2024.12   65.62      40.32
2024.12   65.52      38.83
2024.12   65.45      40.95
2024.12   64.32      39.11
2024.12   64.22      38.91
2024.12   64.04      39.48
2024.12   63.74      37.08
2024.12   63.19      37.06
2024.12   62.48      36.2
2024.12   62.21      38.3
2024.12   61.46      38.18
2024.12   60.98      39.27
2024.12   59.49      36.29