Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Question Answering on KILT Benchmark Natural Questions, TriviaQA, HotpotQA (test)

85.6EM Score (TriviaQA)

Llama3-ChatQA-1.5-70B

37.03249.64162.2574.859Jan 18, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.01
85.658.74742.2
2024.01
82.453.642.735.6
2024.01
8152.342.433.5
2024.01
75.450.135.239.7
2024.01
72.644.528.832
2024.01
70.742.530.926
2024.01
65.7-29.6-
2024.01
5942.13730.4
2024.01
56.939.426.734.7
2024.01
38.9--65.6