Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on HotpotQA (LLM Accuracy and EM)

36.3Exact Match (EM)

LUMOS-IQA

Updated 4mo ago

Evaluation Results

Method	Links
LUMOS-IQA 2023.11		36.3	57.4
LUMOS-IQA 2023.11		36	56.8
ReAcT 2023.11		32.4	40.8
LUMOS-IQA 2023.11		31.4	50.2
ReWOO 2023.11		30.4	42.4
LUMOS-IQA 2023.11		29.4	45.9
FiReAct 2023.11		27.8	-
FiReAct 2023.11		26.2	-
LUMOS-OQA 2023.11		24.9	39.2
LUMOS-IQA 2023.11		23.5	37.3
GPT-3.5-CoT 2023.11		22.4	37.8
AgentLM 2023.11		22.3	-
ReWOO-open 2023.11		-	37