Share your thoughts, 1 month free Claude Pro on usSee more

Multi-hop Reasoning on CommaQA-E compositional

80.8Exact Match

ChatGPT (SKiC)

Updated 4mo ago

Evaluation Results

Method	Links
ChatGPT (SKiC) 2023.08		80.8
text-davinci-003 (SKiC) 2023.08		74.8
ChatGPT (Decomp) 2023.08		73.5
text-davinci-003 (Decomp) 2023.08		66.6
LLAMA-65B (SKiC) 2023.08		52
ChatGPT (CoT) 2023.08		46.4
LLAMA-65B (Decomp) 2023.08		40.4
ChatGPT (4-shots) 2023.08		40.3
text-davinci-003 (CoT) 2023.08		38.2
text-davinci-003 (4-shots) 2023.08		33.5
LLAMA-65B (CoT) 2023.08		30.8
ChatGPT (zero-shot) 2023.08		30.6
text-davinci-003 (zero-shot) 2023.08		26.8
LLAMA-65B (4-shots) 2023.08		24.6
LLAMA-65B (zero-shot) 2023.08		16.3