Multi-hop Reasoning on CommaQA-E (test)

Exact Match: 70

ChatGPT (SKiC)

Updated 4mo ago

Evaluation Results

Method	Date	Exact Match
ChatGPT (SKiC)	2023.08	70
text-davinci-003 (SKiC)	2023.08	66
ChatGPT (Decomp)	2023.08	64
text-davinci-003 (Decomp)	2023.08	58
ChatGPT (CoT)	2023.08	55
ChatGPT (4-shots)	2023.08	47
LLAMA-65B (SKiC)	2023.08	44
text-davinci-003 (CoT)	2023.08	44
text-davinci-003 (4-shots)	2023.08	42
ChatGPT (zero-shot)	2023.08	42
text-davinci-003 (zero-shot)	2023.08	34
LLAMA-65B (Decomp)	2023.08	32
LLAMA-65B (CoT)	2023.08	27
LLAMA-65B (4-shots)	2023.08	15
LLAMA-65B (zero-shot)	2023.08	12