Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on AmbigQA

4.96Helpfulness

Llama2

Updated 5mo ago

Evaluation Results

Method	Links
Llama2 2024.08		4.96	4.94	1.79	52
GPT3.5 2024.08		4.91	4.97	1.89	60
GPT4 2024.08		4.89	4.95	1.06	72
Claude 2024.08		4.89	4.94	1.36	62
Zephyr 2024.08		4.38	4.66	1.03	45