Interactive Question Answering on IQA-EVAL MMLU-derived (TextBabbage)
[Chart: metric trends over time (Helpfulness, Fluency, Avg Queries, Accuracy); best Helpfulness 3.87 by IQA-EVAL-GPT3.5, as of Aug 24, 2024]
Evaluation Results

| Method | Evaluator Backbone | Links | Helpfulness | Fluency | Avg Queries | Accuracy |
|---|---|---|---|---|---|---|
| IQA-EVAL-GPT3.5 | GPT... | 2024.08 | 3.87 | 3.67 | 1.77 | 47 |
| Human | — | 2024.08 | 3.84 | 3.84 | 2.57 | 52 |
| IQA-EVAL-Claude | Claude | 2024.08 | 3.03 | 3.47 | 2.67 | 53 |
| IQA-EVAL-GPT4 | GPT-4 | 2024.08 | 2.3 | 3.87 | 2.27 | 83 |
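For readers who want to work with these results programmatically, the table can be encoded as plain records and ranked by any metric. This is a minimal sketch; the field names are our own choice (they simply mirror the table columns), and the values are copied verbatim from the table above.

```python
# Evaluation results from the table above, one dict per method.
# Field names are illustrative; values copied from the leaderboard.
rows = [
    {"method": "IQA-EVAL-GPT3.5", "helpfulness": 3.87, "fluency": 3.67, "avg_queries": 1.77, "accuracy": 47},
    {"method": "Human",           "helpfulness": 3.84, "fluency": 3.84, "avg_queries": 2.57, "accuracy": 52},
    {"method": "IQA-EVAL-Claude", "helpfulness": 3.03, "fluency": 3.47, "avg_queries": 2.67, "accuracy": 53},
    {"method": "IQA-EVAL-GPT4",   "helpfulness": 2.3,  "fluency": 3.87, "avg_queries": 2.27, "accuracy": 83},
]

# Rank methods by accuracy, highest first.
by_accuracy = sorted(rows, key=lambda r: r["accuracy"], reverse=True)
print([r["method"] for r in by_accuracy])
# → ['IQA-EVAL-GPT4', 'IQA-EVAL-Claude', 'Human', 'IQA-EVAL-GPT3.5']
```

Note how the rankings differ per metric: IQA-EVAL-GPT4 leads on accuracy, while IQA-EVAL-GPT3.5 leads on helpfulness.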