Share your thoughts, 1 month free Claude Pro on usSee more

Closed-set Question Answering on PubHealth

74.5Accuracy

SELF-RAG

Updated 5mo ago

Evaluation Results

Method	Links
SELF-RAG 2023.10		74.5
SELF-RAG 2023.10		72.4
ChatGPT 2023.10		70.1
Llama2-FT 2023.10		64.3
Alpaca 2023.10		55.5
Ret-ChatGPT 2023.10		54.7
Ret-Llama2-Chat 2023.10		52.1
Alpaca 2023.10		51.1
Alpaca 2023.10		49.8
Llama2-Chat 2023.10		49.4
Alpaca 2023.10		40.2
Llama2 2023.10		34.2
Llama2 2023.10		30.2
Llama2 2023.10		30
Llama2 2023.10		29.4