Share your thoughts, 1 month free Claude Pro on usSee more

Generative Multiple-choice Question Answering on TruthfulQA

76.3TA Rate

Llama 2-Chat

Updated 5mo ago

Evaluation Results

Method	Links
Llama 2-Chat 2024.03		76.3	13.7	45
Mistral-Instruct-v0.2 2024.03		75.4	22.6	49
Mistral-Instruct-v0.2 + TACS-S 2024.03		46.3	91.4	68.9
Mistral-Instruct-v0.2 + TACS-T 2024.03		44.9	89.6	67.2
Llama 2-Chat + TACS-S 2024.03		43.7	74.9	64.3
Llama 2-Chat + TACS-T 2024.03		43.4	85.8	64.7