Share your thoughts, 1 month free Claude Pro on usSee more

Probabilistic Multiple-Choice on TruthfulQA (single info)

59.2MC1 Score

Mistral-Instruct-v0.2 + TACS-T

Updated 5mo ago

Evaluation Results

Method	Links
Mistral-Instruct-v0.2 + TACS-T 2024.03		59.2	69	44.8	57.7
Mistral-Instruct-v0.2 + TACS-S 2024.03		55.8	59.4	39.9	51.7
Mistral-Instruct-v0.2 2024.03		53.6	56.4	37	49
Llama 2-Chat + TACS-S 2024.03		50.8	57.8	33.7	47.5
Llama 2-Chat 2024.03		50.6	51.7	31.1	44.5
Llama 2-Chat + ITI 2024.03		50.6	51.2	30.5	44.1
Llama 2-Chat + TACS-T 2024.03		48.8	56.7	33.4	46.3