Generative multiple-choice on TruthfulQA (single)
Best result: 78.1 Accuracy, TACS-S (Sentence-level)
[Chart: Accuracy by date; data point at Mar 12, 2024]
Evaluation Results
Method                     Details                     Date      Accuracy
TACS-S (Sentence-level)    Backbone=Mistral-Instr...   2024.03   78.1
TACS-T (Token-level)       Backbone=Mistral-Instr...   2024.03   77.1
TACS-T (Token-level)       Backbone=Llama 2-Chat,...   2024.03   62.5
TACS-S (Sentence-level)    Backbone=Llama 2-Chat,...   2024.03   60.6
Mistral-Instruct-v0.2      Backbone=Mistral-Instr...   2024.03   54.7
Llama 2-Chat               Backbone=Llama 2-Chat,...   2024.03   49.1