
Generative multiple-choice on TruthfulQA (single)

Accuracy: 78.1

TACS-S (Sentence-level)

Mar 12, 2024
Updated 4d ago

Evaluation Results

Date      Accuracy
2024.03   78.1
2024.03   77.1
2024.03   62.5
2024.03   60.6
2024.03   54.7
2024.03   49.1