Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering on TruthfulQA MC1

88.8MC1 Accuracy

LinUCB

16.20835.05453.972.746Jul 24, 2025Aug 28, 2025Oct 3, 2025Nov 8, 2025Dec 13, 2025Jan 18, 2026Feb 23, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
88.88.1
2026.02
88.78
2026.02
85.24.5
2026.02
83.42.7
2026.02
81.40.7
2026.02
81.30.6
2026.02
80.70
2026.02
67.9-12.8
2026.02
66.5-14.2
2026.02
54.22-26.5
2026.02
53.6-27.1
2026.02
46.32-34.4
2025.07
38.19-
2025.07
37.82-
2025.07
37.58-
2025.07
37.21-
2025.07
36.84-
2026.02
36.7-44
2026.02
36.6-
2025.07
36.47-
2025.07
36.47-
2026.02
36.47-
2025.07
36.35-
2025.07
36.35-
2025.07
36.23-
2025.07
36.11-
2025.07
36.11-
2025.07
35.86-
2025.07
35.62-
2025.07
35.37-
2025.07
35.13-
2025.07
35.01-
2026.02
34.64-46.1
2026.02
34.64-46.1
2025.07
34.03-
2026.02
33.9-46.8
2025.07
33.9-
2026.02
32.2-48.5
2025.07
31.33-
2025.07
31.21-
2025.07
31.09-
2025.07
30.48-
2025.07
29.5-
2025.07
28.4-
2025.07
28.4-
2025.07
27.54-
2026.02
26.2-54.5
2026.02
24.4-56.3
2026.02
24.2-56.5
2025.07
24.11-
2026.02
22-58.7
2026.02
21-59.7
2026.02
20-60.7
2026.02
19-61.7