Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multiple Choice Question Answering on BertaQA Local
Loading...
80.45
Accuracy
Claude 3.5 Sonnet
43.5508
53.1304
62.71
72.2896
Jun 9, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Claude 3.5 Sonnet
Model Scale=Proprietary
2025.06
80.45
70B + CEU IEN
Model Scale=70B, Train...
2025.06
77.71
GPT-4o
Model Scale=Proprietary
2025.06
74.83
8B + CEU IEU
Model Scale=8B, Traini...
2025.06
66.07
8B + CEU IEN+EU
Model Scale=8B, Traini...
2025.06
65.57
8B + CEU IEN
Model Scale=8B, Traini...
2025.06
65.23
70B INSTRUCT EN
Model Scale=70B, Train...
2025.06
53.51
8B INSTRUCT EN
Model Scale=8B, Traini...
2025.06
44.97
Feedback
Search any
task
Search any
task