Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multiple Choice Question Answering on BertaQA Global
Loading...
93.52
Accuracy
Claude 3.5 Sonnet
66.0432
73.1766
80.31
87.4434
Jun 9, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Claude 3.5 Sonnet
Model Scale=Proprietary
2025.06
93.52
GPT-4o
Model Scale=Proprietary
2025.06
91.01
70B + CEU IEN
Model Scale=70B, Train...
2025.06
87.42
70B INSTRUCT EN
Model Scale=70B, Train...
2025.06
83.53
8B + CEU IEN
Model Scale=8B, Traini...
2025.06
74.62
8B + CEU IEU
Model Scale=8B, Traini...
2025.06
73.54
8B + CEU IEN+EU
Model Scale=8B, Traini...
2025.06
72.99
8B INSTRUCT EN
Model Scale=8B, Traini...
2025.06
67.1
Feedback
Search any
task
Search any
task