Question Answering on WikidataQA BIG-bench 2023
Best result: Ensemble, 71.9 Accuracy
[Chart: Accuracy by date; point shown at Jun 17, 2025]
Updated 26d ago
Evaluation Results
Method     Configuration               Date      Accuracy
Ensemble   Aggregation Method=Byt...   2025.06   71.9
QWEN3      Model=QWEN3                 2025.06   68.9
Average    Aggregation Method=Ave...   2025.06   66.3
LLAMA3.2   Model=LLAMA3.2              2025.06   65.8
OLMO2      Model=OLMO2                 2025.06   64.3