Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-Choice Question Answering on Weather Reasoning MCQA-L
Loading...
66.4
Accuracy
TimeClaw
43.936
49.768
55.6
61.432
May 11, 2026
Accuracy
Updated 22d ago
Evaluation Results
Method
Method
Links
Accuracy
TimeClaw
2026.05
66.4
Llama-3
2026.05
62.6
GPT-5.4
2026.05
58
DeepSeek
2026.05
57.3
Qwen-3
2026.05
56.2
Gemini
2026.05
54
GPT-4o
2026.05
44.8
Feedback
Search any
task
Search any
task