English on TheoremQA
Best score: 44.4 (Qwen2-72B-Instruct)
[Chart: Score by date; plotted point: Qwen2-72B-Instruct, 44.4, Jul 15, 2024]
Updated 1mo ago
Evaluation Results
Method                    Type                 Date       Score
Qwen2-72B-Instruct        Instruction-tuned    2024.07    44.4
Llama-3-70B-Instruct      Instruction-tuned    2024.07    42.5
Mixtral-8x22B-Instruct    Instruction-tuned    2024.07    40.8
Qwen1.5-72B-Chat          Instruction-tuned    2024.07    28.8
Qwen1.5-110B-Chat         Instruction-tuned    2024.07    18.8