Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scientific Reasoning on SciBench
Loading...
28.52
Score
GPT-4
-0.7248
6.8676
14.46
22.0524
Jan 15, 2024
Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Score
GPT-4
Parameter Scale=API
2024.01
28.52
GPT-3.5-turbo
Parameter Scale=API
2024.01
12.17
Mistral-7B: MetaMATH + SciInstruct
Fine-tuning=SciInstruc...
2024.01
6.23
Mistral-7B: MetaMATH
Evaluation Protocol=fe...
2024.01
6.17
SciGLM
Backbone=ChatGLM3-32B-...
2024.01
5.15
Mistral-7B: MetaMATH
Evaluation Protocol=ze...
2024.01
4.63
ChatGLM3-32B-Base
Parameter Scale=30B~32B
2024.01
4.29
SciGLM
Backbone=ChatGLM3-6B-B...
2024.01
3.77
Llama3-8B-Instruct
Evaluation Protocol=fe...
2024.01
3.6
Llama3-8B-Instruct + SciInstruct
Fine-tuning=SciInstruc...
2024.01
3.6
ChatGLM3-6B
Parameter Scale=6B~7B
2024.01
2.4
ChatGLM3-6B-Base
Parameter Scale=6B~7B
2024.01
2.4
ChatGLM2-6B
Parameter Scale=6B~7B
2024.01
1.54
LLaMA-2-13B
Parameter Scale=12B~13B
2024.01
1.37
ChatGLM2-6B-Base
Parameter Scale=6B~7B
2024.01
1.2
Llama3-8B-Instruct
Evaluation Protocol=ze...
2024.01
1.03
LLaMA-2-7B
Parameter Scale=6B~7B
2024.01
0.4
Feedback
Search any
task
Search any
task