Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning (Base-9) on NoRa Inaccurate Rationales
Loading...
76.7
Accuracy
CD-CoT
-2.132
18.334
38.8
59.266
Oct 31, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CD-CoT
Model=Gemini-Pro
2024.10
76.7
CD-CoT
Model=GPT-3.5-turbo
2024.10
58.7
CC
Model=Gemini-Pro
2024.10
43.6
SC
Model=Gemini-Pro
2024.10
32.3
CC
Model=GPT-3.5-turbo
2024.10
31.7
BT
Model=Gemini-Pro
2024.10
26.7
Base
Model=Gemini-Pro
2024.10
21.2
BT
Model=GPT-3.5-turbo
2024.10
18.4
SC
Model=GPT-3.5-turbo
2024.10
17.3
CC
Model=Mixtral 8x7B
2024.10
12.5
Base
Model=GPT-3.5-turbo
2024.10
10.1
CD-CoT
Model=Mixtral 8x7B
2024.10
4.7
Base
Model=Mixtral 8x7B
2024.10
3.7
SC
Model=LLaMA2-70B
2024.10
3
CC
Model=LLaMA2-70B
2024.10
2.8
Base
Model=LLaMA2-70B
2024.10
2.7
CD-CoT
Model=LLaMA2-70B
2024.10
2.7
SC
Model=Mixtral 8x7B
2024.10
2.7
BT
Model=Mixtral 8x7B
2024.10
2.4
BT
Model=LLaMA2-70B
2024.10
0.9
Feedback
Search any
task
Search any
task