Explanation self-consistency on CODAH (test)
[Chart: Accuracy by date. Highlighted point: PSCB, 83.39 Accuracy, Jun 9, 2025. Available metrics: Accuracy, CC-Cos (Worst), CC-Cos (Mean), CC-Cos (Best), CC-Sp (Worst), CC-Sp (Mean), CC-Sp (Best).]
Updated 5d ago
Evaluation Results
| Method | Configuration | Date | Accuracy | CC-Cos (Worst) | CC-Cos (Mean) | CC-Cos (Best) | CC-Sp (Worst) | CC-Sp (Mean) | CC-Sp (Best) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| PSCB | Model=LLaMA3.1-8B, Att... | 2025.06 | 83.39 | 12.87 | 13.68 | 14.52 | 7.11 | 19.97 | 32.66 |
| PSCB | Model=LLaMA3.2-3B, Att... | 2025.06 | 75.48 | 7.49 | 8.06 | 8.64 | 6.82 | 19.39 | 31.95 |
| PSCB | Model=LLaMA3.2-3B, Att... | 2025.06 | 72.05 | 1.91 | 2.47 | 2.96 | 35.24 | 44.88 | 53.6 |