Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Explanation self-consistency on ECQA (test)
Loading...
71.11
Accuracy
PSCB
64.5788
66.2744
67.97
69.6656
Jun 9, 2025
Accuracy
CC-Cos (Worst)
CC-Cos (Mean)
CC-Cos (Best)
CC-Sp (Worst)
CC-Sp (Mean)
CC-Sp (Best)
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
CC-Cos (Worst)
CC-Cos (Mean)
CC-Cos (Best)
CC-Sp (Worst)
CC-Sp (Mean)
CC-Sp (Best)
PSCB
Model=LLaMA3.1-8B, Att...
2025.06
71.11
0.077
0.0818
0.0867
0.0507
0.1847
0.3176
PSCB
Model=LLaMA3.1-8B, Att...
2025.06
66.51
0.0136
0.0167
0.0198
0.3379
0.417
0.4911
PSCB
Model=LLaMA3.2-3B, Att...
2025.06
65.85
0.013
0.0165
0.0199
0.0975
0.2228
0.3464
PSCB
Model=LLaMA3.2-3B, Att...
2025.06
64.83
0.0182
0.0225
0.0266
0.3055
0.3872
0.4654
Feedback
Search any
task
Search any
task