Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Reasoning on Average of Reasoning Tasks
Loading...
63.31
Average Accuracy
PASER
57.2468
58.8209
60.395
61.9691
Feb 18, 2025
Average Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Average Accuracy
PASER
Quantization=GPTQ 4 bi...
2025.02
63.31
w/o training
Quantization=w/o Quant...
2025.02
62.91
PASER
Quantization=RTN 4 bit...
2025.02
62.58
Nuggets
Quantization=GPTQ 4 bi...
2025.02
61.96
IFD
Quantization=GPTQ 4 bi...
2025.02
61.42
Nuggets
Quantization=RTN 4 bit...
2025.02
61.23
Instruction Mining
Quantization=GPTQ 4 bi...
2025.02
60.68
IFD
Quantization=RTN 4 bit...
2025.02
60.49
Full Data
Quantization=GPTQ 4 bi...
2025.02
60.4
w/o training
Quantization=GPTQ 4 bi...
2025.02
60.21
Full Data
Quantization=RTN 4 bit...
2025.02
59.74
Random
Quantization=GPTQ 4 bi...
2025.02
59.34
Instruction Mining
Quantization=RTN 4 bit...
2025.02
59.23
w/o training
Quantization=RTN 4 bit...
2025.02
59.05
Random
Quantization=RTN 4 bit...
2025.02
57.48
Feedback
Search any
task
Search any
task