Complex Reasoning on AQuA
Best result: AGF, 28.35 Accuracy
[Chart: Accuracy by date; plotted date May 9, 2026]
Updated 22d ago
Evaluation Results
| Method | Settings | Date | Accuracy |
|---|---|---|---|
| AGF | Sparsity=30%, Model=Qw... | 2026.05 | 28.35 |
| RKU | Sparsity=30%, Model=Qw... | 2026.05 | 28.35 |
| RKU | Sparsity=40%, Model=Qw... | 2026.05 | 27.56 |
| AGF | Sparsity=40%, Model=Qw... | 2026.05 | 27.17 |
| Taylor-FO | Sparsity=30%, Model=Qw... | 2026.05 | 26.77 |
| AGF | Sparsity=50%, Model=Qw... | 2026.05 | 26.38 |
| Wanda-Struct | Sparsity=30%, Model=Qw... | 2026.05 | 25.98 |
| Wanda-Struct | Sparsity=40%, Model=Qw... | 2026.05 | 24.41 |
| Taylor-FO | Sparsity=40%, Model=Qw... | 2026.05 | 24.02 |
| RKU | Sparsity=50%, Model=Qw... | 2026.05 | 24.02 |
| Taylor-FO | Sparsity=50%, Model=Qw... | 2026.05 | 23.23 |
| Wanda-Struct | Sparsity=50%, Model=Qw... | 2026.05 | 22.44 |