Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on ARC Challenging
Loading...
15.65
LLMcritic Calls
VecCISC + KMeans
10.7516
12.0233
13.295
14.5667
May 8, 2026
LLMcritic Calls
Reduction Percentage
Updated 23d ago
Evaluation Results
Method
Method
Links
LLMcritic Calls
Reduction Percentage
VecCISC + KMeans
Budget=20, Backbone=Mi...
2026.05
15.65
-21.73
VecCISC + KMeans
Budget=20, Backbone=Ll...
2026.05
14.8
-26
VecCISC + KMeans
Budget=20, Backbone=GP...
2026.05
13.34
-33.29
VecCISC + KMeans
Budget=20, Backbone=Ll...
2026.05
13.31
-33.43
VecCISC + KMeans
Budget=20, Backbone=Qw...
2026.05
10.94
-45.3
Feedback
Search any
task
Search any
task