Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
max-int on max-int
Loading...
100
Answer Rate
Vanilla
95
97.5
100
102.5
Apr 9, 2026
Answer Rate
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Answer Rate
Accuracy
Vanilla
Prompting Strategy=Van...
2026.04
100
75.3
CoT
Prompting Strategy=CoT
2026.04
100
68.8
ICL
Prompting Strategy=ICL
2026.04
100
71
SepSeq
Prompting Strategy=SepSeq
2026.04
100
92.4
Feedback
Search any
task
Search any
task