max-float on max-float
[Chart: Answer Rate and Accuracy over time. Vanilla: Answer Rate 100 on Apr 9, 2026.]
Updated 1mo ago
Evaluation Results
Method    Settings                     Date      Answer Rate   Accuracy
Vanilla   Prompting Strategy=Van...    2026.04   100           63.9
CoT       Prompting Strategy=CoT       2026.04   100           60.5
ICL       Prompting Strategy=ICL       2026.04   100           63.3
SepSeq    Prompting Strategy=SepSeq    2026.04   100           81