Question Answering on Sum
[Chart: GenericJudge Score. Highlighted point: GPT-4.1 + HYVE, 4.46, Apr 7, 2026. Selectable metrics: GenericJudge Score, Token Usage, Latency (s).]
Updated 11d ago
Evaluation Results
Method          Method details             Date     GenericJudge Score  Token Usage  Latency (s)
GPT-4.1 + HYVE  Optimization=HYVE pipe...  2026.04  4.46                206,000      4.99
GPT-4.1         Optimization=Standard...   2026.04  4.43                209,200      5.57
GPT-5 + HYVE    Optimization=HYVE pipe...  2026.04  3.85                372,400      22.23
GPT-5           Optimization=Standard...   2026.04  3.82                381,300      23.66
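To make the rows above easier to compare, the following minimal sketch computes how each HYVE entry differs from its standard counterpart. The figures are copied from the results; the names `results`, `relative_change`, and `compare` are hypothetical, and treating the "Standard" row as the baseline for the matching "+ HYVE" row is an assumption, since the page does not state how the runs are paired.

```python
# Figures copied from the Evaluation Results rows:
# (GenericJudge score, token usage, latency in seconds).
# Pairing each "+ HYVE" row with the plain row of the same model is an assumption.
results = {
    "GPT-4.1": {"baseline": (4.43, 209_200, 5.57), "hyve": (4.46, 206_000, 4.99)},
    "GPT-5": {"baseline": (3.82, 381_300, 23.66), "hyve": (3.85, 372_400, 22.23)},
}


def relative_change(new: float, old: float) -> float:
    """Percentage change from old to new (negative means a reduction)."""
    return (new - old) / old * 100.0


def compare(model: str) -> dict:
    """Summarize the HYVE row against the standard row for one model."""
    base_score, base_tokens, base_latency = results[model]["baseline"]
    hyve_score, hyve_tokens, hyve_latency = results[model]["hyve"]
    return {
        "score_delta": round(hyve_score - base_score, 2),
        "tokens_pct": round(relative_change(hyve_tokens, base_tokens), 1),
        "latency_pct": round(relative_change(hyve_latency, base_latency), 1),
    }


if __name__ == "__main__":
    for model in results:
        print(model, compare(model))
```

Differences are rounded to the precision the table supports. The page gives no variance or sample size, so small score gaps should be read with caution.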