Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering on Sum

4.46GenericJudge Score

GPT-4.1 + HYVE

3.79443.96724.144.3128Apr 7, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2026.04
4.46206,0004.99
2026.04
4.43209,2005.57
2026.04
3.85372,40022.23
2026.04
3.82381,30023.66