
STEM Question Answering on MQA

45 First-Token Accuracy

Phi-4-14B

[Chart: First-Token Accuracy, May 21, 2025]
Updated 13d ago

Evaluation Results

Date      First-Token Accuracy
2025.05   45
2025.05   36.5
2025.05   34
2025.05   33.8
2025.05   29.8
2025.05   29.8
2025.05   27.8
2025.05   27.8
2025.05   25.4
2025.05   25.1
2025.05   25
2025.05   24.9
2025.05   24.5
2025.05   24.3
2025.05   24.1
2025.05   24
2025.05   24
2025.05   24
2025.05   23.5
2025.05   23.5
2025.05   23.3
2025.05   23.3
2025.05   23.1
2025.05   23.1