Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Evaluation on NaturalQuestions (Accuracy)

0.433Accuracy

Yuan3.0-1T Base

0.398680.407590.41650.42541Jan 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
0.433
2026.01
0.415
2026.01
0.4