Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Model Capability on Table 3 Evaluation Suite

70.94Average Score

OPTIMER

38.533646.946855.3663.7732Mar 30, 2026
Updated 18d ago

Evaluation Results

MethodLinks
2026.03
70.94
2026.03
70.19
54.37
39.78