Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Language Understanding on 12-task evaluation suite (test)

71.62Average Score

Efficient-DLM 8B

32.744842.837452.9363.0226Dec 16, 2025Dec 18, 2025Dec 20, 2025Dec 23, 2025Dec 25, 2025Dec 27, 2025Dec 30, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
71.62----39.9967.3669.2277.2274.88
2025.12
71.58----42.5168.4569.8776.9373.71
2025.12
70.93----103.8965.6468.5277.2274.88
2025.12
70.65----126.4364.9568.2177.2274.88
2025.12
67.97----47.1363.8566.2773.1970.91
2025.12
67.54----44.1362.0867.9871.870.87
2025.12
67.39----119.3361.3768.5671.870.87
2025.12
67.18----130.2460.9668.171.870.87
2025.12
65.3----28.1158.9258.396772.83
2025.12
60.257.566.454.4912-----
2025.12
59.39----71.5954.2254.1562.5364.99
2025.12
59.157.466.253312-----
2025.12
54.92----25.0438.149.1365.8668.5
2025.12
54.47----73.0342.1746.9860.9666
2025.12
52.09----68.5242.3342.657.6362.58
2025.12
52.04----158.8942.3342.2857.6362.58
2025.12
51.77----184.4841.7941.857.6362.58
2025.12
41.98----99.9331.925.9747.6555.31
2025.12
40.14----112.8421.0630.9749.9968.44
2025.12
34.24----143.9124.454.9830.9860.62