
Large Language Model Evaluation on NorEval (test)

Overall Score: 0.455

NorwAI-Mistral-7B

[Chart: Overall Score, Jan 6, 2026]
Updated 4d ago

Evaluation Results

Method    Links
2026.01   0.455   69   0.472   0.707   0.359   0.367   0.395   0.371   0.319   0.377   0.732
2026.01   0.441   59   0.479   0.663   0.298   0.302   0.354   0.388   0.375   0.377   0.729
2026.01   0.436   61   0.592   0.687   0.34    0.316   0.387   0.407   0.33    0.146   0.72
2026.01   0.419   47   0.513   0.595   0.274   0.266   0.25    0.259   0.494   0.387   0.73
2026.01   0.409   13   0.169   0.772   0.352   0.247   0.493   0.234   0.548   0.561   0.305
2026.01   0.397   38   0.234   0.777   0.211   0.46    0.435   0.471   0.295   0.116   0.575
2026.01   0.385   32   0.532   0.575   0.277   0.403   0.254   0.223   0.359   0.149   0.697
2026.01   0.356   28   0.518   0.408   0.235   0.391   0.233   0.239   0.356   0.139   0.688