NorEval

Benchmarks

Task Name	Dataset Name	SOTA Result	Trend
Large Language Model Evaluation	NorEval (test)	Overall Score: 0.455		8

© 2026 wizwand