FairEval

Benchmarks

Task Name	Dataset Name	SOTA Result	Trend
LLM Evaluation Performance	FairEval	Accuracy 0.6375		14
