
Combined Suite

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| LLM Alignment | Combined Suite Setup 3 | Average Percentage Score: 54.38 | 9 |
| Knowledge-Preserved Adaptation | Combined Suite TriviaQA, NQ open, WebQS, HumanEval, MBPP | Average Score: 21.91 | 4 |