Math, Chat, IF, and General QA tasks

Benchmarks

Task Name	Dataset Name	SOTA Result	Trend
Multi-task model alignment and mixing	Math, Chat, IF, and General QA tasks Llama-3.1-8B (test)	Math Accuracy36		3

Showing 1 of 1 rows