SelfInst

Benchmarks

Task Name	Dataset Name	SOTA Result
Instruction Following	SelfInst	Rouge-L 21.7	73
Instruction Following	SelfInst	R-L Score 23.4	50
Instruction-tuning	SelfInst	ROUGE-L 21.31	21
Instruction Following	SelfInst (OOD)	Rouge-L 23.2	20
Instruction Following Evaluation	SelfInst Out-of-Distribution	GPT-4o Score 51.6	17
Dialogue Generation	SelfInst	Rouge-L 11.31	16
Generation	SelfInst (test)	LLM-as-a-Judge Score 60.16	2
