Natural Language Understanding

Benchmarks

Dataset Name	SOTA Method	Metric
GLUE	Fast Post-Training Pruning Framework	SST-2156	551	2mo ago
GLUE (dev)	XDELECTRA-l	SST-2 (Acc)97.36	529	1mo ago
GLUE (test)	Z-Code++	SST-2 Accuracy97.9	416	4mo ago
GLUE (val)	RoBERTa-Large + MUPPET	SST-297.4	201	2mo ago
SuperGLUE (dev)		Average Score93.2	91	4mo ago
GLUE (test dev)	SL-SAM	MRPC Accuracy93.45	90	1mo ago
SuperGLUE	Vega v2	SGLUE Score91.3	84	4mo ago
GLUE	PiSSA	Average Score (GLUE)89.5	76	26d ago
GLUE (test)		QNLI7,564.6	75	1mo ago
SuperGLUE (test)	ST-MoE-32B	BoolQ Accuracy92.4	74	2mo ago
GLUE	MoC	SST-2 Accuracy97.3	62	18d ago
GLUE (test val)	Full-FT	MRPC Accuracy94	59	4mo ago
GLUE	SIFT	SST-295.18	55	4mo ago
GLUE (val)		CoLA Score82.4	54	22d ago
GLUE (test)	LoRA	QNLI94.9	47	1mo ago
AGIEval	Llama 3 405B	Accuracy71.6	46	22d ago
NLP Suite (BoolQ, RTE, HellaSwag, WinoG, ARC-E, ARC-C, OpenBookQA) zero-shot		Average Accuracy72.5	41	4mo ago
GLUE	VB-LoRAall	COLA Score69.3	41	4mo ago
GLUE and SuperGLUE (test val)	SCALEARN UNIFORM	SST-295.7	37	4mo ago
GLUE 1.0 (test)		SST-2 (Acc)97.8	37	1mo ago
ARC Easy	Arcana	Accuracy78.3	36	2mo ago
HellaSwag	NLS	Accuracy85.6	35	2mo ago
ARC-c	mPLUG-Owl2	Accuracy65.8	34	2mo ago
GLUE (test)	FedTT	SST-2 Accuracy95.64	33	4mo ago
SuperGLUE	ARMADA	CB Accuracy94.5	32	4mo ago

Showing 25 of 281 rows

...