| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Natural Language Inference | RTE | Accuracy | 90 | 367 |
| Text Classification | RTE | Accuracy | 84.4 | 78 |
| Natural language inference | RTE (test) | Accuracy | 90.25 | 52 |
| Natural Language Inference | RTE | Accuracy (0-shot) | 84.8 | 42 |
| Recognizing Textual Entailment | RTE (test) | Accuracy | 76.53 | 26 |
| Natural Language Inference | RTE (val) | Accuracy | 0.918 | 24 |
| Recognizing Textual Entailment | RTE | Delta | 126.24 | 24 |
| Natural Language Inference | RTE | Avg Accuracy | 81.2 | 21 |
| Recognizing Textual Entailment | RTE (Recognizing Textual Entailment) GLUE (val) | Accuracy | 66.06 | 18 |
| Zero-shot Prediction | RTE | Zero-shot Accuracy (RTE) | 62.82 | 17 |
| Recognizing Textual Entailment | RTE | Accuracy | 83.13 | 16 |
| Natural Language Inference | RTE (dev) | Accuracy | 90.5 | 12 |
| Recognizing Textual Entailment | RTE | Total Communication Time ($10^3$ s) | 4.29 | 9 |
| Natural Language Inference | RTE GLUE (test dev) | Accuracy | 84 | 8 |
| Natural Language Inference | RTE SuperGLUE (test) | Accuracy | 66.13 | 8 |
| Natural Language Inference | RTE | F1 Score | 80.91 | 7 |
| Natural language inference | RTE | Macro-F1 | 65.8 | 6 |
| Recognizing Textual Entailment | RTE | F1 Macro | 92.1 | 5 |
| Natural Language Inference | RTE | Delta Accuracy | -0.03 | 3 |
| Indirect Prompt Injection Sanitization | RTE | GCG ASR | 1 | 2 |
| Indirect Prompt Injection Attack | RTE | Attack Success Rate | 93 | 2 |
| Indirect Prompt Injection Detection | RTE | GCG Accuracy | 92.5 | 1 |