Share your thoughts, 1 month free Claude Pro on usSee more

Coreference Resolution on Winogender (test)

80.7Accuracy

GLM-130B

Updated 4mo ago

Evaluation Results

Method	Links
GLM-130B 2022.10		80.7	-
GLM-130B 2022.10		79.7	-
PaLM 540B 2022.10		79.4	-
Chinchilla 2022.10		78.3	-
PaLM 540B 2022.10		75	-
Gopher 280B 2022.10		71.4	-
GPT-3 (Davinci) 2022.10		64.2	-
GPT-3 (Davinci) 2022.10		62.6	-
OPT 175B 2022.10		54.8	-
BLOOM 176B 2022.10		53.1	-
BLOOM 176B 2022.10		49.1	-
HATified-SFT 2026.03		-	67.9
Llama-Instruct 2026.03		-	84.3