Share your thoughts, 1 month free Claude Pro on usSee more

Classification on GoEmotions (Accuracy and Macro-F1)

81.63Accuracy

slm-judge

Updated 3mo ago

Evaluation Results

Method	Links
slm-judge 2026.04		81.63	63.8
slm-judge 2026.04		79.37	49.98
slm-judge 2026.04		79.17	56.85
slm-judge 2026.04		78.19	49.67
GPT-5-nano 2026.04		51.77	39.69
GPT-5.2-chat 2026.04		50.62	40.99
GPT-5.2-chat 2026.04		49.9	39.26
GPT-4o 2026.04		47.41	37.32
GPT-5-mini-2025-08-07 2026.04		47.32	38.75
GPT-4o 2026.04		45.97	23.09