Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Sentiment Analysis on SST-2 (Accuracy and PEEM/Response/Prompt Metrics)
Loading...
93
Accuracy
GPT-4o-mini
73.136
78.293
83.45
88.607
Mar 11, 2026
Accuracy
PEEM Accuracy
Response Overall Score
Prompt Overall Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
PEEM Accuracy
Response Overall Score
Prompt Overall Score
GPT-4o-mini
Task Model=GPT-4o-mini
2026.03
93
4.725
4.844
4.753
Qwen-2.5-7B-IT
Task Model=Qwen-2.5-7B-IT
2026.03
92.2
4.787
4.714
4.554
Gemini-2.5-Flash
Task Model=Gemini-2.5-...
2026.03
91.5
4.705
4.812
4.62
LLaMA-3.1-8B-IT
Task Model=LLaMA-3.1-8...
2026.03
90.1
4.662
4.704
4.451
Gemma-2-9B-IT
Task Model=Gemma-2-9B-IT
2026.03
73.9
4.599
4.68
4.307
Feedback
Search any
task
Search any
task