
Figurative-to-Literal Steering on Human Evaluation (sample of 100)

Successful Sentences Count: 75

GPT-OSS-20B

Updated 3mo ago

Evaluation Results

Method	Links	Successful Sentences Count
GPT-OSS-20B 2026.04		75
Llama-3.1-8B 2026.04		71
Qwen3-8B 2026.04		68
Gemma2-9B 2026.04		67
Gemma2-9B 2026.04		52
Qwen3-8B 2026.04		45
Llama-3.1-8B 2026.04		42
GPT-OSS-20B 2026.04		39