Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Figurative-to-Literal Steering on Human Evaluation (sample of 100)

75Successful Sentences Count

GPT-OSS-20B

37.5647.285766.72Apr 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
75
2026.04
71
2026.04
68
2026.04
67
2026.04
52
2026.04
45
2026.04
42
2026.04
39