Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning explanation generation on ConversationGoT-120h (test)

4.46Alignment

GPT-5 (thinking)

3.23283.55143.874.1886Feb 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
4.464.334.324.65
2026.02
4.44.274.214.38
2026.02
3.43.273.213.38
2026.02
3.283.133.213.88