Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reasoning explanation generation on ConversationGoT-120h (test)
Loading...
4.46
Alignment
GPT-5 (thinking)
3.2328
3.5514
3.87
4.1886
Feb 11, 2026
Alignment
Justification
Caption
Clarity
Updated 1mo ago
Evaluation Results
Method
Method
Links
Alignment
Justification
Caption
Clarity
GPT-5 (thinking)
Latency (s)=16.98 ± 5.29
2026.02
4.46
4.33
4.32
4.65
Graph-of-Thoughts
Latency (s)=0.74 ± 0.12
2026.02
4.4
4.27
4.21
4.38
GPT-4o
Latency (s)=2.98 ± 1.04
2026.02
3.4
3.27
3.21
3.38
Random selector
Latency (s)=0.73 ± 0.11
2026.02
3.28
3.13
3.21
3.88
Feedback
Search any
task
Search any
task