Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question-Type Diversity Alignment on Metacog Taxonomy
Loading...
0.059
Jensen Shannon Divergence
gemini-2.5-pro
0.05416
0.08683
0.1195
0.15217
Mar 5, 2026
Jensen Shannon Divergence
Updated 1mo ago
Evaluation Results
Method
Method
Links
Jensen Shannon Divergence
gemini-2.5-pro
Simulator Variant=AGENT
2026.03
0.059
gemini-2.5-pro
Simulator Variant=SCOT...
2026.03
0.072
Llama-3.3-70B-Instruct
Simulator Variant=SCOT...
2026.03
0.08
gpt4o
Simulator Variant=AGENT
2026.03
0.084
gpt4o
Simulator Variant=SCOT...
2026.03
0.089
gpt-oss-120b
Simulator Variant=AGENT
2026.03
0.129
Qwen3-32B
Simulator Variant=SCOT...
2026.03
0.149
gpt-oss-120b
Simulator Variant=SCOT...
2026.03
0.18
Feedback
Search any
task
Search any
task