Share your thoughts, 1 month free Claude Pro on usSee more

Multi-turn conversation performance on Average

94.7Avg Performance

Full

Updated 5mo ago

Evaluation Results

Method	Links
Full 2026.02		94.7	88.7
Full 2026.02		92.7	85.4
Full 2026.02		86.9	83.2
Experience-Driven Mediator 2026.02		81.9	69.9
Experience-Driven Mediator 2026.02		73.9	68.8
Experience-Driven Mediator 2026.02		72.6	63.5
Sharded 2026.02		60.8	56.2
Sharded 2026.02		53.6	54.2
Sharded 2026.02		48.5	46.8