
Multi-turn conversation performance on Actions

Average Performance: 93.7 (Full)

Updated 5mo ago

Evaluation Results

Method	Date	Average Performance	(unlabeled in source)
Full	2026.02	93.7	92.4
Full	2026.02	92.2	88.6
Full	2026.02	90.2	93.2
Experience-Driven Mediator	2026.02	88	71.6
Experience-Driven Mediator	2026.02	85.7	81.2
Experience-Driven Mediator	2026.02	76.2	65.2
Sharded	2026.02	45.5	60
Sharded	2026.02	42.3	48.6
Sharded	2026.02	35.6	46.6