Share your thoughts, 1 month free Claude Pro on usSee more

Tool Selection on Chess Specialists: opening, midgame, endgame, late-endgame

64.4Accuracy

Gold (Ground Truth Documentation)

Updated 4mo ago

Evaluation Results

Method	Links
Gold (Ground Truth Documentation) 2026.02		64.4	1,411
Gold (Ground Truth Documentation) 2026.02		52.8	1,243
TOOLOBSERVER 2026.02		40.1	1,020
Play2Prompt 2026.02		35.8	966
TOOLOBSERVER 2026.02		32.1	949
EasyTool 2026.02		25.8	739
Base (Opacified set) 2026.02		24.9	772
Base (Opacified set) 2026.02		23.5	728
EasyTool 2026.02		23.2	761
Play2Prompt 2026.02		19.5	754