Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GUI Navigation and Action on OS World (test)

2,056Success Rate (Avg)

ANCHOR

-79.12475.191,029.51,583.81Oct 30, 2024Jan 30, 2025May 3, 2025Aug 3, 2025Nov 4, 2025Feb 4, 2026May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2026.02
2,05636.3713.7918.5238.4657.14-26.6718.1837.5-7.1450
2026.02
1,77545.4510.3418.5238.4642.85-23.3327.2725-5.7125
2026.02
1,68227.276.914.8130.7742.85-23.3327.2737.5-7.1425
2026.02
1,63536.3713.7914.8123.0742.85-13.3336.3637.5-5.7125
2026.02
7949.093.453.715.3814.29-1027.2725-4.290
2026.02
70103.457.4115.3814.29-109.0912.5-2.8512.5
2026.02
5619.0903.715.380-1018.1812.5-2.860
2026.02
5149.093.453.77.690-6.6718.180-2.8612.5
2026.02
5149.0903.77.690-6.6718.1812.5-4.290
2026.02
4679.0907.417.6914.29-6.6700-4.290
2026.02
9300000-3.3300-1.430
2026.02
77.2995.8387.2378.7291.370.58-67.3910096.15-59.1480
2026.02
72.5879.1787.2372.2686.8362.94-62.9682.6173.08-63.973.33
2024.10
72.367561.780.8573.9170.5946.6778.2673.9173.0873.27--
67.1470.8380.4370.1969.5760.82-69.4878.2669.23-52.9773.33
65.7778.2676.648.8178.1361.53-62.9678.2680.77-55.380
2026.02
64.2270.8374.4763.7468.1846.29-62.9691.376.92-49.7366.67
2026.02
63.4179.1763.8365.5360.7457.88-58.6182.6176.92-50.9180
62.8870.8372.346882.4858.18-62.9673.9153.85-49.5460
2026.02
60.767570.2150.2873.9171.94-54.2678.2665.38-47.8773.33
2026.02
4700000-3.3300-00
2026.05
42.9------------
2026.05
39------------
2026.05
24.5------------
2026.05
24.3------------
2026.05
24------------
2026.05
22.7------------
2026.05
19.9------------
2026.05
16.9------------
2026.05
16.7------------
2024.10
14.63254.2617.028.729.4126.6719.5717.3919.238.91--
2026.05
13.24------------
2024.10
11.6520.832.2314.898.723.5213.3315.2213.0415.387.92--
2026.05
9.7------------
2024.10
9.2116.67012.764.3523.526.6710.868.711.547.92--
2026.05
9.1------------
2026.05
6.04------------
2024.10
5.038.3306.774.3516.104.354.353.855.58--
2024.10
4.5920.8306.774.356.5204.354.3503.6--
2026.05
4.4------------
2026.05
3------------