
Perception on MMStar latest (test)

Top result: 67.2 CP, GPT-4o (Jun 20, 2024)

Evaluation Results

Date     CP                                        Avg
2024.06  67.2   51.6   63.2   56.4   52     36.8   54.5
2024.06  67.2   43.6   56.8   46.8   42     30.8   47.9
2024.06  65.6   42.4   56.4   50.8   46.8   28     48.3
2024.06  64.4   37.2   50     34     39.2   24.8   41.6
2024.06  64     41.6   54.4   51.6   47.6   31.2   48.4
2024.06  63.2   38.8   49.2   38.8   37.6   23.2   41.8
2024.06  63.2   28.8   46.4   40.8   34     22.8   39.3
2024.06  62.8   36.4   53.6   42     35.2   26.4   42.7
2024.06  62.4   33.2   51.2   39.2   44.4   32.8   43.9
2024.06  62.4   28.8   49.2   45.2   37.2   21.2   40.7
2024.06  62.4   36     54.8   46.4   38     24     43.6
2024.06  61.2   35.2   53.2   47.6   48.4   19.2   44.1
2024.06  61.2   34.8   54.4   42.8   45.2   25.2   43.9
2024.06  61.2   30.4   52     44.4   40.8   22.8   41.9
2024.06  60.8   39.2   58.4   47.6   42.4   19.6   44.7
2024.06  60.8   30.8   49.6   37.6   36.8   20.4   39.3
2024.06  60.8   38     48.8   35.2   43.6   20.8   41.2
2024.06  60     41.6   52.4   44     40.8   25.2   44
2024.06  59.2   33.2   51.2   42.8   40.4   30.4   42.9
2024.06  57.2   36     51.6   43.2   45.6   23.2   42.8
2024.06  57.2   31.6   49.6   42     43.6   24     41.3
2024.06  56.8   37.6   47.2   42.4   36.4   21.6   40.3
2024.06  56     32     54.8   33.6   37.2   20.8   39.1
2024.06  55.6   28.4   49.2   39.6   38.4   19.2   38.4
2024.06  54.8   28.8   48.8   37.6   32     20     37
2024.06  53.2   34.4   45.2   38.8   38.8   23.6   39
2024.06  50.8   31.6   50.4   38.8   32     25.2   38.1
2024.06  48.8   29.6   48     34.4   27.6   17.2   34.3
2024.06  47.2   24     45.2   33.6   28.8   18.8   32.9
2024.06  45.2   28.4   45.2   35.6   30     17.2   33.6