
Text-centric Reasoning on VideoThinkBench mini (test)

Average Score: 89

Gemini 2.5 Pro

Nov 6, 2025
Updated 1mo ago

Evaluation Results

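Date | Avg | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 | S9 | S10 | S11 | S12 | S13
2025.11 | 89 | 100 | 100 | 100 | 90 | 100 | 83.3 | 93.3 | 80 | 80 | 85 | 65 | 95 | 85
2025.11 | 86.6 | 100 | 100 | 100 | 100 | 100 | 83.3 | 93.3 | 80 | 83.9 | 75 | 55 | 85 | 70
2025.11 | 77.6 | 100 | 100 | 80 | 50 | 84.6 | 58.3 | 100 | 56 | 80 | 70 | 65 | 90 | 75
2025.11 | 77.2 | 100 | 100 | 60 | 40 | 100 | 83.3 | 100 | 60 | 80 | 80 | 45 | 80 | 75
2025.11 | 75.8 | 100 | 95 | 100 | 70 | 76.9 | 66.7 | 80 | 64 | 57.1 | 65 | 65 | 80 | 65
2025.11 | 72.5 | 100 | 95 | 80 | 50 | 76.9 | 66.7 | 93.3 | 40 | 65.7 | 75 | 45 | 90 | 65
2025.11 | 67.6 | 100 | 90 | 50 | 40 | 76.9 | 66.7 | 73.3 | 56 | 45.7 | 75 | 45 | 90 | 70
2025.11 | 66 | 56.7 | 65 | 80 | 80 | 69.2 | 75 | 80 | 44 | 62.9 | 75 | 45 | 75 | 50
2025.11 | 57.1 | 76.7 | 65 | 40 | 30 | 69.2 | 66.7 | 73.3 | 52 | 54.3 | 70 | 45 | 60 | 40
2025.11 | 55.7 | 100 | 80 | 20 | 10 | 69.2 | 75 | 60 | 36 | 48.6 | 55 | 60 | 55 | 55
2025.11 | 48.3 | 80 | 70 | 50 | 20 | 61.5 | 16.7 | 60 | 52 | 42.9 | 50 | 45 | 35 | 45
2025.11 | 44.5 | 93.3 | 80 | 50 | 20 | 61.5 | 41.7 | 80 | 40 | 51.4 | 25 | 5 | 20 | 10
2025.11 | 41.4 | 90 | 40 | 0 | 0 | 69.2 | 25 | 46.7 | 40 | 22.9 | 50 | 40 | 65 | 50
2025.11 | 38.4 | 76.6 | 40 | 10 | 20 | 61.5 | 33.3 | 86.6 | 16 | 65.7 | 30 | 10 | 30 | 20
2025.11 | 12.5 | 30 | 35 | 20 | 0 | 15.4 | 0 | 6.7 | 8 | 2.9 | 0 | 25 | 10 | 10
2025.11 | 10.5 | 0 | 0 | 0 | 0 | 15.4 | 16.7 | 6.7 | 4 | 8.6 | 15 | 15 | 30 | 25
2025.11 | 10.4 | 20 | 10 | 0 | 20 | 15.4 | 0 | 0 | 20 | 0 | 5 | 30 | 5 | 10
2025.11 | 8.9 | 6.7 | 0 | 0 | 0 | 0 | 0 | 6.7 | 4 | 2.9 | 25 | 40 | 10 | 20
2025.11 | 7.2 | 0 | 0 | 0 | 0 | 30.8 | 16.7 | 0 | 8 | 2.9 | 0 | 10 | 20 | 5
2025.11 | 5.8 | 0 | 0 | 0 | 0 | 30.8 | 33.3 | 0 | 0 | 5.7 | 0 | 0 | 5 | 0
2025.11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
2025.11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0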
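
The leading value in each results row appears to be the unweighted mean of the 13 scores that follow it, rounded to one decimal place. The page does not state this; it is an assumption inferred from the numbers. A minimal Python sketch that checks the assumption against the top row (the names `top_row_scores`, `reported_average`, and `mean_score` are illustrative, not part of the leaderboard):

```python
# Assumption: the reported average is the plain mean of the 13 sub-scores.
# Values below are the top results row, in page order.
top_row_scores = [100, 100, 100, 90, 100, 83.3, 93.3, 80, 80, 85, 65, 95, 85]
reported_average = 89  # shown on the page as "Average Score"


def mean_score(scores):
    """Return the unweighted arithmetic mean, rounded to one decimal place."""
    return round(sum(scores) / len(scores), 1)


computed = mean_score(top_row_scores)
print(computed, reported_average)
```

If the two printed values agree to within rounding, the row is consistent with an unweighted mean; a weighted scheme would generally not reproduce the reported figure.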