Scientific Reasoning on Lens

Accuracy: 56.2

Gemini2.5-Pro

[Chart: Accuracy axis ticks 28.328, 35.564, 42.8, 50.036; date axis tick May 21, 2025]
Updated 20d ago

Evaluation Results

Date      Accuracy
2025.05   56.2
2025.05   53.65
2025.05   51.66
2025.05   51.14
2025.05   50.8
2025.05   49.39
2025.05   47.18
2025.05   46.28
2025.05   44.69
2025.05   44.58
2025.05   43.33
2025.05   40.56
2025.05   40.33
2025.05   39.53
2025.05   38.97
2025.05   29.4