Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Scenario Understanding on Scenario Understanding (OOD)

100Success Rate

GPT-4.1-2025-04-14

3.17628.31353.4578.587Sep 29, 2025
Updated 4d ago

Evaluation Results

MethodLinks
10087.8
10059.6
10069.2
10042.8
98.762.6
2025.09
98.387.7
2025.09
92.534
87.634.7
46.832.1
2025.09
32.625.1
2025.09
22.737.5
2025.09
22.127.8
2025.09
6.9-