Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Risk Identification on IS-Bench

69.9Step Accuracy

GPT-5.1

44.73251.26657.864.334May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.05
69.929.164.238.9
2026.05
66.727.870.841
2026.05
63.125.771.738.3
49.922.288.240.7
2026.05
45.719.176.730.5