Autonomous Incident Management on AIOpsLab full benchmark (86 tasks)

Detection Rate (best@k): 100

AOI

Mar 3, 2026
Updated 1mo ago

Evaluation Results

Method    Links
2026.03   100    66.9   53.6   27.9   30.8   7.7    46.2   23.1   66.3   38.6
2026.03   90.6   -      32.1   -      38.5   -      53.8   -      58.1   -
2026.03   78.1   -      25     -      15.4   -      23.1   -      43     -
2026.03   75     41.3   32.1   11.4   7.7    4.6    15.4   15.4   41.9   22.1
2026.03   68.8   -      53.6   -      15.4   -      76.9   -      57     -
2026.03   25     -      9.5    -      7.7    -      7.7    -      14.7   -