Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Gap discovery quality on Scientist-Bench 27 tasks

5Gaps/Task

AI-Supervisor (RWM)

1.882.693.54.31Mar 25, 2026
Updated 2mo ago

Evaluation Results

MethodLinks
2026.03
580.71004.44
2026.03
4.967.992.64.15
2026.03
275.592.64.04