Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Failure Reasoning and Correction on Real-World Benchmark (test)

62.1ROUGE-L

Dream2Fix-VLM

-2.48414.28331.0547.817Mar 13, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
62.166.88242.147.2
2026.03
46.758.9982537.4
2026.03
1947.37222.112.6
2026.03
993012.82.2
2026.03
6.147.89316.718.3
2026.03
5.217.62510.50.8
2026.03
00035.40