Multi-agent Reasoning on ARMMAN

85.78 Accuracy

OW-L

[Chart: accuracy axis ticks 84.3448, 84.7174, 85.09, 85.4626; date axis Oct 1, 2025]
Updated 14d ago

Evaluation Results

Date      Accuracy
2025.10   85.78
2025.10   85.78
2025.10   85.78
2025.10   85.32
2025.10   85.24
2025.10   85.1
2025.10   84.94
2025.10   84.79
2025.10   84.4