Multi-agent coordination on TrafficJunction-Large (TJ-L) ablation (test)
[Chart: Task Success Rate. Highlighted point: MMR, 67, May 13, 2026.]
Evaluation Results
Method  Attack rate  Date     Task Success Rate
MMR     0.75         2026.05  67
ML      0.5          2026.05  51
ML      0.25         2026.05  44
MMR     0.5          2026.05  41
MMR     0.25         2026.05  28
NS      0.25         2026.05  27
CBTS    0.5          2026.05  25
NS      0.75         2026.05  25
J-W     0.75         2026.05  19
VL      0.5          2026.05  18
J-W     0.5          2026.05  17
ST      0.5          2026.05  17
J-W     0.25         2026.05  16
J-M     0.25         2026.05  15
ST      0.75         2026.05  15
CBTS    0.25         2026.05  14
CBTS    0.75         2026.05  14
J-M     0.75         2026.05  13
VL      0.25         2026.05  13
VL      0.75         2026.05  13
NS      0.5          2026.05  12
J-M     0.5          2026.05  11
ST      0.25         2026.05  11
ML      0.75         2026.05  3