Procedure Planning on CrossTask long horizons T=6
[Chart: Success Rate (SR) over time. Best result: KEPP, 9.27 (Mar 5, 2024).]
Updated 3d ago
Evaluation Results
Method    Evaluation setting  Date     Success Rate (SR)
KEPP      PDP...              2024.03  9.27
KEPP      Con...              2024.03  9.23
PDPP      PDP...              2024.03  8.41
KEPP      Con...              2024.03  8.09
PDPP      Con...              2024.03  7.49
E3P       Con...              2024.03  5.76
KEPP      Con...              2024.03  5.32
SkipPlan  Con...              2024.03  5.12
P³IV      Con...              2024.03  4.4
DDN       Con...              2024.03  1.2