Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Script Generation on Gardening (test)
Loading...
1.92
Next Step Correctness
BART
1.2856
1.4503
1.615
1.7797
Aug 25, 2022
Next Step Correctness
Future Steps Correctness
Diversity
Executability
Updated 1mo ago
Evaluation Results
Method
Method
Links
Next Step Correctness
Future Steps Correctness
Diversity
Executability
BART
2022.08
1.92
2.05
2.43
1.6
+CP
Components=CP
2022.08
1.78
1.93
2.7
1.39
+CP+M
Components=CP, Selecti...
2022.08
1.77
1.95
2.41
1.37
+CP+M+R
Components=CP, Selecti...
2022.08
1.48
1.55
2.66
1.29
+CP+M+R+CL
Components=CP, Selecti...
2022.08
1.31
1.37
1.27
1.18
Feedback
Search any
task
Search any
task