Codebase Generation on Online Microblogging App
[Chart: Feature Completeness (%); AutoGPT, 0.3332, Oct 2, 2023. Other metrics: Error Count, Test Failure Rate, Test Success Rate]
Updated 1mo ago
Evaluation Results
Method     | Method       | Links   | Feature Completeness (%) | Error Count | Test Failure Rate | Test Success Rate
AutoGPT    | LOC=148±35.5 | 2023.10 | 0.3332                   | -           | 32.86             | -
Code-L2MAC | LOC=395±52.9 | 2023.10 | -                        | 0           | -                 | -