Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Codebase generation on Online Chat App
Loading...
112.26
Feature Completeness (%)
GPT4
106.647
109.4535
112.26
115.0665
Oct 2, 2023
Feature Completeness (%)
Error Count
Failed Tests
Passed Tests
Updated 1mo ago
Evaluation Results
Method
Method
Links
Feature Completeness (%)
Error Count
Failed Tests
Passed Tests
GPT4
LOC=127±24.1
2023.10
112.26
-
-
-
Code-L2MAC
LOC=374±123
2023.10
-
0
146.71
-
Feedback
Search any
task
Search any
task