Code Generation on APPS Interview

2.64 Pass@1

Codex

Updated 4mo ago

Evaluation Results

Method	Links
Codex 2021.07		2.64	5.78	3.23	7.13	-	-	-	-	-	-	-	-	-
LCP 2026.03		0.775	-	-	-	-	-	-	-	-	-	-	-	-
LPW 2026.03		0.652	-	-	-	-	-	-	-	-	-	-	-	-
GPT-Neo 2021.07		0.57	-	0.8	-	-	-	-	-	-	-	-	-	-
LDB 2026.03		0.522	-	-	-	-	-	-	-	-	-	-	-	-
Baseline 2026.03		0.435	-	-	-	-	-	-	-	-	-	-	-	-
Codex 2021.07		0.14	0.3	0.51	1.02	2.04	7.94	3.7	-	-	-	-	-	-
CODET 2022.07		0.081	-	-	-	-	-	-	-	0.112	0.181	-	-	-
Baseline 2022.07		0.051	-	-	-	-	-	-	-	-	0.128	0.23	-	-