
Programming on HumanEval (test)

79.5 ACC (rank 1)

o1-mini

Updated 4mo ago

Evaluation Results

Method	Date	ACC	(unlabeled)	(unlabeled)
o1-mini	2024.12	79.5	4.3	14.8
4o	2024.12	72.6	6.8	21.9
GPT-4	2023.03	67	-	-
CodeT + GPT-3.5	2023.03	65.8	-	-
FP16	2023.11	57.31	-	-
AFPQ (NF3-asym)	2023.11	52.43	-	-
3.5-turbo	2024.12	50.9	10.6	28.3
GPT-3.5	2023.03	48.1	-	-
AWQ (INT3)	2023.11	47.56	-	-
AWQ (NF3-sym)	2023.11	45.12	-	-
PaLM	2023.03	26.2	-	-