Multi-task Language and Code Understanding on Open LLM Leaderboard and HumanEval
[Chart: benchmark scores over time for ARC, Hellaswag, MMLU, TruthfulQA, Winogrande, GSM8K, and HumanEval; e.g. Mistral-Pro scored 63.2 on ARC as of Jan 4, 2024]
Evaluation Results

| Method | Date | ARC | Hellaswag | MMLU | TruthfulQA | Winogrande | GSM8K | HumanEval |
|---|---|---|---|---|---|---|---|---|
| Mistral-Pro | 2024.01 | 63.2 | 82.6 | 60.6 | 48.3 | 78.9 | 50.6 | 32.9 |
| Gemma-7B | 2024.01 | 61.9 | 82.2 | 64.6 | 44.8 | 79 | 50.9 | 32.3 |
| Mistral-7B | 2024.01 | 60.8 | 83.3 | 62.7 | 42.6 | 78 | 39.2 | 28.7 |
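To compare the three models at a glance, the per-benchmark scores above can be collapsed into a simple unweighted average. This is a minimal illustrative sketch: the scores are copied from the Evaluation Results table, but the averaging itself is our own convenience, not a metric the leaderboard reports.

```python
# Unweighted mean across the seven benchmarks (ARC, Hellaswag, MMLU,
# TruthfulQA, Winogrande, GSM8K, HumanEval), per model.
scores = {
    "Mistral-Pro": [63.2, 82.6, 60.6, 48.3, 78.9, 50.6, 32.9],
    "Gemma-7B":    [61.9, 82.2, 64.6, 44.8, 79.0, 50.9, 32.3],
    "Mistral-7B":  [60.8, 83.3, 62.7, 42.6, 78.0, 39.2, 28.7],
}

averages = {m: round(sum(v) / len(v), 2) for m, v in scores.items()}
for model, avg in sorted(averages.items(), key=lambda kv: -kv[1]):
    print(f"{model}: {avg}")
```

By this crude average Mistral-Pro and Gemma-7B land within a fraction of a point of each other, with Mistral-7B a few points behind; note that an unweighted mean treats very different benchmarks (e.g. GSM8K math vs. HumanEval code) as equally important.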