CodeGemma: Open Code Models Based on Gemma
About
This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and instruction-tuned (IT) variants have remarkably resilient natural language understanding, excel in mathematical reasoning, and match code capabilities of other open models. CodeGemma 2B is a state-of-the-art code completion model designed for fast code infilling and open-ended generation in latency-sensitive settings.
CodeGemma Team: Heri Zhao, Jeffrey Hui, Joshua Howland, Nam Nguyen, Siqi Zuo, Andrea Hu, Christopher A. Choquette-Choo, Jingyue Shen, Joe Kelley, Kshitij Bansal, Luke Vilnis, Mateo Wirth, Paul Michel, Peter Choy, Pratik Joshi, Ravin Kumar, Sarmad Hashmi, Shubham Agrawal, Zhitao Gong, Jane Fine, Tris Warkentin, Ale Jakse Hartman, Bin Ni, Kathy Korevec, Kelly Schaefer, Scott Huffman• 2024
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | DS-1000 1.0 (test) | Matplotlib54.7 | 19 | |
| Code Generation | ClassEval | Pass@1 (Class-level)21 | 9 | |
| Code Generation | EvoEval semantic-altering | Pass@1 Average35.4 | 9 | |
| Code Generation | LiveCodeBench start date 2023-09-01 May 2023 to February 2024 | Pass@115.2 | 9 | |
| Code Editing | CanItEdit original (test) | Pass@1 (Average)34.2 | 9 | |
| Code Generation | LiveCodeBench May 2023 to February 2024 (start date 2023-07-01) | Pass@114.1 | 9 | |
| Code Generation | LiveCodeBench May 2023 to February 2024 (start date 2023-05-01) | Pass@113.6 | 9 |
Showing 7 of 7 rows