Code Summarization on Python Code Summarization downstream dataset
[Chart: CompScore over time. Top entry: GPT-5-mini, CompScore 57.1, Feb 2, 2026]
Evaluation Results
| Method | Input Modality | Date | CompScore |
|---|---|---|---|
| GPT-5-mini | Text | 2026.02 | 57.1 |
| Gemini-3-Pro | Image | 2026.02 | 56.8 |
| Qwen-3-VL | Text | 2026.02 | 56.6 |
| GPT-5.1 | Text | 2026.02 | 56.6 |
| GPT-5-mini | Image | 2026.02 | 56.5 |
| Qwen-3-VL | Image | 2026.02 | 56.4 |
| Gemini-3-Pro | Text | 2026.02 | 56 |
| GPT-5.1 | Image | 2026.02 | 55.9 |
| Gemini-3-Flash | Image | 2026.02 | 55.5 |
| GLM-4.6v | Text | 2026.02 | 55.4 |
| Gemini-2.5-Pro | Image | 2026.02 | 55.2 |
| Gemini-3-Flash | Text | 2026.02 | 55.2 |
| GLM-4.6v | Image | 2026.02 | 54.6 |
| Gemini-2.5-Pro | Text | 2026.02 | 54.5 |