
DS-1000

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Code Generation | DS-1000 | Pass@1: 58.65 | 28 |
| Code Generation | DS-1000 1.0 (test) | Matplotlib: 67.2 | 19 |
| Code Hallucination Detection | DS-1000 | OP: 0.95 | 16 |
| Code Generation | DS-1000 | Matplotlib Score: 68.6 | 15 |
| Data science code generation | DS-1000 | Matplotlib Score: 60.3 | 13 |
| Code Generation | DS-1000 | Accuracy: 69.9 | 11 |
| Data Science Code Completion | DS-1000 | Pandas (Pass@1): 32 | 9 |
| Data Science | DS-1000 | Performance Score: 56.5 | 8 |
| Code Completion | DS-1000 (test) | Matplotlib Success Rate: 56.1 | 8 |
| Code Generation | DS-1000 Python | Pass@1: 39.7 | 7 |
| Code Completion | DS-1000 1.0 (test) | Success Rate (Matplotlib): 55.2 | 5 |
| Code Suggestion | DS-1000 | Binary Reward: 51 | 4 |
| Code Test Generation | DS-1000 | Source Pass Rate: 68.12 | 4 |
| Code Insertion | DS-1000 1.0 (test) | Success Rate (Matplotlib): 55.2 | 3 |