Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Data Transformation (Pandas) on TDE
Loading...
45.6
Execution Accuracy
Table-Specialist
12.424
21.037
29.65
38.263
Oct 16, 2024
Execution Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
Execution Accuracy
Table-Specialist
Base Model=GPT-4, Trai...
2024.10
45.6
GPT-4
Base Model=GPT-4, Trai...
2024.10
41.8
Table-Specialist
Base Model=GPT-3.5, Tr...
2024.10
34.6
GPT-3.5
Base Model=GPT-3.5, Tr...
2024.10
29.3
Table-Specialist
Base Model=Llama3.1-8B...
2024.10
16.1
Llama3.1-8B
Base Model=Llama3.1-8B...
2024.10
13.7
Feedback
Search any
task
Search any
task