Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Qwen-Agent Code Interpreter

Benchmarks

Task NameDataset NameSOTA ResultTrend
Code ExecutionQwen-Agent Code Interpreter Average
Accuracy70.5
3
Code ExecutionQwen-Agent Code Interpreter Visualization-Easy
Accuracy68.4
3
Code ExecutionQwen-Agent Code Interpreter Visualization-Hard
Accuracy72.6
3
Showing 3 of 3 rows