Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Execution on Qwen-Agent Code Interpreter Visualization-Hard
Loading...
72.6
Accuracy
MatPlotAgent
66.464
68.057
69.65
71.243
Feb 18, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MatPlotAgent
Visual Feedback=Enabled
2024.02
72.6
GPT-4
2024.02
66.7
MatPlotAgent
Visual Feedback=Disabled
2024.02
66.7
Feedback
Search any
task
Search any
task