Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Integration on PM
Loading...
1
Performance Score
OpenCodeInterpreter
-0.04
0.23
0.5
0.77
Jul 24, 2024
Performance Score
Failure Reason Code
Updated 1mo ago
Evaluation Results
Method
Method
Links
Performance Score
Failure Reason Code
OpenCodeInterpreter
One-shot learning=true
2024.07
1
5
DataInterpreter
Mechanism=Tools, One-s...
2024.07
1
5
TaskWeaver
Mechanism=Plugins, One...
2024.07
1
5
LAMBDA
Mechanism=Knowledge, O...
2024.07
1
5
GPT-4-Advanced Data Analysis
One-shot learning=true
2024.07
0.8
4
ChatGLM-Data Analysis
One-shot learning=true
2024.07
0
2
OpenInterpreter
One-shot learning=true
2024.07
0
2
Chapyter
One-shot learning=true
2024.07
0
2
Feedback
Search any
task
Search any
task