Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Data analysis on QRData
Loading...
62.04
Pass@1
DATAMIND
57.4848
58.6674
59.85
61.0326
Sep 29, 2025
Pass@1
Pass@3
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1
Pass@3
DATAMIND
Backbone=Qwen-2.5-14B
2025.09
62.04
77.62
ReAct
Backbone=DeepSeek-V3.1
2025.09
60.75
75.67
ReAct
Backbone=Qwen-2.5-72B
2025.09
60.5
72.75
DATAMIND
Backbone=Qwen-2.5-7B
2025.09
57.66
69.34
Feedback
Search any
task
Search any
task