Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Color Querying on CoP-QA-F
Loading...
1
AC Score
Talk2DM
0.897664
0.924232
0.9508
0.977368
Feb 12, 2026
AC Score
AQ Score
Updated 4d ago
Evaluation Results
Method
Method
Links
AC Score
AQ Score
Talk2DM
LLM Backbone=Qwen3:30B
2026.02
1
1
Talk2DM
LLM Backbone=GPT-oss:20B
2026.02
1
0.9979
Talk2DM
LLM Backbone=Magistral...
2026.02
1
0.8629
Talk2DM
LLM Backbone=Gemma3:27B
2026.02
1
1
Talk2DM
LLM Backbone=Llama3.1:8B
2026.02
0.9979
0.8755
Talk2DM
LLM Backbone=Deepseek-...
2026.02
0.9016
0.8962
Feedback
Search any
task
Search any
task