Instrumental Variable Discovery on Gapminder Female literacy → Kids
Top Relevance: 19.81 (IV Co-Scientist)
[Chart: Relevance by date, with a Cnorm toggle; x-axis tick at Feb 8, 2026]
Evaluation Results
| Method | Backbone LLM | Date | Relevance | Cnorm |
|---|---|---|---|---|
| IV Co-Scientist | GPT-4o | 2026.02 | 19.81 | 51.8 |
| IV Co-Scientist | o3-mini | 2026.02 | 19.81 | 51.8 |
| IV Co-Scientist | QwQ | 2026.02 | 18.23 | 52.9 |
| IV Co-Scientist | Llama3.1 8b | 2026.02 | 18.23 | 52.9 |
| IV Co-Scientist | Llama3.1 70b | 2026.02 | 17.5 | 55.9 |