Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instrumental Variable Discovery on Gapminder Poverty → Cholesterol
Loading...
13.44
Relevance
IV Co-Scientist
10.1952
11.0376
11.88
12.7224
Feb 8, 2026
Relevance
Cnorm
Updated 4d ago
Evaluation Results
Method
Method
Links
Relevance
Cnorm
IV Co-Scientist
Backbone LLM=GPT-4o
2026.02
13.44
53.2
IV Co-Scientist
Backbone LLM=QwQ
2026.02
12.51
54.1
IV Co-Scientist
Backbone LLM=Llama3.1 8b
2026.02
12.51
54.1
IV Co-Scientist
Backbone LLM=Llama3.1 70b
2026.02
11.9
53.8
IV Co-Scientist
Backbone LLM=o3-mini
2026.02
10.32
56.8
Feedback
Search any
task
Search any
task