Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Research Idea Generation on Expert Evaluation Biotech Domain (test)
Loading...
91.7
Novelty Winrate vs Qwen-14B
DeepInnovator
87.115
89.4075
91.7
93.9925
Feb 21, 2026
Novelty Winrate vs Qwen-14B
Novelty Winrate vs GPT-4o
Feasibility Winrate vs Qwen-14B
Feasibility Winrate vs GPT-4o
Effectiveness Winrate vs Qwen-14B
Effectiveness Winrate vs GPT-4o
Detailedness Winrate vs Qwen-14B
Detailedness Winrate vs GPT-4o
Updated 4d ago
Evaluation Results
Method
Method
Links
Novelty Winrate vs Qwen-14B
Novelty Winrate vs GPT-4o
Feasibility Winrate vs Qwen-14B
Feasibility Winrate vs GPT-4o
Effectiveness Winrate vs Qwen-14B
Effectiveness Winrate vs GPT-4o
Detailedness Winrate vs Qwen-14B
Detailedness Winrate vs GPT-4o
DeepInnovator
evaluation_mode=Human...
2026.02
91.7
61.5
100
38.5
58.3
53.8
90.9
35.7
Feedback
Search any
task
Search any
task