Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Research Idea Evaluation on ScholarIdeas Neuroscience
Loading...
2.74
Coverage
ScholarEval GPT-4.1
1.7832
2.0316
2.28
2.5284
Oct 17, 2025
Coverage
Updated 3mo ago
Evaluation Results
Method
Method
Links
Coverage
ScholarEval GPT-4.1
Method Category=Schola...
2025.10
2.74
ScholarEval Claude
Method Category=Schola...
2025.10
2.55
DR Tulu
Method Category=Deep R...
2025.10
2.31
OpenAI Deep Research
Method Category=Deep R...
2025.10
2.25
GPT-4.1
Method Category=Langua...
2025.10
2.18
Claude-4-Sonnet
Method Category=Langua...
2025.10
2.15
ScholarEval GPT-5.1
Method Category=Schola...
2025.10
2.12
ScholarEval Llama
Method Category=Schola...
2025.10
2.05
GPT-5.1 Instant
Method Category=Langua...
2025.10
1.85
GPT-4o-search-preview
Method Category=Web-co...
2025.10
1.84
Llama-3.3-70B
Method Category=Langua...
2025.10
1.82
Feedback
Search any
task
Search any
task