Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Research Idea Evaluation on ScholarIdeas Ecology
Loading...
2.9
Coverage
ScholarEval Claude
1.6936
2.0068
2.32
2.6332
Oct 17, 2025
Coverage
Updated 3mo ago
Evaluation Results
Method
Method
Links
Coverage
ScholarEval Claude
Method Category=Schola...
2025.10
2.9
ScholarEval GPT-4.1
Method Category=Schola...
2025.10
2.52
DR Tulu
Method Category=Deep R...
2025.10
2.39
OpenAI Deep Research
Method Category=Deep R...
2025.10
2.35
GPT-4.1
Method Category=Langua...
2025.10
2.13
Claude-4-Sonnet
Method Category=Langua...
2025.10
2.11
ScholarEval GPT-5.1
Method Category=Schola...
2025.10
2.09
GPT-5.1 Instant
Method Category=Langua...
2025.10
1.95
GPT-4o-search-preview
Method Category=Web-co...
2025.10
1.95
ScholarEval Llama
Method Category=Schola...
2025.10
1.94
Llama-3.3-70B
Method Category=Langua...
2025.10
1.74
Feedback
Search any
task
Search any
task