Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scientific Discovery on Scientific Discovery
Loading...
38.5
Param & Constraint Acc
CASTER
37.772
37.961
38.15
38.339
Jan 27, 2026
Param & Constraint Acc
Scientific Validity
Robustness
Code Quality
Updated 4d ago
Evaluation Results
Method
Method
Links
Param & Constraint Acc
Scientific Validity
Robustness
Code Quality
CASTER
2026.01
38.5
29.1
19.4
9.5
FrugalGPT
2026.01
37.8
28.1
19
9.1
Feedback
Search any
task
Search any
task