Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Data Science on Scaling Law Discovery domain_mix
Loading...
99.1
R2 Score
SIMPLETES
98.788
98.869
98.95
99.031
Apr 21, 2026
R2 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
R2 Score
SIMPLETES
Model=gpt-oss-120b
2026.04
99.1
SLDAgent
Model=GPT-5
2026.04
98.8
Feedback
Search any
task
Search any
task