Data Science on Scaling Law Discovery u_shape
Top R2 Score: -0.008 (SIMPLETES)
[Figure: R2 Score chart, series SIMPLETES, Apr 21, 2026]
Updated 1mo ago
Evaluation Results
Method      Model          Date      R2 Score
SIMPLETES   gpt-oss-120b   2026.04   -0.008
SLDAgent    GPT-5          2026.04   -0.305