Regression on Santander Value Prediction Challenge
[Chart: Leaderboard Percentile over time. Top entry: MCTS-Shaped, 1,487, Nov 29, 2025]
Evaluation Results
Method              Setting                   Date      Leaderboard Percentile
MCTS-Shaped         Base LLM=GPT-4o           2025.11   1,487
LATS                Base LLM=GPT-4o           2025.11   717
MCTS-Outcome        Base LLM=GPT-4o           2025.11   712
Hierarchical MCTS   Base LLM=GPT-4o           2025.11   27
MCTS-Shaped         Base Model=GPT-4.1-mini   2025.11   14.24
Hierarchical MCTS   Base Model=GPT-4.1-mini   2025.11   14.24
ReAct               Base LLM=GPT-4o           2025.11   9
ReAct               Base Model=GPT-4.1-mini   2025.11   3.6
LATS                Base Model=GPT-4.1-mini   2025.11   0
MCTS-Outcome        Base Model=GPT-4.1-mini   2025.11   0