Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Regression on BPM Prediction
Loading...
10,000
Leaderboard Percentile
Hierarchical MCTS
-400
2,300
5,000
7,700
Nov 29, 2025
Leaderboard Percentile
Updated 4d ago
Evaluation Results
Method
Method
Links
Leaderboard Percentile
Hierarchical MCTS
Base LLM=GPT-4o
2025.11
10,000
LATS
Base LLM=GPT-4o
2025.11
5,263
MCTS-Shaped
Base LLM=GPT-4o
2025.11
526
MCTS-Shaped
Base Model=GPT-4.1-mini
2025.11
100
Hierarchical MCTS
Base Model=GPT-4.1-mini
2025.11
100
ReAct
Base LLM=GPT-4o
2025.11
51
ReAct
Base Model=GPT-4.1-mini
2025.11
0
LATS
Base Model=GPT-4.1-mini
2025.11
0
MCTS-Outcome
Base Model=GPT-4.1-mini
2025.11
0
MCTS-Outcome
Base LLM=GPT-4o
2025.11
0
Feedback
Search any
task
Search any
task