Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Reasoning on MEDREASON
Loading...
16.4
Best-of-16 Delta (Δ)
Expert Reasoning Reward Model
4.128
7.314
10.5
13.686
Oct 2, 2025
Best-of-16 Delta (Δ)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Best-of-16 Delta (Δ)
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
16.4
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
9.5
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
7.8
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
5.7
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
5.1
Expert Reasoning Reward Model
Reward Model Backbone=...
2025.10
4.6
Feedback
Search any
task
Search any
task