Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Sampling on ManyWell-32 (N=10 chains)
Loading...
5,704
R Statistic
Diff-APT
1,395.28
2,513.89
3,632.5
4,751.11
Feb 14, 2025
R Statistic
CN-R Score
Updated 23d ago
Evaluation Results
Method
Method
Links
R Statistic
CN-R Score
Diff-APT
K=5, Neural Calls=6, I...
2025.02
5,704
950.7
CMCD-APT
K=5, Neural Calls=6, I...
2025.02
4,790
798.3
Diff-APT
K=2, Neural Calls=3, I...
2025.02
4,022
1,340.7
CMCD-APT
K=2, Neural Calls=3, I...
2025.02
3,640
1,213.3
CMCD-APT
K=1, Neural Calls=2, I...
2025.02
2,802
1,401
Diff-APT
K=1, Neural Calls=2, I...
2025.02
2,402
1,201
PT
Neural Calls=0, Iterat...
2025.02
1,879
939.5
Diff-PT
K=0, Neural Calls=2, I...
2025.02
1,561
780.5
Feedback
Search any
task
Search any
task