Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Faithfulness under retraining on Titanic
Loading...
13.988
AURC
AXIL
0.82784
4.24442
7.661
11.07758
Jan 5, 2023
AURC
Standard Error
Updated 5d ago
Evaluation Results
Method
Method
Links
AURC
Standard Error
AXIL
N=1,307, M=7
2023.01
13.988
3.24
TREX
N=1,307, M=7
2023.01
9.754
2.195
BoostIn
N=1,307, M=7
2023.01
7.158
1.504
LeafInf
N=1,307, M=7
2023.01
6.278
1.386
Random
N=1,307, M=7
2023.01
1.334
0.182
Feedback
Search any
task
Search any
task