Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Honesty Alignment on HonestyBench In-Domain

85.16NQ Score

EliCal

55.655263.315170.97578.6349Oct 20, 2025
Updated 3mo ago

Evaluation Results

MethodLinks
2025.10
85.1689.0986.0984.1988.8986.49
2025.10
84.8988.9685.6483.9788.0786.2
2025.10
82.3887.5184.4882.0584.3184.36
2025.10
80.6890.280.1255.462.9373.62
2025.10
77.8686.2377.2754.3662.0571.19
2025.10
72.1968.7574.3476.1778.6173.41
2025.10
68.8262.3570.5373.2471.568.9
2025.10
66.1172.9661.9659.3361.6764.75
2025.10
65.0274.9868.9867.8266.3569.8
2025.10
64.0270.2266.4965.0270.8167.22
2025.10
61.6972.5466.3965.0371.0667.7
2025.10
58.1563.3858.0855.2467.5859.48
2025.10
56.7970.2654.2941.7358.7155.48