
Uncertainty Estimation on Secret-word taboo dataset (random-split)

42.4 Accuracy

Bootstrap

May 25, 2026
Updated 7d ago

Evaluation Results

Date     Accuracy
2026.05  42.4      19.3  21.3  1.029  81
2026.05  41.9      33.4  30.3  2.636  78.4
2026.05  41.8      40.4  35.4  4.401  76.3
2026.05  41.8      40.8  37.1  4.673  73.7
2026.05  41.5      54.4  53.4  10.77  56.3
2026.05  41.5      49.3  46.1  7.823  66.9
2026.05  41.4      25.6  24.9  0.748  82.4
2026.05  41.4      25.5  24.6  0.73   84
2026.05  41.4      9.7   17.1  0.558  83
2026.05  41.4      58.2  58    12.66  51.6
2026.05  41.3      26.3  24.7  1.608  80.3
2026.05  40.3      54.7  53.2  10.28  57.9
2026.05  40.2      5.7   16.3  0.498  82.9
2026.05  38.7      7.6   17.1  0.519  82.3
2026.05  37.6      55.1  53.1  9.358  60.1
2026.05  36.7      8.3   17.3  0.522  82.4
2026.05  23.6      66.8  62    10.87  65.3
2026.05  23.4      13.1  15.7  0.484  84.2
2026.05  23.4      13.1  15.8  0.489  83.5
2026.05  23.4      74    72.9  15.61  52.8
2026.05  23.4      75.3  75.2  16.02  40.4
2026.05  23.3      47.9  36.9  2.367  83.7
2026.05  23.3      56.9  48.2  5.658  76.7
2026.05  23.2      50.1  41.3  4.172  72.5
2026.05  23.1      32.7  22.2  0.8    86.4
2026.05  23        21.8  15.7  0.514  86.3
2026.05  23        73.8  72.2  15.04  54.7
2026.05  22.8      39    27.4  1.383  85.8
2026.05  22.5      14.7  13    0.431  85.1
2026.05  21.1      74.6  72.2  13.97  57.4
2026.05  20.9      12.5  12.5  0.415  83.7
2026.05  20.6      10.3  12.9  0.422  81.2