Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BOLT SMS

Benchmarks

Task NameDataset NameSOTA ResultTrend
DisclosureBOLT SMS single-turn
T-Statistic66.34
28
Emotional SupportBOLT SMS single-turn
T-Statistic66.45
28
DisclosureBOLT SMS multi-turn
M_steer5
28
Emotional SupportBOLT SMS multi-turn
M_steer Score0.05
28
DisclosureBOLT SMS single-turn (test)
Chi-Squared (χ2)0.41
14
Emotional SupportBOLT SMS single-turn (test)
Chi-squared (χ2)948.7
14
DisclosureBOLT SMS multi-turn (test)
Chi-Squared Statistic0.01
14
Emotional SupportBOLT SMS multi-turn (test)
Chi-Squared0.03
14
Showing 8 of 8 rows