Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PERSONALITYBENCH

Benchmarks

Task NameDataset NameSOTA ResultTrend
Personality controlPERSONALITYBENCH
Score Variance0.1
21
Personality ExpressionPERSONALITYBENCH (test)
Agreeableness Score4.92
14
Personality performance evaluationPERSONALITYBENCH
Agreeableness Mean Score9.96
12
Showing 3 of 3 rows