Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BFI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Persona ManipulationBFI (test)
Success Score95.58
72
Personality AssessmentBFI Self-report 44-item (test)
MAE0.606
36
Showing 2 of 2 rows