Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DynSess

Benchmarks

Task NameDataset NameSOTA ResultTrend
Role-playing evaluationDynSess-Eval
Average Performance (Auto)4.35
8
Showing 1 of 1 rows