Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ToM-BPD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Strategy PredictionToM-BPD (test)
Strategy Accuracy39.78
27
Belief PredictionToM-BPD (test)
Belief Accuracy54.64
27
Desire PredictionToM-BPD (test)
Desire Accuracy72.82
27
Persuasive DialogueToM-BPD interactive evaluation
Win Rate: Identification55.23
3
Showing 4 of 4 rows