Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Personalization Benchmark

Benchmarks

Task NameDataset NameSOTA ResultTrend
CaptioningPersonalization Benchmark
Single Score83.5
23
MVQAPersonalization Benchmark
MVQA Score (Single)92
23
Short Text GenerationPersonalization Benchmark Short Text
R-10.157
4
Long Text GenerationPersonalization Benchmark Long Text
R-1 Score0.233
4
Showing 4 of 4 rows