
DEMON Benchmark

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Multi-modal Evaluation | DEMON Benchmark zero-shot evaluation | Multi Modal Dialogue: 37.5 | 11