Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MIA-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal Instruction FollowingMIA-Bench
Score8.86
12
Membership InferenceMIA-Bench 5% Forget Set
Average Performance69.6
12
Multimodal ReasoningMIA-Bench
Length (tokens)4,329.3
9
AlignmentMIA-Bench
Accuracy93.3
7
Multi-modal human-preference alignmentMIA-Bench
Score89.6
6
Visual Question AnsweringMIA-Bench
Accuracy68.8
4
Showing 6 of 6 rows