Abstract

Benchmarks

Task Name	Dataset Name	SOTA Result
Image Captioning Evaluation	Abstract-50S	Mean Accuracy76.9	4
Retrieval-Augmented Generation	Abstract single	F1 Score29	3
Concept Erasure Attack	Abstract	LPIPS0.44	3
Affective Reasoning and Emotion Prediction	Abstract	Sample Mean Dice0.8828	1

Showing 4 of 4 rows