Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAVEN

Benchmarks

Task NameDataset NameSOTA ResultTrend
Abstract Visual ReasoningRAVEN v1 (test)
Average Accuracy93.6
22
Relational ReasoningRAVEN (test)
Average Accuracy92.5
5
Abstract Visual ReasoningRaven
Accuracy34
3
Showing 3 of 3 rows