
RAVEN

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Abstract Visual Reasoning | Raven | Accuracy: 98.7 | 25
Abstract Visual Reasoning | RAVEN v1 (test) | Average Accuracy: 93.6 | 22
Relational Reasoning | RAVEN (test) | Average Accuracy: 92.5 | 5