Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GROVE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Behavior PredictionGROVE TechCrunch 1.0
Average Performance Score85.7
5
Behavior PredictionGROVE Full Wikipedia domains 1.0
Accuracy (Aero.)70.7
5
Behavior PredictionGROVE Wikipedia 1.0 (P2 & P3)
Accuracy (Aero.)76
3
Showing 3 of 3 rows