Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Star

Benchmarks

Task NameDataset NameSOTA ResultTrend
Oriented Object DetectionSTAR (test)
AP39.45
60
Video-based Question AnsweringSTAR
Accuracy78.6
50
Video Question AnsweringSTAR (test)
Interaction Score79.1
42
MAP InferenceSTAR-dataset
Runtime0.02
25
Video Question AnsweringSTAR (val)
Mean Score63.8
22
Video ReasoningSTAR
Score67.7
19
Oriented Object DetectionSTAR
AP5028.1
13
Statement RankingStaR Sports
Precision@55.547
12
Statement RankingStaR Beauty
Precision@56.2
12
Statement RankingStaR Clothes
Precision@515.136
12
Statement RankingStaR Toys
Precision@56.309
12
Selective Regressionstar (test)
Conditional Large-Loss Rate28.5
12
Video Question AnsweringSTAR v1.0 (test)
Interaction Accuracy73.7
10
Video Question AnsweringSTAR V (test)
Accuracy42.8
10
Video UnderstandingSTAR
Score58.77
7
Regressionstar (test)
Marginal Coverage91
7
Regressionstar 2161 (test outliers)
Mean Outlier Coverage88
7
Regressionstar
SMIS11.33
7
Regressionstar n=2161 (test)
ILR1
7
Video Question AnsweringSTAR zero-shot (test)
Interaction Score51.5
7
Link PredictionStar 1000 (test)
AUC100
5
ClassificationStar
AUC0.886
2
Showing 22 of 22 rows