Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Star

Benchmarks

Task NameDataset NameSOTA ResultTrend
Oriented Object DetectionSTAR (test)
AP39.45
60
Video Question AnsweringSTAR (test)
Interaction Score79.1
54
Video-based Question AnsweringSTAR
Accuracy78.6
50
Task#1STAR
Accuracy99.88
33
MAP InferenceSTAR-dataset
Runtime0.02
25
Video Question AnsweringSTAR (val)
Mean Score63.8
22
Video ReasoningSTAR
Score67.7
19
Astronomical Super-resolutionSTAR
PSNR34.66
16
Oriented Object DetectionSTAR
AP5028.1
13
Task#5STAR
Score82.66
12
Statement RankingStaR Sports
Precision@55.547
12
Statement RankingStaR Beauty
Precision@56.2
12
Statement RankingStaR Clothes
Precision@515.136
12
Statement RankingStaR Toys
Precision@56.309
12
Selective Regressionstar (test)
Conditional Large-Loss Rate28.5
12
Object DetectionSTAR
AP (Car)14.5
11
Video Question AnsweringSTAR v1.0 (test)
Interaction Accuracy73.7
10
Video Question AnsweringSTAR V (test)
Accuracy42.8
10
Conformal Predictionstar
MC (%)91.82
8
Task-Oriented DialogueSTAR
F1 Score68
7
Video UnderstandingSTAR
Score58.77
7
Regressionstar (test)
Marginal Coverage91
7
Regressionstar 2161 (test outliers)
Mean Outlier Coverage88
7
Regressionstar
SMIS11.33
7
Regressionstar n=2161 (test)
ILR1
7
Showing 25 of 41 rows