Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Experiment

Benchmarks

Task NameDataset NameSOTA ResultTrend
Annotation AccuracyExperiment 1 (test)
F1 Score (Ga)100
40
Touch Pointing Parameter EstimationExperiment 3 (Leave-One-Out Cross-Validation)
R20.948
22
Touch Pointing Parameter EstimationExperiment 3 Full Aggregate Data (train)
R20.991
22
Success Rate and Distribution Parameter RegressionExperiment 2 (LOOCV)
R2 Score0.947
14
Success Rate and Distribution Parameter RegressionExperiment 2
R20.999
14
Success Rate PredictionExperiment 1 (LOOCV)
R22.64
10
Success Rate PredictionExperiment 1
R22.65
10
Trajectory PlanningExperiment Simulation 1
Trajectory Time (s)4.46
7
Causal Graph Metric Agreement AnalysisExperiment 3 within-n centered
Pearson Correlation0.885
4
Debate GenerationExperiment 1 Input Set
Choose Rate76.32
4
Circuit board assemblyExperiment 4.4.3
Total Assembly Time (s)26.7
3
Gear assemblyExperiment 4.4.2
Total Assembly Time63.5
3
Robotic task executionExperiment 4.4.1
Task Completion Time (s)6.56
3
Interaction controlExperiment 4.2.2
Contact established (s)1.4
3
Interaction controlExperiment 4.2.1
Contact Establishment Time0.11
3
PossessionExperiment (iii) (test)
Strict Similarity35.3
3
One-to-One CorrespondenceExperiment (ii) (test)
Strict Similarity0.379
3
Concurrent ExistenceExperiment (i) (test)
Strict Similarity34.8
3
Trajectory trackingExperiment 4.3
RMSE Position (m)0
2
Standard Deviation PredictionExperiment 1 (LOOCV)
R20.851
1
Skewness PredictionExperiment 1 (LOOCV)
R20.761
1
Skewness PredictionExperiment 1 (Regression Analysis)
R20.789
1
Force control bandwidth measurementExperiment 4.1
Improvement Factor vs LF-based1.13
1
Best Arm IdentificationExperiment E
Metric-
0
Showing 24 of 24 rows