Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Embodied vision-language reasoning | Original benchmark B | Score: 61.26 | 13
Video Customization | 70-example benchmark 1.0 (test) | FaceSim Arc: 0.59 | 9
Class-conditional video generation | Benchmark 17x256x256 resolution (test) | gFVD: 210.9 | 9
Theorem Proving | Small-scale benchmark Overall | VR: 33 | 8
Text-driven Style Transfer | Benchmark of 52 prompts and 20 style images 1.0 (test) | Text Alignment: 0.235 | 8
Intent Classification | Benchmark 03 | In-Scope Accuracy: 84 | 8
Image Classification | 8-task benchmark | ID Score: 94.8 | 6
Robot parameter extraction and forward kinematics calculation | Benchmark 1 (test) | M_C (Completeness/Score): 97 | 6
3D face reconstruction | benchmark High-Quality (HQ) 1.0 | Median Error (mm): 1.58 | 6
Electric Vehicle Routing Problem (ECVRP) | benchmark Small Instances | Objective Value: 263.33 | 5
Speculative Decoding | Benchmark Second Turn | Block Efficiency: 2.32 | 5
Speculative Decoding | Benchmark First Turn | Block Efficiency: 2.32 | 5
Object Detection | 100-image benchmark Brighten | AFFC: 1 | 4
Object Detection | 100-image benchmark Snow | AFFC: 0.365 | 4
Object Detection | 100-image benchmark Rain | AFFC: 62.1 | 4
Object Detection | 100-image benchmark Fog | AFFC: 0.7 | 4
Large Model Performance Prediction | Benchmark Chinese pattern shift | RMSE: 16.94 | 3
Large Model Performance Prediction | Benchmark OCR pattern shift | RMSE: 25.18 | 3
Visual Forward Kinematics | Benchmark 10 visual problem instances 2 1.0 (test) | Consistency Score: 93 | 2
Entity Linking | Benchmark Skills | Top-1 Accuracy: 39.69 | 2
Protein-Ligand Binding Affinity Prediction | benchmark1k2101 (test) | Correlation (R): 0.883 | 1
Showing 21 of 21 rows