
General

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| General Alignment | General Alignment and Coherence | Alignment Score: 93.52 | 18 |
| Machine Translation | general 2023 (test) | BLEU: 32.64 | 16 |
| Image Manipulation Detection | General Inference Speed Evaluation Images | FPS: 31.7 | 16 |
| Instance Erasure | General | FID (General): 13.24 | 13 |
| Stability | General (MMLU, BBH, TyDiQA, BoolQ, PIQA, GSM8K) | General Score: 55.75 | 9 |
| Video Compression | General | Parameters (M): 18.34 | 9 |
| Segmentation | General Efficiency Evaluation | Latency (ms): 7.3 | 9 |
| Underwater Image Enhancement | General Architectural Comparison 1.0 (UEIB-T90) | PSNR: 22.82 | 8 |
| General Vision-Language Understanding | General | Avg Score: 72.4 | 8 |
| Average evaluation across 7 tasks | General (test) | BERTScore: 76.5 | 8 |
| Colon Polyp Segmentation | General | Parameters (M): 32.55 | 8 |
| Computational Complexity Analysis | General Model Complexity | Parameters: 91,371 | 7 |
| 360-degree video saliency prediction | General | Params (M): 3.7 | 7 |
| Circuit Localization | General | CPR: 2.13 | 6 |
| Model Efficiency Analysis | General 16 frames, 512 text tokens (inference) | FPS: 20.74 | 6 |
| Interactive Segmentation | General Efficiency Benchmarking | Parameters (MB): 84.89 | 6 |
| Optical Flow Estimation | General Architecture Evaluation | Parameters (M): 0.074 | 5 |
| Optimizer Property Comparison | General Theoretical Analysis | FLOPs per Step: 1 | 5 |
| Novel View Synthesis | General | MFLOPs / Pixel: 13.77 | 5 |
| Ending event prediction | General (test) | MRR: 0.401 | 5 |
| Speech Recognition | General Throughput Evaluation | Throughput (tokens/s): 168.9 | 4 |
| Distributed Optimization | General First-order optimization setting | Metric: - | 0 |
| Automated Feature Engineering | General | Metric: - | 0 |
| Vulnerability Analysis | General | Metric: - | 0 |
| 3D Scene Decomposition Capability Assessment | General Method Capability Comparison | Metric: - | 0 |
Showing 25 of 26 rows