Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Random

Benchmarks

Task NameDataset NameSOTA ResultTrend
Variable Gapped Longest Common Subsequence ProblemRANDOM benchmark suite
Objective Score294.4
99
Multi-Agent Path Finding (MAPF)random 32x32-20
Success Rate100
83
Multi-Agent Path Finding (MAPF)random 64x64-20
Success Rate100
73
Adversarial AttackRandom-10M
Success Rate99.2
20
Adversarial AttackRandom-2M
Success Rate100
20
Adversarial AttackRandom-500k
Success Rate100
20
kNNRandom
QPS3,662,188
16
Robot NavigationRandom (Evaluation)
Goal Success Rate96.4
12
Trajectory ReconstructionRandom left out subjects (test)
MAE0.49
10
Generative Identity UnlearningRandom
Identity Distance (ID)0.6538
10
Image RetrievalRandom (test)
Recall@179.39
10
Path-followingRandom 1 cm, 1° 1.0 (test)
Success Rate99.3
8
RO reformulationRandom
Accuracy96.9
6
RO reformulationRandom In-Distribution
Accuracy97.4
6
Multi-Agent Path Findingrandom-32-32-20 # agents: 200
UA Conflicts6.74
6
Scene Graph ParsingRandom (test)
Set Match79.77
6
Compression CapacityRandom GloVe (test)
Max Tokens792
6
Lossless text compressionRandom
Compression Ratio (bits)2.83
5
Vector Retrievalrandom
Recall@1061
4
Transfer Learning (ip ← dk)Random Split
RMSE0.5257
4
Transfer Learning (hv ← kri)Random (Split)
RMSE0.908
4
Transfer Learning (hv ← ef)Random (Split)
RMSE0.8112
4
Transfer Learning (ip ← bp)Random Split
RMSE0.4631
4
Transfer Learning (hv ← ct)Random Split
RMSE0.9207
4
Transfer Learning (hv ← bp)Random Split
RMSE0.8267
4
Showing 25 of 33 rows