Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adv

Benchmarks

Task NameDataset NameSOTA ResultTrend
Toxicity DetectionAdv
Accuracy59.74
42
Code RetrievalAdv
MRR48.6
9
NL2Code SearchAdv (test)
MRR57.27
7
Online Optimization with Long-Term ConstraintsADV constant rho
Metric-
0
Showing 4 of 4 rows