Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FDA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long-context language modeling evaluationFDA (test)
Score0.8004
120
Dynamic Multi-objective OptimizationFDA 2
Maximum Hypervolume (MHV)2
15
In-context retrievalFDA
Accuracy74.5
13
Showing 3 of 3 rows