Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LAM Evaluation Benchmark

Benchmarks

Task NameDataset NameSOTA ResultTrend
Benchmark Subset SelectionLAM Evaluation Benchmark 40 tasks
Pearson Correlation0.977
60
Showing 1 of 1 rows