
L3

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Action-component payoff optimization | L3 warmup (off-diagonal) | Per-Interaction Payoff: 4.17 | 8
Semantic Reasoning | L3 (out-of-distribution) | Accuracy (Series Comparison): 67 | 2